LLM Name | Baichuan2 13B Chat Gptq 32g Act
Repository 🤗 | https://huggingface.co/yfshi123/baichuan2-13b-chat-gptq-32g-act
Model Size | 13b |
Required VRAM | 9.9 GB |
Updated | 2025-04-29 |
Maintainer | yfshi123 |
Model Type | baichuan |
Model Files | |
GPTQ Quantization | Yes |
Quantization Type | gptq / 4bit
Model Architecture | BaichuanForCausalLM |
Model Max Length | 4096 |
Transformers Version | 4.33.0 |
Vocabulary Size | 125696 |
Torch Data Type | float16 |
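Since this is a 4-bit GPTQ export of Baichuan2-13B-Chat (group size 32, act-order), it is normally loaded through a GPTQ-aware loader rather than as a plain float16 checkpoint. The snippet below is a minimal sketch of that route using AutoGPTQ; the loader choice, prompt, and generation settings are illustrative assumptions, not steps documented by the maintainer, so check the repository README for the recommended workflow.

```python
# Minimal sketch: loading the GPTQ checkpoint with AutoGPTQ.
# The repo id matches the listing above; the device, prompt, and
# generation settings are illustrative assumptions.
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

repo = "yfshi123/baichuan2-13b-chat-gptq-32g-act"

# Baichuan ships custom modeling/tokenizer code, hence trust_remote_code.
tokenizer = AutoTokenizer.from_pretrained(repo, use_fast=False, trust_remote_code=True)
model = AutoGPTQForCausalLM.from_quantized(
    repo,
    device="cuda:0",        # the listing reports ~9.9 GB of VRAM required
    trust_remote_code=True,
)

prompt = "Briefly explain GPTQ quantization."
inputs = tokenizer(prompt, return_tensors="pt").to("cuda:0")
outputs = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```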
Best Alternatives | Context / RAM | Downloads | Likes
---|---|---|---
Sakura 13B LNovel V0.8 3bit | 0K / 7.5 GB | 10 | 1 |
Sakura 13B LNovel V0.8 8bit | 0K / 9.1 GB | 10 | 1 |
Sakura 13B LNovel V0.8 4bit | 0K / 9.1 GB | 3 | 2 |
Baichuan2 13B Chat GPTQ | 0K / 9.1 GB | 104 | 20 |
Baichuan2 13B Chat GPTQ Int4 | 0K / 9.1 GB | 10 | 2 |
Baichuan 13B Instruction GPTQ | 0K / 7.9 GB | 20 | 4 |
Baichuan 13B Chat 8bit | 0K / 14.1 GB | 15 | 9 |
Tiny Random Baichuan2 13B | 0K / 0.1 GB | 66461 | 0 |
Baichuan2 13B Chat | 0K / 27.8 GB | 6369 | 424 |
Baichuan 13B Chat | 0K / 26.5 GB | 3356 | 631 |