LLM Name | Llama 2 7B Chat 4bit Gptq |
Repository 🤗 | https://huggingface.co/hoang1123/Llama-2-7b-chat-4bit-gptq |
Base Model(s) | |
Model Size | 7B |
Required VRAM | 3.9 GB |
Updated | 2025-02-22 |
Maintainer | hoang1123 |
Model Type | llama |
Model Files | |
GPTQ Quantization | Yes |
Quantization Type | gptq, 4bit |
Model Architecture | LlamaForCausalLM |
Context Length | 4096 |
Model Max Length | 4096 |
Transformers Version | 4.39.3 |
Tokenizer Class | LlamaTokenizer |
Padding Token | <unk> |
Vocabulary Size | 32000 |
Torch Data Type | float16 |
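
The Required VRAM figure above can be sanity-checked with simple arithmetic. The sketch below is a rough estimate, not the method the listing uses: it assumes the nominal 7B parameter count from the Model Size row and counts only the bytes needed to store the weights. The helper name `weight_footprint_gb` is hypothetical. The 4-bit result is a lower bound. The listed 3.9 GB is somewhat higher, which would be consistent with extra storage such as quantization metadata and any layers kept in float16, though the table does not say what it includes.

```python
def weight_footprint_gb(n_params: float, bits_per_weight: int) -> float:
    """Return the storage needed for the weights alone, in decimal gigabytes."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: nominal parameter count taken from the "Model Size | 7B" row.
N_PARAMS = 7e9

fp16_gb = weight_footprint_gb(N_PARAMS, 16)  # unquantized float16 weights
gptq4_gb = weight_footprint_gb(N_PARAMS, 4)  # 4-bit quantized weights

print(f"float16 weights: {fp16_gb:.1f} GB")
print(f"4-bit weights:   {gptq4_gb:.1f} GB (lower bound, excludes overhead)")
```

Activations and the key-value cache for a 4096-token context add to this at inference time, so actual usage depends on batch size and prompt length.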
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Yarn Llama 2 7B 128K GPTQ | 128K / 3.9 GB | 80 | 7 |
Yarn Llama 2 7B 64K GPTQ | 64K / 3.9 GB | 44 | 1 |
... 7B 32K Instructions V4 Marlin | 32K / 4.1 GB | 6 | 0 |
Aixcoder 7B GPTQ | 32K / 4.5 GB | 77 | 1 |
Calm2 7B Chat GPTQ | 32K / 4.4 GB | 71 | 6 |
...Calm2 7B Chat GPTQ Calib Ja 1K | 32K / 4.4 GB | 18 | 5 |
Llama 2 7B 32K Instruct GPTQ | 32K / 3.9 GB | 57 | 27 |
Codebear 7B 4bit | 16K / 3.9 GB | 4 | 1 |
...a 7B Instruct GPTQ Calib Ja 1K | 16K / 3.9 GB | 12 | 0 |
CodeLlama 7B Instruct GPTQ | 16K / 3.9 GB | 592 | 46 |