LLM Name | Internlm Chat 7B 4bit Gptq |
Repository | https://huggingface.co/cczhong/internlm-chat-7b-4bit-gptq |
Model Size | 7B |
Required VRAM | 5.1 GB |
Updated | 2025-02-05 |
Maintainer | cczhong |
Model Type | internlm |
GPTQ Quantization | Yes |
Quantization Type | gptq, 4bit |
Model Architecture | InternLMForCausalLM |
Context Length | 2048 |
Model Max Length | 2048 |
Transformers Version | 4.31.0.dev0 |
Is Biased | 1 |
Tokenizer Class | InternLMTokenizer |
Padding Token | </s> |
Vocabulary Size | 103168 |
Torch Data Type | float16 |
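
The gap between the 5.1 GB listed under Required VRAM and the roughly 14.6 GB listed for the unquantized InternLM 7B checkpoints follows from bits per weight. The sketch below is a back-of-the-envelope estimate, not the site's own formula. It assumes "7B" means exactly 7 billion parameters and counts raw weights only, so it gives a lower bound. The helper name `weight_footprint_gb` is hypothetical.

```python
def weight_footprint_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate size of the raw weights in decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bits_per_weight / 8 / 1e9


# Assumption: "7B" is taken as exactly 7 billion parameters.
N_PARAMS = 7e9

fp16_gb = weight_footprint_gb(N_PARAMS, 16)  # float16 stores 16 bits per weight
int4_gb = weight_footprint_gb(N_PARAMS, 4)   # 4-bit GPTQ stores 4 bits per quantized weight

print(f"float16 weights: {fp16_gb:.1f} GB")
print(f"4-bit weights:   {int4_gb:.1f} GB")
```

Both listed figures sit above these bounds. A real checkpoint has more than the nominal parameter count once the embedding table is included, and a GPTQ checkpoint also stores per-group scales and zero points and typically leaves some layers unquantized.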
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
...m Chat 7B 4bit Gptq Safetensor | 2K / 8.2 GB | 5 | 3 |
Agent FLAN 7B | 8K / 13.5 GB | 14 | 19 |
Internlm Chat 7B | 2K / 14.6 GB | 18170 | 101 |
CFGPT1 Sft 7B Full | 2K / 14.8 GB | 92 | 1 |
CFGPT1 Pt 7B | 2K / 14.8 GB | 62 | 1 |
Internlm 7B | 2K / 14.6 GB | 2090 | 93 |
Firefly Internlm 7B | 2K / 14.7 GB | 12 | 3 |
Internlm Chat 7B W4 | 2K / 5.1 GB | 158 | 3 |