LLM Name | Internlm3 8B Instruct Gptq Int4 |
Repository 🤗 | https://huggingface.co/internlm/internlm3-8b-instruct-gptq-int4 |
Model Size | 8b |
Required VRAM | 6.1 GB |
Updated | 2025-02-22 |
Maintainer | internlm |
Model Type | internlm3 |
Instruction-Based | Yes |
Model Files | |
Supported Languages | en zh |
GPTQ Quantization | Yes |
Quantization Type | gptq, 4bit |
Model Architecture | InternLM3ForCausalLM |
License | apache-2.0 |
Context Length | 32768 |
Model Max Length | 32768 |
Transformers Version | 4.48.0.dev0 |
Is Biased | 0 |
Tokenizer Class | InternLM3Tokenizer |
Padding Token | </s> |
Vocabulary Size | 128512 |
Torch Data Type | bfloat16 |
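
A minimal loading sketch for this checkpoint with 🤗 Transformers, based on the repository and dtype listed above. It assumes a GPTQ-capable backend (e.g. `auto-gptq` or `gptqmodel`) is installed and that `trust_remote_code` is needed for the custom InternLM3 classes; the prompt and generation arguments are illustrative, not taken from the model card.

```python
# Sketch: load internlm/internlm3-8b-instruct-gptq-int4 (assumes a GPTQ backend is installed).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "internlm/internlm3-8b-instruct-gptq-int4"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",       # place the ~6.1 GB of quantized weights on the GPU
    torch_dtype="auto",      # card lists bfloat16 as the torch data type
    trust_remote_code=True,  # custom InternLM3 modeling code lives in the repo
)

# Illustrative chat-style generation using the tokenizer's chat template.
messages = [{"role": "user", "content": "Introduce yourself in one sentence."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```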
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Internlm3 8B Instruct | 32K / 17.6 GB | 33373 | 203 |
Internlm3 8B Instruct AWQ | 32K / 6.2 GB | 396 | 3 |
...3 8B Instruct Smoothquant Int8 | 32K / 9.9 GB | 41 | 4 |
...ernlm3 8B Instruct Abliterated | 32K / 17.6 GB | 46 | 3 |