LLM Name | Llama 3.2 3B Instruct 4bit |
Repository 🤗 | https://huggingface.co/mlx-community/Llama-3.2-3B-Instruct-4bit |
Base Model(s) | |
Model Size | 3b |
Required VRAM | 1.8 GB |
Updated | 2024-12-21 |
Maintainer | mlx-community |
Model Type | llama |
Instruction-Based | Yes |
Model Files | |
Supported Languages | en de fr it pt hi es th |
Quantization Type | 4bit |
Model Architecture | LlamaForCausalLM |
License | llama3.2 |
Context Length | 131072 |
Model Max Length | 131072 |
Transformers Version | 4.45.0.dev0 |
Tokenizer Class | PreTrainedTokenizerFast |
Vocabulary Size | 128256 |
Torch Data Type | bfloat16 |
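
The 1.8 GB VRAM figure above can be roughly sanity-checked from the model size and quantization type. The sketch below is a back-of-the-envelope estimate, not the method the listing used. It assumes about 3.21 billion parameters, and group-wise 4-bit quantization with a group size of 64 and a 16-bit scale plus 16-bit bias stored per group. None of those values appear in the listing, and the helper name `estimate_weight_memory_gb` is hypothetical.

```python
def estimate_weight_memory_gb(n_params, bits=4, group_size=64, overhead_bits_per_group=32):
    """Rough weight-memory estimate for group-wise quantization.

    All defaults are assumptions for illustration: 4-bit weights, groups of
    64 weights, and 32 extra bits per group (a 16-bit scale and a 16-bit bias).
    Activations and the KV cache are not counted.
    """
    bits_per_weight = bits + overhead_bits_per_group / group_size
    return n_params * bits_per_weight / 8 / 1e9  # bits -> bytes -> GB (10^9 bytes)


# Assumed parameter count (about 3.21 billion); not stated in the listing.
est = estimate_weight_memory_gb(3.21e9)
print(f"Estimated weight memory: {est:.2f} GB")
```

Under these assumptions the estimate lands close to the listed 1.8 GB. Actual usage grows with context length, because the KV cache is not included here.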
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Llama 3.2 3B Instruct Bnb 4bit | 128K / 2.2 GB | 211165 | 14 |
Komodo Llama 3.2 3B V2 Fp16 | 128K / 6.5 GB | 641 | 5 |
Gladiator Mini Exp 1211 3B | 128K / 6.5 GB | 17 | 0 |
Reasoning Llama 3B V0.1 | 128K / 6.5 GB | 395 | 9 |
Llama 3.2 3B Overthinker | 128K / 6.5 GB | 168 | 19 |
Llama Sentient 3.2 3B Instruct | 128K / 6.5 GB | 394 | 10 |
Llama Magpie 3.2 3B Instruct | 128K / 6.5 GB | 195 | 7 |
...truct Gptqmodel 4bit Vortex V3 | 128K / 3.2 GB | 1612 | 4 |
Llama 3.2 3B Fluxed | 128K / 6.5 GB | 460 | 0 |
Llama 3.2 3B Promptist Mini | 128K / 6.5 GB | 87 | 8 |