LLM Name | Llama 3.2 3B Instruct 4bit |
Repository ๐ค | https://huggingface.co/mlx-community/Llama-3.2-3B-Instruct-4bit |
Base Model(s) | |
Model Size | 3b |
Required VRAM | 1.8 GB |
Updated | 2024-10-04 |
Maintainer | mlx-community |
Model Type | llama |
Instruction-Based | Yes |
Model Files | |
Supported Languages | en de fr it pt hi es th |
Quantization Type | 4bit |
Model Architecture | LlamaForCausalLM |
License | llama3.2 |
Context Length | 131072 |
Model Max Length | 131072 |
Transformers Version | 4.45.0.dev0 |
Tokenizer Class | PreTrainedTokenizerFast |
Vocabulary Size | 128256 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Llama 3.2 3B Instruct Bnb 4bit | 128K / 2.2 GB | 25021 | 7 |
...3b V2 Python Instruct 0.1 4bit | 8K / 2.5 GB | 5 | 0 |
...ma 3b V2 Python Instruct 0.1.2 | 8K / 6.9 GB | 5 | 0 |
PRIME 3B | 8K / 5.8 GB | 4 | 1 |
... Granite 4bit 3B Code Instruct | 2K / 2 GB | 5 | 0 |
... Granite 8bit 3B Code Instruct | 2K / 3.6 GB | 5 | 0 |
Llama 3.2 3B Instruct | 128K / 6.5 GB | 82474 | 249 |
Llama 3.2 3B Instruct | 128K / 6.4 GB | 6902 | 9 |
Llama3.2 3B Enigma | 128K / 12.8 GB | 49 | 5 |
Llama 3.2 3B Instruct | 128K / 6.5 GB | 418 | 2 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐