LLM Name | Llama 3.2 3B 8bit |
Repository ๐ค | https://huggingface.co/mlx-community/Llama-3.2-3B-8bit |
Model Size | 3b |
Required VRAM | 3.4 GB |
Updated | 2024-10-11 |
Maintainer | mlx-community |
Model Type | llama |
Model Files | |
Supported Languages | en de fr it pt hi es th |
Quantization Type | 8bit |
Model Architecture | LlamaForCausalLM |
License | llama3.2 |
Context Length | 131072 |
Model Max Length | 131072 |
Transformers Version | 4.45.0.dev0 |
Tokenizer Class | PreTrainedTokenizerFast |
Vocabulary Size | 128256 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Llama 3.2 3B Instruct Bnb 4bit | 128K / 2.2 GB | 82167 | 8 |
Llama 3.2 3B Bnb 4bit | 128K / 2.2 GB | 18681 | 2 |
Llama 3.2 3B Instruct 4bit | 128K / 1.8 GB | 2375 | 6 |
...nstruct Medical Conversational | 128K / 6.5 GB | 231 | 3 |
Llama3.2 3B 4bit | 128K / 2.2 GB | 43 | 0 |
FineTome Llama3.2 3B 1002 | 128K / 6.5 GB | 39 | 1 |
Llama3 Finetuned For Civivox | 128K / 6.5 GB | 21 | 0 |
...3b V2 Python Instruct 0.1 4bit | 8K / 2.5 GB | 5 | 0 |
...ma 3b V2 Python Instruct 0.1.2 | 8K / 6.9 GB | 6 | 0 |
PRIME 3B | 8K / 5.8 GB | 4 | 1 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐