LLM Name | Llama 3.2 3B VanRossum |
Repository 🤗 | https://huggingface.co/theprint/Llama-3.2-3B-VanRossum |
Base Model(s) | |
Model Size | 3B |
Required VRAM | 6.5 GB |
Updated | 2025-02-15 |
Maintainer | theprint |
Instruction-Based | Yes |
Model Files | |
Supported Languages | en |
GGUF Quantization | Yes |
Quantization Type | 4bit, gguf |
Model Architecture | AutoModel |
License | apache-2.0 |
Model Max Length | 131072 |
Is Biased | none |
Tokenizer Class | PreTrainedTokenizerFast |
Padding Token | <|finetune_right_pad_id|> |
PEFT Type | LORA |
LoRA Model | Yes |
PEFT Target Modules | o_proj, gate_proj, up_proj, k_proj, q_proj, v_proj, down_proj |
LoRA Alpha | 16 |
LoRA Dropout | 0 |
R Param | 8 |
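
The LoRA fields above (R Param 8, LoRA Alpha 16, LoRA Dropout 0, and the seven target modules) can be read together as one adapter configuration. The sketch below is a minimal plain-Python illustration, not the maintainer's training code: `LoraSettings` and `adapter_params` are hypothetical names introduced here, and the 1024 x 1024 layer shape is an arbitrary example rather than a dimension of this model. It assumes the standard LoRA convention in which the low-rank update is scaled by alpha / r.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class LoraSettings:
    """Hypothetical container for the adapter values listed in the table."""
    r: int
    alpha: int
    dropout: float
    target_modules: Tuple[str, ...]

    @property
    def scaling(self) -> float:
        # Standard LoRA convention (assumed here): the update B @ A is scaled by alpha / r.
        return self.alpha / self.r

    def adapter_params(self, d_in: int, d_out: int) -> int:
        # A has shape (r, d_in) and B has shape (d_out, r),
        # so one adapted linear layer adds r * (d_in + d_out) weights.
        return self.r * (d_in + d_out)


# Values copied from the table: R Param, LoRA Alpha, LoRA Dropout, PEFT Target Modules.
settings = LoraSettings(
    r=8,
    alpha=16,
    dropout=0.0,
    target_modules=(
        "q_proj", "k_proj", "v_proj", "o_proj",
        "gate_proj", "up_proj", "down_proj",
    ),
)

print(settings.scaling)                     # 2.0
# Illustrative layer shape only; not taken from this model's config.
print(settings.adapter_params(1024, 1024))  # 16384
```

With rank 8 and alpha 16 the scaling factor works out to 2.0, and the per-layer adapter cost grows linearly with the rank, which is why low ranks keep LoRA adapters small relative to the base weights.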
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
MedQwen3B Reasoner | 0K / 6.2 GB | 701 | 8 |
...a Song Stream 3B Instruct GGUF | 0K / 2 GB | 312 | 11 |
QwQ LCoT 3B Instruct GGUF | 0K / 1.9 GB | 278 | 12 |
...ma Magpie 3.2 3B Instruct GGUF | 0K / 2 GB | 339 | 8 |
...ma Doctor 3.2 3B Instruct GGUF | 0K / 2 GB | 387 | 11 |
... Sentient 3.2 3B Instruct GGUF | 0K / 2 GB | 233 | 11 |
Llama 3.2 3B Instruct GGUF | 0K / 2 GB | 242 | 8 |
Granite 3B Code Instruct GGUF | 0K / 1.3 GB | 10600 | 1 |
Phi 3 Phituguese 3B Gguf 16bit | 0K / 7.6 GB | 33 | 1 |
Phi 3 Phituguese 3B Q4 K M | 0K / 2.3 GB | 36 | 1 |