Llama 3.2 3B by meta-llama: Benchmarks, Features, and Detailed Analysis

Model Card on HF 🤗: https://huggingface.co/meta-llama/Llama-3.2-3B

Llama 3.2 3B Benchmarks

MMLU Pro: 16.53

GPQA: 2.35

MUSR: 3.81

BBH: 14.23

IFEval: 13.37 (vs. 88 for so35: -84.8%)

MATH Lvl 5: 1.89

LLM Explorer (LLME) Score: 0.37507

The percentage after a reference score shows how the model compares with that reference model: Anthropic Claude 3.5 Sonnet ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
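The IFEval comparison figure is consistent with a plain relative difference against the reference score. The sketch below reproduces it under that assumption; the function name `relative_difference` is hypothetical, and the page does not document its exact formula.

```python
def relative_difference(score: float, reference: float) -> float:
    """Return the percent difference of `score` relative to `reference`."""
    if reference == 0:
        raise ValueError("reference score must be non-zero")
    return (score - reference) / reference * 100.0


# IFEval values from the benchmark list: 13.37 for this model, 88 for so35.
ifeval_delta = relative_difference(13.37, 88.0)
print(f"{ifeval_delta:+.1f}%")  # prints -84.8%
```

A negative value means the model scores below the reference; here it reaches roughly 15% of the reference score.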


Llama 3.2 3B Parameters and Internals

Model Type

text generation, text summarization, multilingual

Use Cases

Areas:

commercial, research

Applications:

knowledge retrieval, summarization, writing assistants, query and prompt rewriting

Primary Use Cases:

assistant chat, agentic applications

Limitations:

Use in languages beyond those supported without complying with the license, violations of the Acceptable Use Policy, illegal activities

Considerations:

Encourage responsible use.

Additional Notes

Initial release for commercial and research purposes.

Supported Languages

en (English), de (German), fr (French), it (Italian), pt (Portuguese), hi (Hindi), es (Spanish), th (Thai)

Training Details

Data Volume:

9 trillion tokens

Methodology:

Supervised Fine-Tuning (SFT), Rejection Sampling (RS), Direct Preference Optimization (DPO).

Context Length:

128K tokens (131072)

Training Time:

916k GPU hours

Hardware Used:

H100-80GB GPUs

Model Architecture:

Autoregressive language model built on an optimized transformer architecture.

Safety Evaluation

Methodologies:

Red teaming, Safety fine-tuning

Findings:

N/A

Risk Categories:

CBRNE (chemical, biological, radiological, nuclear, and explosive weapons)

Responsible AI Considerations

Mitigation Strategies:

System safeguards include Llama Guard, Prompt Guard, and Code Shield.

Input and Output

Input Format:

Multilingual text

Accepted Modalities:

text

Output Format:

Multilingual text and code

LLM Name	Llama 3.2 3B
Repository 🤗	https://huggingface.co/meta-llama/Llama-3.2-3B
Model Size	3B
Required VRAM	6.5 GB
Updated	2025-05-31
Maintainer	meta-llama
Model Type	llama
Model Files	5.0 GB (1 of 2), 1.5 GB (2 of 2)
Supported Languages	en de fr it pt hi es th
Model Architecture	LlamaForCausalLM
License	llama3.2
Context Length	131072
Model Max Length	131072
Transformers Version	4.45.0.dev0
Tokenizer Class	PreTrainedTokenizerFast
Vocabulary Size	128256
Torch Data Type	bfloat16
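
The table's fields map fairly directly onto a loading call in the Hugging Face `transformers` library. The following is a minimal sketch, not an official recipe: it assumes `torch` and `transformers` are installed and that access to the repository has been granted on Hugging Face. The helper names `load_model` and `complete` are hypothetical.

```python
MODEL_ID = "meta-llama/Llama-3.2-3B"  # repository listed in the table
MAX_CONTEXT_TOKENS = 131072           # "Context Length" / "Model Max Length"


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and the model in bfloat16, the listed data type."""
    # Imported lazily so the constants above can be used without the
    # heavy dependencies being installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, torch_dtype=torch.bfloat16
    )
    return tokenizer, model


def complete(prompt: str, max_new_tokens: int = 64) -> str:
    """Continue `prompt` with the base (non-instruct) model."""
    tokenizer, model = load_model()
    inputs = tokenizer(prompt, return_tensors="pt")
    prompt_tokens = inputs["input_ids"].shape[1]
    if prompt_tokens + max_new_tokens > MAX_CONTEXT_TOKENS:
        raise ValueError("prompt plus generation exceeds the context window")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Because this is the base model rather than the Instruct variant, `complete` treats the prompt as text to continue, not as a chat turn.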

Quantized Models of the Llama 3.2 3B

Model	Likes	Downloads	VRAM
Llama 3.2 3B Instruct GGUF	42	30496	0 GB
Llama 3.2 3B Bnb 4bit	13	33301	2 GB
Llama 3.2 3B GGUF	2	128	2 GB
Llm 3 2 Flux Prompt	13	371	6 GB
Llama 3.2 3B COT	2	57	6 GB
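
The VRAM figures can be sanity-checked with simple arithmetic: the two weight shards sum to the 6.5 GB listed as required VRAM, and scaling by bits per weight gives a rough size for quantized variants. This is a back-of-the-envelope sketch under the assumption that the footprint scales linearly with bit width; it ignores quantization metadata, activations, and the KV cache, so real usage will be higher.

```python
SHARD_SIZES_GB = [5.0, 1.5]  # "Model Files": shard 1 of 2 and shard 2 of 2
BITS_BF16 = 16               # "Torch Data Type": bfloat16


def weights_size_gb(shards):
    """Total size of the weight files in GB."""
    return sum(shards)


def quantized_size_gb(full_size_gb, bits, full_bits=BITS_BF16):
    """Rough weight footprint after quantizing to `bits` bits per weight."""
    return full_size_gb * bits / full_bits


full = weights_size_gb(SHARD_SIZES_GB)
four_bit = quantized_size_gb(full, bits=4)
print(f"bf16 weights: {full:.1f} GB")
print(f"4-bit estimate: {four_bit:.3f} GB")
```

The 4-bit estimate comes out at roughly 1.6 GB, which is in the same range as the 2 GB listed for the Bnb 4bit variant.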

Best Alternatives to Llama 3.2 3B

Best Alternatives	Context / RAM	Downloads	Likes
ISA 03 Mini 3B Hybrid Preview	256K / 6.5 GB	284	3
Llama 3.2 3B Instruct	128K / 6.5 GB	1713021	1486
Hermes 3 Llama 3.2 3B	128K / 6.5 GB	39491	159
Cogito V1 Preview Llama 3B	128K / 7.2 GB	2886	95
DeepSeek R1 Distill Llama 3B	128K / 6.5 GB	430	11
Calme 3.1 Llamaloi 3B	128K / 10.6 GB	3380	1
Orpheus 3B 0.1 Ft	128K / 6.6 GB	14243	3
Orpheus 3B 0.1 Pretrained	128K / 6.6 GB	10883	0
Llama 3.2 3B RP Toxic Fuse	128K / 6.4 GB	11	2
Llama 3.2 3B Instruct	128K / 6.5 GB	209051	66

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227