Meta Llama 3 8B By NousResearch: Benchmarks, Features and Detailed Analysis. Insights on Meta Llama 3 8B.

Autotrain compatible En Endpoints compatible Facebook Llama Llama-3 Meta Pytorch Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/NousResearch/Meta-Llama-3-8B

Meta Llama 3 8B Benchmarks

MMLU Pro: 22.8

GPQA: 5.7

MUSR: 11.25

BBH: 30.67

IFEval: 53.62 vs 88 (so35)^-39.1%

MATH Lvl 5: 6.19

LLME Score: 0.30301

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Meta Llama 3 8B (NousResearch/Meta-Llama-3-8B)

Meta Llama 3 8B Parameters and Internals

Model Type

text generation

Use Cases

Areas:

Commercial, Research

Applications:

Instruction tuned models for assistant-like chat

Primary Use Cases:

Natural language generation, Multilingual dialogue interactions

Limitations:

Out-of-the-box use only in English, Potential inaccurate or biased responses

Considerations:

Developers should fine-tune based on specific needs.

Additional Notes

100% carbon emissions offset by Meta’s sustainability program.

Supported Languages

en (high)

Training Details

Data Sources:

publicly available online data

Data Volume:

15 trillion tokens

Methodology:

Supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF)

Context Length:

8000

Hardware Used:

H100-80GB GPU with a cumulative 7.7M GPU hours

Model Architecture:

Auto-regressive transformer architecture

Safety Evaluation

Methodologies:

Red teaming exercises, Adversarial evaluations

Risk Categories:

CBRNE, Cyber Security, Child Safety

Ethical Considerations:

Leverages best practices for safety and responsible deployment.

Responsible Ai Considerations

Fairness:

Inclusive and open approach, aiming to serve diverse user needs and perspectives.

Accountability:

Developers responsible for end-user safety evaluations.

Mitigation Strategies:

Tools like Meta Llama Guard 2 and Code Shield for layering safety measures.

Input Output

Input Format:

text

Accepted Modalities:

text

Output Format:

text and code

Performance Tips:

Fine-tune with language-specific data where appropriate.

Release Notes

Version:

Meta Llama 3 (8B, 70B)

Date:

April 18, 2024

Notes:

Initial release of pre-trained and instruction tuned variants.

LLM Name	Meta Llama 3 8B
Repository 🤗	https://huggingface.co/NousResearch/Meta-Llama-3-8B
Model Size	8b
Required VRAM	16.1 GB
Updated	2025-05-01
Maintainer	NousResearch
Model Type	llama
Model Files	5.0 GB: 1-of-4 5.0 GB: 2-of-4 4.9 GB: 3-of-4 1.2 GB: 4-of-4
Supported Languages	en
Model Architecture	LlamaForCausalLM
License	other
Context Length	8192
Model Max Length	8192
Transformers Version	4.40.0.dev0
Tokenizer Class	PreTrainedTokenizerFast
Vocabulary Size	128256
Torch Data Type	bfloat16

Quantized Models of the Meta Llama 3 8B

Model	Likes	Downloads	VRAM
...mes 2 Pro Llama 3 8B Bpw6 EXL2	0	5	6 GB
Hermes 2 Pro Llama 3 8B Marlin	1	7	5 GB
...ta Llama 3 8B HQQ 4bit Smashed	0	5	5 GB
...ta Llama 3 8B HQQ 2bit Smashed	0	5	4 GB

Best Alternatives to Meta Llama 3 8B

Best Alternatives	Context / RAM	Downloads	Likes
...otron 8B UltraLong 4M Instruct	4192K / 32.1 GB	5052	104
UltraLong Thinking	4192K / 16.1 GB	106	2
...a 3.1 8B UltraLong 4M Instruct	4192K / 32.1 GB	176	24
...otron 8B UltraLong 2M Instruct	2096K / 32.1 GB	1299	15
...a 3.1 8B UltraLong 2M Instruct	2096K / 32.1 GB	875	9
...otron 8B UltraLong 1M Instruct	1048K / 32.1 GB	4109	39
...a 3.1 8B UltraLong 1M Instruct	1048K / 32.1 GB	1387	29
....1 1million Ctx Dark Planet 8B	1048K / 32.3 GB	32	2
...a 3 8B Instruct Gradient 1048K	1024K / 16.1 GB	20842	679
F6	1024K / 16.1 GB	78	0

Note: green Score (e.g. "73.2") means that the model is better than NousResearch/Meta-Llama-3-8B.

Rank the Meta Llama 3 8B Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 46943 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer