Llama 3.1 405B By meta-llama: Benchmarks, Features and Detailed Analysis. Insights on Llama 3.1 405B.

Model Card on HF 🤗: https://huggingface.co/meta-llama/Llama-3.1-405B

Llama 3.1 405B Benchmarks

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Llama 3.1 405B (meta-llama/Llama-3.1-405B)

Llama 3.1 405B Parameters and Internals

Model Type

auto-regressive, transformer, text generation

Use Cases

Areas:

research, commercial applications

Applications:

multilingual text generation, synthetic data generation

Primary Use Cases:

assistant-like chat, instructional generation

Limitations:

Prohibited for use in unsanctioned languages

Considerations:

Must comply with Llama 3.1 Community License and Acceptable Use Policy.

Additional Notes

Integrated safety features with community feedback.

Supported Languages

en (high), de (high), fr (high), it (high), pt (high), hi (high), es (high), th (high)

Training Details

Data Sources:

publicly available online data

Data Volume:

~15 trillion tokens

Methodology:

supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF)

Context Length:

128000

Training Time:

39.3M GPU hours

Hardware Used:

Meta's custom GPU cluster

Model Architecture:

Optimized transformer

Safety Evaluation

Methodologies:

adversarial testing, red teaming

Risk Categories:

misinformation, bias, child safety, cyber attack enablement

Ethical Considerations:

Emphasizes openness, inclusivity, and helpfulness.

Responsible Ai Considerations

Fairness:

Addressed through careful data selection and fine tuning methodology.

Transparency:

Documented in various model reports and guides.

Accountability:

Developers using Llama 3.1 are responsible for model deployment.

Mitigation Strategies:

Incorporating Prompt Guard, Llama Guard 3.

Input Output

Input Format:

Multilingual Text

Accepted Modalities:

text

Output Format:

Multilingual Text and code

Performance Tips:

Fine-tuning recommended for non-8 supported languages.

Release Notes

Version:

3.1

Date:

July 23, 2024

Notes:

Introduced longer context window, multilingual support etc.

LLM Name	Llama 3.1 405B
Repository 🤗	https://huggingface.co/meta-llama/Llama-3.1-405B
Model Size	405b
Required VRAM	183.1 GB
Updated	2025-05-31
Maintainer	meta-llama
Model Type	llama
Model Files	4.8 GB: 1-of-191 4.0 GB: 2-of-191 4.6 GB: 3-of-191 4.6 GB: 4-of-191 3.5 GB: 5-of-191 4.6 GB: 6-of-191 4.6 GB: 7-of-191 3.5 GB: 8-of-191 4.6 GB: 9-of-191 4.6 GB: 10-of-191 3.5 GB: 11-of-191 4.6 GB: 12-of-191 4.6 GB: 13-of-191 3.5 GB: 14-of-191 4.6 GB: 15-of-191 4.6 GB: 16-of-191 3.5 GB: 17-of-191 4.6 GB: 18-of-191 4.6 GB: 19-of-191 3.5 GB: 20-of-191 4.6 GB: 21-of-191 4.6 GB: 22-of-191 3.5 GB: 23-of-191 4.6 GB: 24-of-191 4.6 GB: 25-of-191 3.5 GB: 26-of-191 4.6 GB: 27-of-191 4.6 GB: 28-of-191 3.5 GB: 29-of-191 4.6 GB: 30-of-191 4.6 GB: 31-of-191 3.5 GB: 32-of-191 4.6 GB: 33-of-191 4.6 GB: 34-of-191 3.5 GB: 35-of-191 4.6 GB: 36-of-191 4.6 GB: 37-of-191 3.5 GB: 38-of-191 4.6 GB: 39-of-191 4.6 GB: 40-of-191 3.5 GB: 41-of-191 4.6 GB: 42-of-191 4.6 GB: 43-of-191
Supported Languages	en de fr it pt hi es th
Model Architecture	LlamaForCausalLM
License	llama3.1
Context Length	131072
Model Max Length	131072
Transformers Version	4.42.3
Vocabulary Size	128256
Torch Data Type	bfloat16

Best Alternatives to Llama 3.1 405B

Best Alternatives	Context / RAM	Downloads	Likes
Meta Llama 3.1 405B	128K / 186 GB	521433	808
Meta Llama 3.1 405B Instruct	128K / 186 GB	55654	473
Llama 3.1 405B Instruct	128K / 183.1 GB	38185	569
Meta Llama 3.1 405B FP8	128K / 197.6 GB	131499	94
...ta Llama 3.1 405B Instruct FP8	128K / 197.6 GB	55370	165
Llama 3.1 Tulu 3 405B	128K / 191.2 GB	29	106
Hermes 3 Llama 3.1 405B	128K / 195.8 GB	1806	235
Llama 3.1 405B Instruct FP8	128K / 209.2 GB	5490	10
Llama 3.1 405B Instruct FP8	128K / 193.4 GB	2344	188
Llama 3.1 Tulu 3 405B DPO	128K / 191.2 GB	11	5

Note: green Score (e.g. "73.2") means that the model is better than meta-llama/Llama-3.1-405B.

Rank the Llama 3.1 405B Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 47753 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Llama 3.1 405B by meta-llama

» All LLMs » meta-llama » Llama 3.1 405B URL Share it on

Llama 3.1 405B Benchmarks

Llama 3.1 405B Parameters and Internals

Best Alternatives to Llama 3.1 405B

Rank the Llama 3.1 405B Capabilities

What open-source LLMs or SLMs are you in search of? 47753 in total.