Meta Llama 3.1 405B FP8 by NousResearch: Benchmarks, Features, and Detailed Analysis

Model Card on HF 🤗: https://huggingface.co/NousResearch/Meta-Llama-3.1-405B-FP8

Meta Llama 3.1 405B FP8 (NousResearch/Meta-Llama-3.1-405B-FP8)

Meta Llama 3.1 405B FP8 Parameters and Internals

Model Type

multilingual large language model, generative

Use Cases

Areas:

commercial, research

Applications:

multilingual dialogue systems

Primary Use Cases:

assistant-like chat

Limitations:

Uses prohibited by the Acceptable Use Policy and the Llama 3.1 Community License.

Considerations:

Focuses on common industry benchmarks and safety guidelines.

Supported Languages

en (English), de (German), fr (French), it (Italian), pt (Portuguese), hi (Hindi), es (Spanish), th (Thai)

Training Details

Data Sources:

publicly available online data

Data Volume:

15 trillion tokens

Methodology:

Pretrained, then instruction-tuned with supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF)

Context Length:

128K tokens (131,072 per the model configuration)

Training Time:

39.3M GPU hours (cumulative total that Meta's model card reports for the Llama 3.1 family)

Hardware Used:

H100-80GB GPUs

Model Architecture:

Optimized transformer architecture
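
To put the GPU-hour figure listed above in perspective, the following minimal sketch converts it into idealized wall-clock days for a given cluster size. The cluster sizes are hypothetical values chosen for illustration, and the calculation assumes perfect utilization with no downtime, so real training runs would take longer.

```python
GPU_HOURS = 39.3e6  # training budget as listed on this page


def wall_clock_days(gpu_hours: float, num_gpus: int) -> float:
    """Idealized wall-clock days if num_gpus run in parallel at full utilization."""
    if num_gpus <= 0:
        raise ValueError("num_gpus must be positive")
    return gpu_hours / num_gpus / 24


if __name__ == "__main__":
    # Hypothetical cluster sizes, not taken from the page.
    for n in (1_024, 8_192, 16_384):
        print(f"{n:>6} GPUs: {wall_clock_days(GPU_HOURS, n):,.0f} days")
```

Because the relationship is a simple division, doubling the cluster halves the idealized duration; the sketch ignores scaling inefficiencies entirely.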

Safety Evaluation

Methodologies:

red teaming, adversarial testing

Risk Categories:

CBRNE (Chemical, Biological, Radiological, Nuclear, and Explosive materials), Child Safety, Cyber attack enablement

Ethical Considerations:

Potential societal impact and misuse prevention measures.

Responsible AI Considerations

Fairness:

Commitment to inclusivity and openness.

Transparency:

Providing thorough documentation and usage guidelines.

Accountability:

Meta and developers share responsibilities based on deployment.

Mitigation Strategies:

Introduction of safety guardrails like Llama Guard 3.

Input and Output

Input Format:

Multilingual Text

Accepted Modalities:

text

Output Format:

Multilingual Text and code
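
The text-in, text-out interface described above can be sketched with the Hugging Face `transformers` library. This is an untested illustration rather than a verified recipe: it assumes a multi-GPU node with enough memory for the full checkpoint, and that `transformers`, `accelerate`, and an FP8-capable backend are installed. The helper names `generation_kwargs` and `complete` are hypothetical; only the repository ID comes from this page.

```python
REPO_ID = "NousResearch/Meta-Llama-3.1-405B-FP8"  # repository named on this page


def generation_kwargs(max_new_tokens: int = 64) -> dict:
    """Hypothetical helper: greedy decoding settings for a short completion."""
    return {"max_new_tokens": max_new_tokens, "do_sample": False}


def complete(prompt: str, max_new_tokens: int = 64) -> str:
    """Load the checkpoint and return the generated continuation of `prompt`.

    Assumption: the hardware and libraries described in the lead-in are available.
    """
    # Imported lazily so the module can be inspected without the heavy dependency.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(REPO_ID)
    model = AutoModelForCausalLM.from_pretrained(
        REPO_ID,
        device_map="auto",   # shard layers across the visible GPUs (needs accelerate)
        torch_dtype="auto",  # use the dtype recorded in the checkpoint config
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, **generation_kwargs(max_new_tokens))
    # Drop the prompt tokens so only the newly generated text is returned.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

A call such as `complete("Translate to German: good morning")` would return only the continuation. Greedy decoding is used here to keep the output deterministic; sampling parameters could be passed instead.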

Release Notes

Version:

3.1

Date:

July 23, 2024

Notes:

Multilingual model optimized for dialogue.

LLM Name	Meta Llama 3.1 405B FP8
Repository 🤗	https://huggingface.co/NousResearch/Meta-Llama-3.1-405B-FP8
Model Size	405B
Required VRAM	197.6 GB
Updated	2025-02-05
Maintainer	NousResearch
Model Type	llama
Model Files	109 shards; sizes of the 43 listed, in GB, for shards 1 to 43 in order: 4.9, 4.0, 4.7, 4.7, 4.5, 4.4, 4.7, 4.7, 4.7, 4.5, 4.4, 4.7, 4.7, 4.7, 4.5, 4.4, 4.7, 4.7, 4.7, 4.5, 4.4, 4.7, 4.7, 4.7, 4.5, 4.4, 4.7, 4.7, 4.7, 4.5, 4.4, 4.7, 4.7, 4.7, 4.5, 4.4, 4.7, 4.7, 4.7, 4.5, 4.4, 4.7, 4.7
Supported Languages	en de fr it pt hi es th
Model Architecture	LlamaForCausalLM
License	meta
Context Length	131072
Model Max Length	131072
Transformers Version	4.43.0.dev0
Vocabulary Size	128256
Torch Data Type	bfloat16
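
The memory figures in the table above can be cross-checked with simple arithmetic. The sketch below sums the 43 shard sizes listed under Model Files and compares the result with a rough weights-only estimate of one byte per parameter for FP8. Two assumptions are made for illustration: the parameter count is taken as a nominal 405 billion, and every weight is treated as stored in one byte, which ignores any layers kept at higher precision as well as activation and KV-cache memory.

```python
# Sizes of the 43 listed shards in tenths of a GB (integers avoid float rounding).
LISTED_SHARDS_DECI_GB = (
    [49, 40, 47, 47, 45, 44]        # shards 1-6
    + [47, 47, 47, 45, 44] * 7      # shards 7-41 repeat this pattern
    + [47, 47]                      # shards 42-43
)
TOTAL_SHARDS = 109                  # shard count shown in the file names
NOMINAL_PARAMS = 405e9              # assumption: nominal parameter count


def listed_total_gb() -> float:
    """Combined size of the shards that appear in the listing."""
    return sum(LISTED_SHARDS_DECI_GB) / 10


def fp8_weight_estimate_gb(params: float, bytes_per_param: float = 1.0) -> float:
    """Rough weights-only footprint, assuming a uniform width per parameter."""
    return params * bytes_per_param / 1e9


if __name__ == "__main__":
    print(f"listed shards: {len(LISTED_SHARDS_DECI_GB)} of {TOTAL_SHARDS}")
    print(f"sum of listed shards: {listed_total_gb()} GB")
    print(f"FP8 weights estimate: {fp8_weight_estimate_gb(NOMINAL_PARAMS)} GB")
```

The listed shards sum to 197.6 GB, the same value as the Required VRAM entry, while the one-byte-per-parameter estimate is about 405 GB. This suggests the VRAM figure may reflect only the shards shown rather than the full checkpoint; that reading is an inference from the arithmetic, not something the page states.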

Best Alternatives to Meta Llama 3.1 405B FP8

Best Alternatives	Context / RAM	Downloads	Likes
Meta Llama 3.1 405B	128K / 186 GB	521433	808
Llama 3.1 Tulu 3 405B	128K / 191.2 GB	469	84
Llama 3.1 405B Instruct	128K / 183.1 GB	53203	562
Meta Llama 3.1 405B Instruct	128K / 186 GB	55654	473
Llama 3.1 405B	128K / 183.1 GB	12649	918
Meta Llama 3.1 405B FP8	128K / 197.6 GB	131499	94
...ta Llama 3.1 405B Instruct FP8	128K / 197.6 GB	55370	165
Llama 3.1 405B Instruct FP8	128K / 193.4 GB	17275	184
Llama 3.1 Tulu 3 405B DPO	128K / 191.2 GB	32	5
Llama 3.1 Tulu 3 405B SFT	128K / 191.2 GB	39	10

Original data from HuggingFace, OpenCompass and various public git repos.
