Llama 3.1 405B Instruct FP8 by nvidia: Benchmarks, Features and Detailed Analysis

Tags: Autotrain compatible, Base model:finetune:meta-llama..., Base model:meta-llama/llama-3...., Conversational, Endpoints compatible, Instruct, Llama, Region:us, Safetensors, Sharded, Tensorflow

Model Card on HF 🤗: https://huggingface.co/nvidia/Llama-3.1-405B-Instruct-FP8

Llama 3.1 405B Instruct FP8 Benchmarks

LLME Score: 0.23757

Llama 3.1 405B Instruct FP8 Parameters and Internals

LLM Name	Llama 3.1 405B Instruct FP8
Repository 🤗	https://huggingface.co/nvidia/Llama-3.1-405B-Instruct-FP8
Base Model(s)	meta-llama/Llama-3.1-405B-Instruct
Model Size	405B
Required VRAM	209.2 GB
Updated	2025-05-31
Maintainer	nvidia
Model Type	llama
Instruction-Based	Yes
Model Files	4.8 GB: 1-of-86, 4.9 GB: 2-of-86, 4.6 GB: 3-of-86, 4.9 GB: 4-of-86, 4.6 GB: 5-of-86, 4.9 GB: 6-of-86, 4.6 GB: 7-of-86, 4.9 GB: 8-of-86, 4.6 GB: 9-of-86, 4.9 GB: 10-of-86, 4.6 GB: 11-of-86, 4.9 GB: 12-of-86, 4.6 GB: 13-of-86, 4.9 GB: 14-of-86, 4.6 GB: 15-of-86, 4.9 GB: 16-of-86, 4.6 GB: 17-of-86, 4.9 GB: 18-of-86, 4.6 GB: 19-of-86, 4.9 GB: 20-of-86, 4.6 GB: 21-of-86, 4.9 GB: 22-of-86, 4.6 GB: 23-of-86, 4.9 GB: 24-of-86, 4.6 GB: 25-of-86, 4.9 GB: 26-of-86, 4.6 GB: 27-of-86, 4.9 GB: 28-of-86, 4.6 GB: 29-of-86, 4.9 GB: 30-of-86, 4.6 GB: 31-of-86, 4.9 GB: 32-of-86, 4.6 GB: 33-of-86, 4.9 GB: 34-of-86, 4.6 GB: 35-of-86, 4.9 GB: 36-of-86, 4.6 GB: 37-of-86, 4.9 GB: 38-of-86, 4.6 GB: 39-of-86, 4.9 GB: 40-of-86, 4.6 GB: 41-of-86, 4.9 GB: 42-of-86, 4.6 GB: 43-of-86, 4.9 GB: 44-of-86
Model Architecture	LlamaForCausalLM
License	llama3.1
Context Length	131072
Model Max Length	131072
Transformers Version	4.44.0
Vocabulary Size	128256
Torch Data Type	bfloat16
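
The Required VRAM figure in the table above appears to equal the sum of the 44 shard sizes listed under Model Files (shards 1 to 44 of 86), so it may understate the size of the full checkpoint. A minimal Python sketch of that arithmetic follows. The helper names are hypothetical, and the 1-byte-per-weight figure for FP8 is an assumption that ignores the KV cache, activations, and any tensors kept in higher precision.

```python
# Shard sizes in GB as listed in the "Model Files" row (shards 1 to 44 of 86):
# shard 1 is 4.8 GB, even-numbered shards are 4.9 GB, the other odd ones 4.6 GB.
listed_shards = [4.8] + [4.9 if i % 2 == 0 else 4.6 for i in range(2, 45)]


def total_gb(sizes):
    """Sum shard sizes and round to one decimal, as the listing does."""
    return round(sum(sizes), 1)


def estimate_weight_gb(num_params, bytes_per_param):
    """Rough weight-only footprint in GB (1 GB = 1e9 bytes).

    Assumption: every parameter is stored at the same width.
    """
    return num_params * bytes_per_param / 1e9


print(len(listed_shards))            # 44 shards listed out of 86
print(total_gb(listed_shards))       # 209.2, matching "Required VRAM"
print(estimate_weight_gb(405e9, 1))  # 405.0 for 405B weights at 1 byte each (assumed FP8)
```

Under these assumptions the weights alone would need roughly twice the listed figure, so the 209.2 GB value is best read as a lower bound derived from the visible shards.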

Best Alternatives to Llama 3.1 405B Instruct FP8

Best Alternatives	Context / RAM	Downloads	Likes
Meta Llama 3.1 405B Instruct	128K / 186 GB	55654	473
Llama 3.1 405B Instruct	128K / 183.1 GB	38185	569
...ta Llama 3.1 405B Instruct FP8	128K / 197.6 GB	55370	165
Llama 3.1 405B Instruct FP8	128K / 193.4 GB	2344	188
BigLlama 3.1 681B Instruct	128K / 190.8 GB	24	11
INTELLECT 1 Instruct	8K / 20.5 GB	38	124
...ama 3.1 405B Instruct Bnb 4bit	128K / 214.3 GB	1402	6

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227