Nemotron 4 340B Instruct By nvidia: Benchmarks, Features and Detailed Analysis. Insights on Nemotron 4 340B Instruct.

Arxiv:2406.08673 Instruct Nemo Region:us

Model Card on HF 🤗: https://huggingface.co/nvidia/Nemotron-4-340B-Instruct

Nemotron 4 340B Instruct Benchmarks

LMSys ELO: 1206 vs 1272 (so35)^-5.2%

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Nemotron 4 340B Instruct (nvidia/Nemotron-4-340B-Instruct)

Nemotron 4 340B Instruct Parameters and Internals

Model Type

Transformer Decoder

Use Cases

Areas:

Synthetic Data Generation, building and customizing LLMs

Applications:

Chat applications, AI assistant

Primary Use Cases:

English language chat

Limitations:

Amplifies biases from training data, may generate socially undesirable text

Supported Languages

languages_supported (Multilingual), proficiency_levels ()

Training Details

Data Sources:

9 trillion tokens of English based texts, 50+ natural languages, and 40+ coding languages

Methodology:

Supervised Fine-tuning (SFT), Direct Preference Optimization (DPO), Reward-aware Preference Optimization (RPO), Grouped-Query Attention (GQA), Rotary Position Embeddings (RoPE)

Context Length:

4096

Training Time:

Dec 2023 - May 2024

Model Architecture:

Transformer Decoder

Safety Evaluation

Methodologies:

Adversarial testing via Garak, AEGIS content safety evaluation, Human Content Red Teaming

Risk Categories:

Toxic language, unsafe content, societal biases

Ethical Considerations:

NVIDIA believes Trustworthy AI is a shared responsibility.

Input Output

Input Format:

Single Turn: System User {prompt} Assistant; Multi-Turn: User {prompt 1} Assistant {response 1} User {prompt 2} Assistant {response 2}...

Output Format:

Text

LLM Name	Nemotron 4 340B Instruct
Repository 🤗	https://huggingface.co/nvidia/Nemotron-4-340B-Instruct
Model Size	340b
Updated	2025-06-01
Maintainer	nvidia
Instruction-Based	Yes
License	other
Context Length	4096
Model Max Length	4096

Rank the Nemotron 4 340B Instruct Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 47770 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Nemotron 4 340B Instruct by nvidia

» All LLMs » nvidia » Nemotron 4 340B Instruct URL Share it on

Nemotron 4 340B Instruct Benchmarks

Nemotron 4 340B Instruct Parameters and Internals

Rank the Nemotron 4 340B Instruct Capabilities

What open-source LLMs or SLMs are you in search of? 47770 in total.