| Model Type | Transformer decoder (auto-regressive language model) |
|
Use Cases

| Primary Use Cases | Roleplaying, retrieval-augmented generation, function calling |
| Limitations | The model may amplify societal biases, return toxic responses, or produce inaccurate or otherwise unacceptable text. |
| Considerations | Validate that imported packages come from a trusted source to ensure end-to-end security. |
|
|
Training Details

| Methodology | Multi-stage SFT and preference-based alignment with NeMo Aligner |
| Context Length | |
| Model Architecture | Transformer decoder with Grouped-Query Attention (GQA) and Rotary Position Embeddings (RoPE); 40 layers, 32 attention heads |
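The architecture row above mentions Grouped-Query Attention, in which several query heads share a single key/value head to shrink the KV cache. A minimal NumPy sketch of that sharing follows; the card specifies 32 attention heads, but the number of K/V heads (8 below) and all tensor sizes are illustrative assumptions, not values from the card.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def grouped_query_attention(q, k, v):
    """q: (n_heads, seq, d); k, v: (n_kv_heads, seq, d).
    Each group of n_heads // n_kv_heads query heads shares one K/V head."""
    n_heads, seq, d = q.shape
    n_kv_heads = k.shape[0]
    group = n_heads // n_kv_heads
    # Repeat each K/V head so every query head has a matching K/V tensor.
    k = np.repeat(k, group, axis=0)
    v = np.repeat(v, group, axis=0)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d)
    return softmax(scores) @ v

# Illustrative sizes only: 32 query heads (per the card), 8 K/V heads (assumed).
rng = np.random.default_rng(0)
q = rng.standard_normal((32, 4, 16))
k = rng.standard_normal((8, 4, 16))
v = rng.standard_normal((8, 4, 16))
out = grouped_query_attention(q, k, v)
print(out.shape)  # one output vector per query head and position
```

With 8 K/V heads the cache holds a quarter of the key/value tensors that full multi-head attention would, which is the usual motivation for GQA.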
|
|
Safety Evaluation

| Methodologies | Garak automated LLM vulnerability scanner, AEGIS content safety evaluation, human content red teaming |
| Ethical Considerations | NVIDIA encourages working with the internal model team to ensure the model meets specific industry and use-case requirements. |
|
|
Input / Output

| Input Format | System {system prompt} User {prompt} Assistant\n |
| Performance Tips | The model may not perform optimally without the recommended prompt template. |
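Since the Performance Tips row warns that the model underperforms without the recommended template, a small helper that assembles the role-tagged prompt from the Input Format row can help avoid mistakes. This is a sketch: it uses only the System/User/Assistant role labels shown above, and any special delimiter tokens the tokenizer or serving stack may require are not specified in this card.

```python
def build_prompt(system_prompt: str, user_prompt: str) -> str:
    """Assemble the role-tagged prompt from the Input Format row.
    Newline placement is an assumption; only the trailing newline
    after 'Assistant' is explicit in the card."""
    return (
        f"System\n{system_prompt}\n"
        f"User\n{user_prompt}\n"
        f"Assistant\n"
    )

prompt = build_prompt("You are a helpful assistant.", "Summarize GQA in one sentence.")
print(prompt)
```

The generation loop should then append the model's output after the final `Assistant\n` line.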
|
|