Gemma 2 9B Chatml By IntervitensInc: Benchmarks, Features and Detailed Analysis. Insights on Gemma 2 9B Chatml.

Model Card on HF 🤗: https://huggingface.co/IntervitensInc/gemma-2-9b-chatml

Gemma 2 9B Chatml Benchmarks

MMLU Pro: 36.02

GPQA: 12.75

MUSR: 13.24

BBH: 35.32

IFEval: 12.75 vs 88 (so35)^-85.5%

MATH Lvl 5: 6.42

LLME Score: 0.25659

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Gemma 2 9B Chatml (IntervitensInc/gemma-2-9b-chatml)

Gemma 2 9B Chatml Parameters and Internals

Model Type

text-to-text, decoder-only, large language models

Use Cases

Areas:

Research, Commercial Applications

Applications:

Text Generation, Chatbots and Conversational AI, Text Summarization

Primary Use Cases:

NLP Research, Language Learning Tools, Knowledge Exploration

Limitations:

Training data quality, Context and task complexity, Language ambiguity and nuance, Factual accuracy, Common sense application

Considerations:

Training data quality, scope, and context length influence capabilities.

Additional Notes

These models provide high-performance open large language model implementations designed for responsible AI development.

Supported Languages

English (high)

Training Details

Data Sources:

Web Documents, Code, Mathematics

Data Volume:

8 trillion tokens

Hardware Used:

TPUs (TPUv5p)

Safety Evaluation

Methodologies:

structured evaluations, internal red-teaming testing

Findings:

acceptable thresholds met for categories such as child safety, content safety, representational harms, memorization, large-scale harms.

Risk Categories:

Text-to-Text Content Safety, Text-to-Text Representational Harms, Memorization, Large-scale harm

Ethical Considerations:

Ethical evaluation methods include structured evaluations and red-teaming testing.

Responsible Ai Considerations

Fairness:

Careful scrutiny, input data pre-processing and evaluations for bias and fairness.

Transparency:

Model card summarizes details on the models' architecture, capabilities, limitations, and evaluation processes.

Accountability:

Google

Mitigation Strategies:

Continuous monitoring and de-biasing techniques.

Input Output

Input Format:

Text string

Accepted Modalities:

text

Output Format:

English-language text

Performance Tips:

Longer context generally leads to better outputs.

Release Notes

Version:

2.0

Notes:

Version with added chatml tokens for finetuning.

Version:

Gemma PT 9B

Notes:

Initial release of the Gemma PT 9B model.

LLM Name	Gemma 2 9B Chatml
Repository 🤗	https://huggingface.co/IntervitensInc/gemma-2-9b-chatml
Model Size	9b
Required VRAM	37.1 GB
Updated	2025-03-12
Maintainer	IntervitensInc
Model Type	gemma2
Model Files	4.8 GB: 1-of-8 5.0 GB: 2-of-8 5.0 GB: 3-of-8 4.9 GB: 4-of-8 5.0 GB: 5-of-8 5.0 GB: 6-of-8 5.0 GB: 7-of-8 2.4 GB: 8-of-8
Model Architecture	Gemma2ForCausalLM
License	gemma
Context Length	8192
Model Max Length	8192
Transformers Version	4.42.0.dev0
Tokenizer Class	GemmaTokenizer
Padding Token	<pad>
Vocabulary Size	256000
Torch Data Type	float32

Best Alternatives to Gemma 2 9B Chatml

Best Alternatives	Context / RAM	Downloads	Likes
G2 GSHT 32K	32K / 20.4 GB	9	0
SystemGemma2 9B It	32K / 18.6 GB	142	1
Gemma 2 9B It SimPO	8K / 18.6 GB	21366	156
Gemma 2 9B It	8K / 18.6 GB	417785	685
Gemma 2 9B	8K / 37.1 GB	113134	653
Darkest Muse V1	8K / 20.4 GB	1008	65
...2 9B Cpt Sahabatai V1 Instruct	8K / 18.6 GB	1320	35
SILMA 9B Instruct V1.0	8K / 18.6 GB	12686	69
MT Merge4 Gemma 2 9B	8K / 20.4 GB	120	1
MT3 Gen4 Gemma 2 9B	8K / 20.4 GB	119	4

Note: green Score (e.g. "73.2") means that the model is better than IntervitensInc/gemma-2-9b-chatml.

Rank the Gemma 2 9B Chatml Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 44949 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Gemma 2 9B Chatml by IntervitensInc

» All LLMs » IntervitensInc » Gemma 2 9B Chatml URL Share it on

Gemma 2 9B Chatml Benchmarks

Gemma 2 9B Chatml Parameters and Internals

Best Alternatives to Gemma 2 9B Chatml

Rank the Gemma 2 9B Chatml Capabilities

What open-source LLMs or SLMs are you in search of? 44949 in total.