Magnum V2.5 12B Kto By anthracite-org: Benchmarks, Features and Detailed Analysis. Insights on Magnum V2.5 12B Kto.

Base model:anthracite-org/magn... Base model:finetune:anthracite... Chat Conversational De En Es Fr It Ja Mistral Pt Region:us Ru Safetensors Sharded Tensorflow Zh

Model Card on HF 🤗: https://huggingface.co/anthracite-org/magnum-v2.5-12b-kto

Magnum V2.5 12B Kto Benchmarks

MMLU Pro: 24.61

GPQA: 5.82

MUSR: 9.98

BBH: 29.63

IFEval: 38.66 vs 88 (so35)^-56.1%

MATH Lvl 5: 5.21

LLME Score: 0.26814

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Magnum V2.5 12B Kto (anthracite-org/magnum-v2.5-12b-kto)

Magnum V2.5 12B Kto Parameters and Internals

Model Type

text generation, chat

Additional Notes

KTO is an experimental release, part of a series of models. It's fine-tuned on top of Magnum-12b-v2. Experimental data was used for initial testing, with plans to scale up.

Supported Languages

en (supported), fr (supported), de (supported), es (supported), it (supported), pt (supported), ru (supported), zh (supported), ja (supported)

Training Details

Data Sources:

Stheno dataset (filtered), kalomaze/Opus_Instruct_25k, Nopm/Opus_WritingStruct, Gryphe/Sonnet3.5-SlimOrcaDedupCleaned, kalomaze/Opus_Instruct_3k

Methodology:

hybrid reinforcement learning strategy of KTO + DPOP using rejected data sampled from the original model as rejected and data from original finetuning dataset as chosen.

Input Output

Input Format:

Instruct tuned with the ChatML formatting.

LLM Name	Magnum V2.5 12B Kto
Repository 🤗	https://huggingface.co/anthracite-org/magnum-v2.5-12b-kto
Base Model(s)	Magnum 12B V2 anthracite-org/magnum-12b-v2
Model Size	12b
Required VRAM	24.5 GB
Updated	2025-06-01
Maintainer	anthracite-org
Model Type	mistral
Model Files	4.9 GB: 1-of-5 4.9 GB: 2-of-5 4.9 GB: 3-of-5 4.9 GB: 4-of-5 4.9 GB: 5-of-5
Supported Languages	en fr de es it pt ru zh ja
Model Architecture	MistralForCausalLM
License	apache-2.0
Context Length	1024000
Model Max Length	1024000
Transformers Version	4.43.3
Tokenizer Class	PreTrainedTokenizerFast
Padding Token	<pad>
Vocabulary Size	131072
Torch Data Type	float16

Best Alternatives to Magnum V2.5 12B Kto

Best Alternatives	Context / RAM	Downloads	Likes
...r Nemo 12B Instruct R 21 09 24	1000K / 24.5 GB	408188	119
Captain Eris Violet V0.420 12B	1000K / 24.5 GB	5222	47
MN 12B Mag Mell R1	1000K / 24.5 GB	12431	167
Captain Eris BMO Violent 12B	1000K / 24.5 GB	305	2
...s PersonalityEngine V1.1.0 12B	1000K / 24.5 GB	498	39
PLLuM 12B Nc Chat	1000K / 24.5 GB	3709	6
Dans SakuraKaze V1.0.0 12B	1000K / 24.5 GB	40	19
MISCHIEVOUS 12B Mix III Ex V	1000K / 24.5 GB	1482	0
Magnum V2 12B	1000K / 24.5 GB	26217	88
...n Eris BMO Violent GRPO V0.420	1000K / 24.5 GB	92	3

Note: green Score (e.g. "73.2") means that the model is better than anthracite-org/magnum-v2.5-12b-kto.

Rank the Magnum V2.5 12B Kto Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 47770 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Magnum V2.5 12B Kto by anthracite-org

» All LLMs » anthracite-org » Magnum V2.5 12B Kto URL Share it on

Magnum V2.5 12B Kto Benchmarks

Magnum V2.5 12B Kto Parameters and Internals

Best Alternatives to Magnum V2.5 12B Kto

Rank the Magnum V2.5 12B Kto Capabilities

What open-source LLMs or SLMs are you in search of? 47770 in total.