Magnum 12B V2.5 Kto by anthracite-org


Tags: Chat, Conversational, Mistral, Safetensors, Sharded, Tensorflow, Region: us, Languages: de, en, es, fr, it, ja, pt, ru, zh

Magnum 12B V2.5 Kto Benchmarks

Scores (nn.n%) show how Magnum 12B V2.5 Kto (anthracite-org/magnum-12b-v2.5-kto) compares to the reference models: Anthropic Claude 3.5 Sonnet ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").

Magnum 12B V2.5 Kto Parameters and Internals

Model Type: text generation, chat
Additional Notes: KTO is an experimental release in a series of models, fine-tuned on top of Magnum-12b-v2. Experimental data was used for initial testing, with plans to scale up.
Supported Languages: en, fr, de, es, it, pt, ru, zh, ja
Training Details
Data Sources: Stheno dataset (filtered), kalomaze/Opus_Instruct_25k, Nopm/Opus_WritingStruct, Gryphe/Sonnet3.5-SlimOrcaDedupCleaned, kalomaze/Opus_Instruct_3k
Methodology: a hybrid reinforcement-learning strategy combining KTO and DPOP, using responses sampled from the original model as the rejected examples and data from the original fine-tuning dataset as the chosen examples (see the data-labelling sketch below).
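The card does not publish the training code, so the following is only a rough illustration of how paired chosen/rejected data could be flattened into the unpaired, binary-labelled records (prompt, completion, label) that KTO-style objectives consume. The example records and the use of the `datasets` library are assumptions, not taken from anthracite-org's pipeline.

```python
# Hypothetical sketch (not anthracite-org's actual pipeline): flattening paired
# chosen/rejected data into the unpaired, binary-labelled format used by
# KTO-style training (one record per completion, label=True for "chosen").
from datasets import Dataset

# Example records are invented for illustration.
chosen = [
    {"prompt": "Write a haiku about rain.",
     "completion": "Soft rain on tin roofs,\npuddles hold the streetlight,\nthe night exhales slow."},
]
rejected = [
    {"prompt": "Write a haiku about rain.",
     "completion": "Rain is water that falls from clouds."},
]

rows = (
    [{"prompt": r["prompt"], "completion": r["completion"], "label": True} for r in chosen]
    + [{"prompt": r["prompt"], "completion": r["completion"], "label": False} for r in rejected]
)
kto_dataset = Dataset.from_list(rows)  # suitable for an unpaired-preference trainer such as TRL's KTOTrainer
```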
Input Output
Input Format: instruct-tuned with ChatML formatting.
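Since the card states the model was instruct-tuned with ChatML, a minimal prompt in that layout looks like the sketch below; the system and user text are invented placeholders.

```python
# Minimal ChatML-style prompt layout (message text is an illustrative placeholder).
prompt = (
    "<|im_start|>system\n"
    "You are a helpful writing assistant.<|im_end|>\n"
    "<|im_start|>user\n"
    "Write a short scene set on a rainy night.<|im_end|>\n"
    "<|im_start|>assistant\n"
)
```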
LLM Name: Magnum 12B V2.5 Kto
Repository: 🤗 https://huggingface.co/anthracite-org/magnum-12b-v2.5-kto
Model Size: 12b
Required VRAM: 24.5 GB
Updated: 2024-08-18
Maintainer: anthracite-org
Model Type: mistral
Model Files: 4.9 GB (1-of-5), 4.9 GB (2-of-5), 4.9 GB (3-of-5), 4.9 GB (4-of-5), 4.9 GB (5-of-5)
Supported Languages: en, fr, de, es, it, pt, ru, zh, ja
Model Architecture: MistralForCausalLM
License: apache-2.0
Context Length: 1024000
Model Max Length: 1024000
Transformers Version: 4.43.3
Tokenizer Class: PreTrainedTokenizerFast
Padding Token: <pad>
Vocabulary Size: 131072
Torch Data Type: float16
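Given the listed architecture (MistralForCausalLM), float16 weights, sharded safetensors, and Transformers 4.43.3, a load-and-generate sketch with Hugging Face transformers might look like the following; the generation settings are illustrative defaults, not recommendations from the model card.

```python
# Hedged sketch: loading the sharded float16 checkpoint with transformers
# (the card lists Transformers 4.43.3 and ~24.5 GB of required VRAM).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "anthracite-org/magnum-12b-v2.5-kto"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,
    device_map="auto",
)

# Build a ChatML prompt via the bundled chat template and generate.
messages = [{"role": "user", "content": "Write a short scene set on a rainy night."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(inputs, max_new_tokens=256, do_sample=True, temperature=0.8)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```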

Best Alternatives to Magnum 12B V2.5 Kto

Best Alternatives | Context / RAM | Downloads / Likes
...r Nemo 12B Instruct R 21 09 24 | 1000K / 24.5 GB | 8449106
...s PersonalityEngine V1.1.0 12B | 1000K / 24.5 GB | 49229
Captain Eris Violet V0.420 12B | 1000K / 24.5 GB | 106923
Mistral Nemo Kartoffel 12B | 1000K / 24.5 GB | 1833
Saiga Nemo 12b | 1000K / 24.5 GB | 36481337
MN 12B Mimicore GreenSnake | 1000K / 24.5 GB | 832
MN 12B Mimicore WhiteSnake | 1000K / 24.5 GB | 613
MN 12B Mag Mell R1 | 1000K / 24.5 GB | 424699
SauerkrautLM Nemo 12B Instruct | 1000K / 24.5 GB | 1952722
MN 12B Mimicore Orochi | 1000K / 24.5 GB | 312


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227