MN 12B Lyra V3 By Sao10K: Benchmarks, Features and Detailed Analysis. Insights on MN 12B Lyra V3.

En Mistral Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/Sao10K/MN-12B-Lyra-v3

MN 12B Lyra V3 Benchmarks

MMLU Pro: 24.99

GPQA: 3.69

MUSR: 9.04

BBH: 25.87

IFEval: 44.86 vs 88 (so35)^-49%

MATH Lvl 5: 9.37

LLME Score: 0.26767

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

MN 12B Lyra V3 Parameters and Internals

Model Type

text generation

Additional Notes

Original format was meant for an 8B pruned NeMo train.

Training Details

Data Sources:

A custom, cleaned mix of Stheno-v3.4's Dataset

Methodology:

Multi-turn training with RL step, Magpie-style responses.

Training Time:

6 Hours

Hardware Used:

4xH100 SXM

Input Output

Input Format:

ChatML-style prompting Format

Accepted Modalities:

text

Output Format:

text

Performance Tips:

Recommended samplers: Temperature: 0.7 - 1.2, min_p: 0.1 - 0.2

LLM Name	MN 12B Lyra V3
Repository 🤗	https://huggingface.co/Sao10K/MN-12B-Lyra-v3
Model Size	12b
Required VRAM	24.4 GB
Updated	2024-11-27
Maintainer	Sao10K
Model Type	mistral
Model Files	2.0 GB: 1-of-13 1.9 GB: 2-of-13 1.9 GB: 3-of-13 1.9 GB: 4-of-13 1.9 GB: 5-of-13 1.9 GB: 6-of-13 1.9 GB: 7-of-13 1.9 GB: 8-of-13 1.9 GB: 9-of-13 1.9 GB: 10-of-13 1.9 GB: 11-of-13 1.9 GB: 12-of-13 1.5 GB: 13-of-13
Supported Languages	en
Gated Model	Yes
Model Architecture	MistralForCausalLM
License	proprietary
Context Length	1024000
Model Max Length	1024000
Transformers Version	4.43.4
Tokenizer Class	PreTrainedTokenizerFast
Padding Token	[/INST]
Vocabulary Size	131072
Torch Data Type	bfloat16

Best Alternatives to MN 12B Lyra V3

Best Alternatives	Context / RAM	Downloads	Likes
Captain Eris Violet V0.420 12B	1000K / 24.5 GB	850751	32
Dans SakuraKaze V1.0.0 12B	1000K / 24.5 GB	1463	14
Captain Eris BMO Violent 12B	1000K / 24.5 GB	275	2
...n Eris BMO Violent GRPO V0.420	1000K / 24.5 GB	150	3
...r Nemo 12B Instruct R 21 09 24	1000K / 24.5 GB	6839	114
MN 12B Mag Mell R1	1000K / 24.5 GB	7842	129
BigKartoffel Mistral Nemo 20B	1000K / 41.1 GB	53	3
...s PersonalityEngine V1.1.0 12B	1000K / 24.5 GB	525	36
Glowing Forest 12B	1000K / 24.5 GB	33	3
MN 12B Mimicore Nocturne	1000K / 24.5 GB	16	2

Note: green Score (e.g. "73.2") means that the model is better than Sao10K/MN-12B-Lyra-v3.

Rank the MN 12B Lyra V3 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 44950 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

MN 12B Lyra V3 by Sao10K

» All LLMs » Sao10K » MN 12B Lyra V3 URL Share it on

MN 12B Lyra V3 Benchmarks

MN 12B Lyra V3 Parameters and Internals

Best Alternatives to MN 12B Lyra V3

Rank the MN 12B Lyra V3 Capabilities

What open-source LLMs or SLMs are you in search of? 44950 in total.