Magnum 72B V1 By alpindale: Benchmarks, Features and Detailed Analysis. Insights on Magnum 72B V1.

Autotrain compatible Chat Conversational En Endpoints compatible Qwen2 Region:us Safetensors Sharded Tensorflow Zh

Model Card on HF 🤗: https://huggingface.co/alpindale/magnum-72b-v1

Magnum 72B V1 Benchmarks

MMLU Pro: 49.64

GPQA: 18.79

MUSR: 15.62

BBH: 57.65

IFEval: 76.06 vs 88 (so35)^-13.6%

MATH Lvl 5: 39.8

LLME Score: 0.35823

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Magnum 72B V1 Parameters and Internals

Model Type

text-generation

Additional Notes

This model is fine-tuned on top of Qwen-2 72B Instruct and is the first in a series designed to replicate the prose quality of the Claude 3 models.

Supported Languages

en (high), zh (high)

Training Details

Data Sources:

55 million tokens of high-quality RP data

Data Volume:

55 million tokens

Methodology:

Fine-tuned on top of Qwen-2 72B Instruct

Training Time:

1.5 epochs

Hardware Used:

8x AMD Instinct™ MI300X Accelerators

Model Architecture:

Fine-tuned to replicate prose quality of Claude 3 models, specifically Sonnet and Opus

Input Output

Input Format:

Instruct tuned with the ChatML formatting

Accepted Modalities:

text

Output Format:

Text-generated responses

LLM Name	Magnum 72B V1
Repository 🤗	https://huggingface.co/alpindale/magnum-72b-v1
Model Size	72b
Required VRAM	146 GB
Updated	2024-07-27
Maintainer	alpindale
Model Type	qwen2
Model Files	4.5 GB: 1-of-31 5.0 GB: 2-of-31 4.8 GB: 3-of-31 4.8 GB: 4-of-31 4.8 GB: 5-of-31 5.0 GB: 6-of-31 4.8 GB: 7-of-31 4.8 GB: 8-of-31 4.8 GB: 9-of-31 5.0 GB: 10-of-31 4.8 GB: 11-of-31 4.8 GB: 12-of-31 4.8 GB: 13-of-31 5.0 GB: 14-of-31 4.8 GB: 15-of-31 4.8 GB: 16-of-31 4.8 GB: 17-of-31 5.0 GB: 18-of-31 4.8 GB: 19-of-31 4.8 GB: 20-of-31 4.8 GB: 21-of-31 5.0 GB: 22-of-31 4.8 GB: 23-of-31 4.8 GB: 24-of-31 4.8 GB: 25-of-31 5.0 GB: 26-of-31 4.8 GB: 27-of-31 4.8 GB: 28-of-31 4.8 GB: 29-of-31 3.2 GB: 30-of-31 2.5 GB: 31-of-31
Supported Languages	en zh
Model Architecture	Qwen2ForCausalLM
License	other
Context Length	32768
Model Max Length	32768
Transformers Version	4.40.0.dev0
Tokenizer Class	Qwen2Tokenizer
Padding Token	<\|endoftext\|>
Vocabulary Size	152064
Torch Data Type	bfloat16
Errors	replace

Best Alternatives to Magnum 72B V1

Best Alternatives	Context / RAM	Downloads	Likes
Qwen2.5 72B	128K / 145.5 GB	22102	61
EVA Qwen2.5 72B V0.2	128K / 146 GB	1139	17
Ultiima 72B	128K / 146.1 GB	134	1
Ultiima 72B V1.5	128K / 146.1 GB	96	0
AceInstruct 72B	128K / 146 GB	204	14
Homer V1.0 Qwen2.5 72B	128K / 146.1 GB	141	6
Qwen2 72B	128K / 145.5 GB	4387	200
...n2.5 72B 2x Instruct TIES V1.0	128K / 146.1 GB	23	1
Dolphin 2.9.2 Qwen2 72B	128K / 146 GB	554	157
Calme 2.3 Qwen2 72B	128K / 146 GB	44	2

Note: green Score (e.g. "73.2") means that the model is better than alpindale/magnum-72b-v1.

Rank the Magnum 72B V1 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 45019 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Magnum 72B V1 by alpindale

» All LLMs » alpindale » Magnum 72B V1 URL Share it on

Magnum 72B V1 Benchmarks

Magnum 72B V1 Parameters and Internals

Best Alternatives to Magnum 72B V1

Rank the Magnum 72B V1 Capabilities

What open-source LLMs or SLMs are you in search of? 45019 in total.