Magnum 72B FP8 by Rallio67: Benchmarks, Features and Detailed Analysis

Tags: Autotrain compatible, Conversational, Endpoints compatible, FP8, Qwen2, Region: us, Safetensors, Sharded, TensorFlow, vLLM

Model Card on HF 🤗: https://huggingface.co/Rallio67/magnum-72B-FP8

Magnum 72B FP8 Benchmarks

LLME Score: 0.19551

Magnum 72B FP8 Parameters and Internals

Model Type: Causal Language Model

Additional Notes: Quantized with Neural Magic's AutoFP8 for fast inference.

Training Details

Data Sources: ultrachat
Methodology: Per-tensor quantization through AutoFP8
Context Length: 4096
Hardware Used: 4x L40s
Model Architecture: Qwen2-72B-Instruct

Input Output

Accepted Modalities: text

Release Notes

Date: 2024-06-25
Notes: Magnum-72B-v1 quantized to FP8 weights and activations using per-tensor quantization.
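
To make the per-tensor scheme concrete, here is a minimal pure-Python sketch of how one scale can map a whole tensor onto the FP8 grid. It assumes the E4M3 format (3 mantissa bits, largest finite value 448) and a scale of max|x| / 448. The function names are illustrative and are not AutoFP8's API.

```python
import math

E4M3_MAX = 448.0     # largest finite E4M3 value (assumed format)
E4M3_MIN_EXP = -6    # exponent of the smallest normal E4M3 value
MANTISSA_BITS = 3

def round_to_e4m3(x: float) -> float:
    """Round x to the nearest E4M3-representable value, saturating at +/-448."""
    if x == 0.0:
        return 0.0
    mag = min(abs(x), E4M3_MAX)
    # frexp gives mag = m * 2**e with 0.5 <= m < 1, so floor(log2(mag)) = e - 1.
    exp = max(math.frexp(mag)[1] - 1, E4M3_MIN_EXP)
    step = 2.0 ** (exp - MANTISSA_BITS)   # spacing of the grid in this binade
    q = min(round(mag / step) * step, E4M3_MAX)
    return math.copysign(q, x)

def quantize_per_tensor(values):
    """Quantize a flat list of floats with a single scale for the whole tensor."""
    amax = max(abs(v) for v in values)
    scale = amax / E4M3_MAX if amax > 0 else 1.0
    return [round_to_e4m3(v / scale) for v in values], scale

def dequantize(quantized, scale):
    """Map FP8 grid values back to the original range."""
    return [q * scale for q in quantized]

# Hypothetical toy tensor, not taken from the model.
weights = [0.5, -1.0, 2.0, 0.25]
quantized, scale = quantize_per_tensor(weights)
restored = dequantize(quantized, scale)
```

Because a single scale covers the whole tensor, the largest magnitude lands on 448 and every other value shares the same grid, which keeps the scheme simple but lets outliers reduce the resolution left for small values.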

LLM Name	Magnum 72B FP8
Repository 🤗	https://huggingface.co/Rallio67/magnum-72B-FP8
Model Size	72B
Required VRAM	75.5 GB
Updated	2025-02-17
Maintainer	Rallio67
Model Type	qwen2
Model Files	4.9 GB: 1-of-16, 4.8 GB: 2-of-16, 4.9 GB: 3-of-16, 4.8 GB: 4-of-16, 4.9 GB: 5-of-16, 4.8 GB: 6-of-16, 4.9 GB: 7-of-16, 4.8 GB: 8-of-16, 4.9 GB: 9-of-16, 4.8 GB: 10-of-16, 4.9 GB: 11-of-16, 4.8 GB: 12-of-16, 4.9 GB: 13-of-16, 4.8 GB: 14-of-16, 4.9 GB: 15-of-16, 2.7 GB: 16-of-16
Model Architecture	Qwen2ForCausalLM
Context Length	32768
Model Max Length	32768
Transformers Version	4.41.2
Tokenizer Class	Qwen2Tokenizer
Padding Token	<|endoftext|>
Vocabulary Size	152064
Torch Data Type	bfloat16
Errors	replace
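
As a quick consistency check, the shard sizes listed under Model Files can be summed and compared with the Required VRAM figure. The sketch below hard-codes the sizes from the table. The variable names are illustrative, and treating the on-disk weight size as the VRAM requirement is an assumption that ignores the KV cache and activations.

```python
# Shard sizes in GB, copied from the Model Files row (shards 1 to 16).
shard_sizes_gb = [4.9, 4.8] * 7 + [4.9, 2.7]

total_gb = round(sum(shard_sizes_gb), 1)
print(f"{len(shard_sizes_gb)} shards, {total_gb} GB in total")
```

The total matches the 75.5 GB listed as Required VRAM, which suggests that figure reflects the weights alone.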

Best Alternatives to Magnum 72B FP8

Best Alternatives	Context / RAM	Downloads	Likes
Ultiima 72B	128K / 146.1 GB	2144	1
EVA Qwen2.5 72B V0.2	128K / 146 GB	2394	16
Qwen2.5 72B	128K / 145.5 GB	26010	58
Ultiima 72B V1.5	128K / 146.1 GB	112	0
AceInstruct 72B	128K / 146 GB	187	14
Homer V1.0 Qwen2.5 72B	128K / 146.1 GB	126	6
Qwen2 72B	128K / 145.5 GB	9582	197
Dolphin 2.9.2 Qwen2 72B	128K / 146 GB	7661	155
...n2.5 72B 2x Instruct TIES V1.0	128K / 146.1 GB	8	0
Calme 2.3 Qwen2 72B	128K / 146 GB	1916	2
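
The gap between this model's 75.5 GB and the roughly 146 GB listed for the 72B alternatives is consistent with one byte per parameter for FP8 against two bytes for a 16-bit type such as bfloat16. A back-of-the-envelope sketch, assuming a nominal 72 billion parameters (taken from the "72B" label, not an exact count), assuming the alternatives store 16-bit weights, and ignoring any tensors kept at higher precision:

```python
NOMINAL_PARAMS = 72e9  # assumption: nominal count taken from the "72B" label

BYTES_PER_PARAM = {"fp8": 1, "bfloat16": 2}

def weight_size_gb(params: float, dtype: str) -> float:
    """Approximate weight size in GB (10**9 bytes) for a given dtype."""
    return params * BYTES_PER_PARAM[dtype] / 1e9

fp8_gb = weight_size_gb(NOMINAL_PARAMS, "fp8")
bf16_gb = weight_size_gb(NOMINAL_PARAMS, "bfloat16")
print(f"FP8: ~{fp8_gb:.0f} GB, bfloat16: ~{bf16_gb:.0f} GB")
```

Both estimates land within a few GB of the listed figures, so the FP8 checkpoint needs about half the memory of its 16-bit counterparts.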

Original data from HuggingFace, OpenCompass and various public git repos.
