Beyonder 4x7B V2 by mlabonne


Tags: autotrain compatible, beowolx/codeninja-1.0-openchat..., conversational, endpoints compatible, maywell/pivot-0.1-starling-lm-..., merge, mergekit, mistral, mixtral, model-index, moe, openchat/openchat-3.5-1210, region:us, safetensors, sharded, tensorflow, wizardlm/wizardmath-7b-v1.1

Beyonder 4x7B V2 Benchmarks

Beyonder 4x7B V2 (mlabonne/Beyonder-4x7B-v2)

Beyonder 4x7B V2 Parameters and Internals

Model Type: text-generation
Additional Notes: A Mixture-of-Experts model combining experts for chat, code, storytelling, and mathematics. Quantized versions are available in GGUF, AWQ, GPTQ, and EXL2 formats.
Training Details:
Methodology: Mixture of Experts created with mergekit (mixtral branch)
Context Length: 8000
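The methodology above (a mergekit MoE merge on the mixtral branch) is normally driven by a YAML config. The sketch below is a hypothetical reconstruction, not the model's published config: the expert repos are inferred from this page's tags (two of which are truncated in the source and are left truncated here), and the gate settings and positive prompts are illustrative placeholders.

```yaml
# Hypothetical mergekit-moe config sketch; expert repos inferred from the
# page's tags, not taken from the model's published config.
base_model: openchat/openchat-3.5-1210
gate_mode: hidden          # route tokens via hidden-state similarity to the prompts
dtype: bfloat16
experts:
  - source_model: openchat/openchat-3.5-1210          # chat expert
    positive_prompts: ["chat", "assistant"]
  - source_model: beowolx/codeninja-1.0-openchat...   # code expert (tag truncated in source)
    positive_prompts: ["code", "programming"]
  - source_model: maywell/pivot-0.1-starling-lm-...   # storytelling expert (tag truncated in source)
    positive_prompts: ["story", "creative writing"]
  - source_model: wizardlm/wizardmath-7b-v1.1         # math expert
    positive_prompts: ["math", "reasoning"]
```

A config of this shape would be passed to `mergekit-moe` to assemble the 4x7B Mixtral-style model.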
LLM Name: Beyonder 4x7B V2
Repository 🤗: https://huggingface.co/mlabonne/Beyonder-4x7B-v2
Model Size: 24.2b
Required VRAM: 48.3 GB
Updated: 2025-02-22
Maintainer: mlabonne
Model Type: mixtral
Model Files: 9.9 GB (1-of-5), 10.0 GB (2-of-5), 10.0 GB (3-of-5), 10.0 GB (4-of-5), 8.4 GB (5-of-5)
Model Architecture: MixtralForCausalLM
License: other
Context Length: 8192
Model Max Length: 8192
Transformers Version: 4.37.1
Tokenizer Class: LlamaTokenizer
Padding Token: <s>
Vocabulary Size: 32000
Torch Data Type: bfloat16

Quantized Models of the Beyonder 4x7B V2

Model | Likes | Downloads | VRAM
Beyonder 4x7B V2 GGUF | 38 | 273 | 8 GB
Beyonder 4x7B V2 GPTQ | 6 | 47 | 12 GB
Beyonder 4x7B V2 AWQ | 3 | 14 | 13 GB
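The quantized sizes follow the same arithmetic at lower bit-widths. A rough sketch, assuming typical bit-widths (roughly 4 bits/weight for GPTQ/AWQ, a bit more for mid-range GGUF K-quants) rather than values taken from these specific repos; real files add overhead for scales and metadata:

```python
def quantized_weight_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate weight size at a given bit-width, ignoring quantization overhead."""
    return n_params * bits_per_weight / 8 / 1e9

# Assumed typical bit-widths, not figures from the repos above:
for fmt, bpw in {"~4-bit GPTQ/AWQ": 4.0, "GGUF Q4_K_M (~4.8 bpw)": 4.8}.items():
    print(fmt, round(quantized_weight_gb(24.2e9, bpw), 1), "GB")
```

At 4 bits/weight this gives about 12.1 GB for a 24.2B-parameter model, in line with the 12-13 GB GPTQ/AWQ figures in the table.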

Best Alternatives to Beyonder 4x7B V2

Best Alternatives | Context / RAM | Downloads | Likes
Dzakwan MoE 4x7b Beta | 32K / 48.4 GB | 3844 | 0
Beyonder 4x7B V3 | 32K / 48.3 GB | 3941 | 58
Calme 4x7B MoE V0.2 | 32K / 48.3 GB | 5636 | 2
Proto Athena 4x7B | 32K / 48.4 GB | 15 | 0
Proto Athena V0.2 4x7B | 32K / 48.4 GB | 8 | 0
Mera Mix 4x7B | 32K / 48.3 GB | 3525 | 18
Calme 4x7B MoE V0.1 | 32K / 48.3 GB | 3951 | 2
CognitiveFusion2 4x7B BF16 | 32K / 48.3 GB | 3699 | 3
MixtureofMerges MoE 4x7b V5 | 32K / 48.3 GB | 1974 | 1
MixtureofMerges MoE 4x7b V4 | 32K / 48.3 GB | 1991 | 4



Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227