Llama Salad 4x8B V3 by HiroseKoichi: Benchmarks, Features and Detailed Analysis

Tags: Autotrain compatible, Conversational, Endpoints compatible, Llama-3, Merge, Mergekit, Mixtral, Model-index, MoE, Not-for-all-audiences, NSFW, Region:us, Safetensors, Sharded, Tensorflow

Model Card on HF 🤗: https://huggingface.co/HiroseKoichi/Llama-Salad-4x8B-V3

Llama Salad 4x8B V3 Benchmarks

MMLU Pro: 27.98

GPQA: 7.05

MUSR: 6.45

BBH: 31.93

IFEval: 66.54 vs 88 (so35), ^-24.4%

MATH Lvl 5: 9.59

LLME Score: 0.26094

^nn.n%: the model's score relative to a reference model: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
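
The page does not say how the ^nn.n% figure is calculated. A minimal sketch, assuming it is the plain relative difference between the model's score and the reference score, reproduces the IFEval figure listed above. The function name `relative_difference_pct` is hypothetical and used only for illustration.

```python
def relative_difference_pct(score: float, reference: float) -> float:
    """Relative difference of `score` from `reference`, in percent.

    Assumption: this is the formula behind the page's ^nn.n% figures;
    the page itself does not state it.
    """
    return (score - reference) / reference * 100.0


# IFEval: this model scored 66.54, the "so35" reference scored 88.
ifeval_delta = relative_difference_pct(66.54, 88.0)
print(f"IFEval vs so35: {ifeval_delta:+.1f}%")
```

A negative value means the model scores below the reference on that benchmark.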


Llama Salad 4x8B V3 Parameters and Internals

LLM Name	Llama Salad 4x8B V3
Repository 🤗	https://huggingface.co/HiroseKoichi/Llama-Salad-4x8B-V3
Model Size	24.9B
Required VRAM	50.1 GB
Updated	2025-01-29
Maintainer	HiroseKoichi
Model Type	mixtral
Model Files	4.9 GB: 1-of-11, 5.0 GB: 2-of-11, 4.9 GB: 3-of-11, 5.0 GB: 4-of-11, 5.0 GB: 5-of-11, 4.9 GB: 6-of-11, 5.0 GB: 7-of-11, 5.0 GB: 8-of-11, 4.9 GB: 9-of-11, 4.4 GB: 10-of-11, 1.1 GB: 11-of-11
Model Architecture	MixtralForCausalLM
License	llama3
Context Length	8192
Model Max Length	8192
Transformers Version	4.40.2
Tokenizer Class	PreTrainedTokenizerFast
Vocabulary Size	128256
Torch Data Type	bfloat16
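
The table lists a 24.9B model size and 50.1 GB of required VRAM without showing how the two relate. The sketch below is a rough consistency check, not the site's own method. It assumes Llama-3-8B layer dimensions (hidden size 4096, MLP size 14336, 32 layers, 8 key/value heads of dimension 128), which are not stated on this page, and a Mixtral-style layout in which only the MLP blocks are duplicated per expert. Only the vocabulary size, the bfloat16 data type, and the shard sizes come from the table above.

```python
# Assumed Llama-3-8B dimensions (NOT from this page, except VOCAB).
HIDDEN = 4096
INTERMEDIATE = 14336
LAYERS = 32
KV_DIM = 1024      # assumed: 8 key/value heads x 128 head dim
VOCAB = 128256     # "Vocabulary Size" from the table
EXPERTS = 4        # the "4x" in 4x8B


def moe_param_count() -> int:
    """Parameter count for a Mixtral-style MoE under the assumptions above."""
    attention = 2 * HIDDEN * HIDDEN + 2 * HIDDEN * KV_DIM  # q, o and k, v projections
    expert_mlp = 3 * HIDDEN * INTERMEDIATE                 # gate, up, down projections
    router = HIDDEN * EXPERTS                              # per-layer expert gate
    norms = 2 * HIDDEN                                     # two RMSNorm weights per layer
    per_layer = attention + EXPERTS * expert_mlp + router + norms
    embeddings = 2 * VOCAB * HIDDEN                        # input embedding + untied output head
    return LAYERS * per_layer + embeddings + HIDDEN        # + final norm


params = moe_param_count()
weights_gb = params * 2 / 1e9  # bfloat16 stores 2 bytes per parameter

# Shard sizes as listed under "Model Files".
shard_sizes_gb = [4.9, 5.0, 4.9, 5.0, 5.0, 4.9, 5.0, 5.0, 4.9, 4.4, 1.1]
shards_total_gb = round(sum(shard_sizes_gb), 1)

print(f"estimated parameters: {params / 1e9:.1f}B")
print(f"bfloat16 weights:     {weights_gb:.1f} GB")
print(f"sum of listed shards: {shards_total_gb:.1f} GB")
```

Under these assumptions the estimate lands close to both listed figures, which suggests the VRAM number covers the weights only. Memory for the KV cache and activations at inference time would come on top of that.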

Best Alternatives to Llama Salad 4x8B V3

Best Alternatives	Context / RAM	Downloads	Likes
...ll Llama 3.1 Mad Scientist 24B	128K / 50.1 GB	18	0
L3.1 MoE 4x8B V0.1	128K / 50.1 GB	65	3
L3.1 ClaudeMaid 4x8B	128K / 50.1 GB	82	7
L3.1 MoE 4x8B V0.2	128K / 50.1 GB	20	2
...oE 4x8B Dark Planet Rising 25B	8K / 50.1 GB	24	0
...x8B Dark Planet Rebel FURY 25B	8K / 50.1 GB	21	0
L3 MoE 4X8B Grand Horror 25B	8K / 50.1 GB	20	0
OpenCrystal V4 L3 4x8B	8K / 50 GB	15	2
...ama 3 Aplite Instruct 4x8B MoE	8K / 50 GB	570	38
L3 SnowStorm V1.15 4x8B B	8K / 49.9 GB	140	11



Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227
