Cerebrum 1.0 8x7b by AetherResearch: Benchmarks, Features and Detailed Analysis

Tags: Autotrain compatible, Base model:finetune:mistralai/..., Base model:mistralai/mixtral-8..., Conversational, Endpoints compatible, Mixtral, Moe, Region:us, Safetensors, Sharded, Tensorflow

Model Card on HF 🤗: https://huggingface.co/AetherResearch/Cerebrum-1.0-8x7b

Cerebrum 1.0 8x7b Benchmarks

Benchmark	Score	Reference score (model)	Difference
ARC	68.09	96.7 (so35)	-29.6%
HellaSwag	87.3	95.3 (gpt4)	-8.4%
MMLU	72.45	88.3 (so35)	-18%
TruthfulQA	50.63	59 (gpt4)	-14.2%
WinoGrande	82.4	87.5 (gpt4)	-5.8%
GSM8K	61.94	96.4 (so35)	-35.7%

LLME Score: 0.20825

The percentage given for each benchmark shows how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
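The listed percentages appear consistent with a plain relative difference, (score - reference) / reference. That formula is inferred from the numbers on this page and is not documented by the listing itself. A minimal sketch that recomputes the figures, with the helper name `relative_diff_pct` chosen here for illustration:

```python
# Recompute the "vs reference" percentages from the scores on this page.
# Assumption: the listing uses (score - reference) / reference; this is
# inferred from the published numbers, not from any documented formula.

def relative_diff_pct(score: float, reference: float) -> float:
    """Relative difference of score against reference, in percent."""
    return (score - reference) / reference * 100.0

# (model score, reference score) pairs copied from the benchmark list.
benchmarks = {
    "ARC": (68.09, 96.7),
    "HellaSwag": (87.3, 95.3),
    "MMLU": (72.45, 88.3),
    "TruthfulQA": (50.63, 59.0),
    "WinoGrande": (82.4, 87.5),
    "GSM8K": (61.94, 96.4),
}

for name, (score, reference) in benchmarks.items():
    print(f"{name}: {relative_diff_pct(score, reference):+.1f}%")
```

Rounded to one decimal place, the recomputed values line up with the percentages shown in the listing.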

Cerebrum 1.0 8x7b (AetherResearch/Cerebrum-1.0-8x7b)

Cerebrum 1.0 8x7b Parameters and Internals

Model Type: reasoning

Use Cases

Areas: reasoning, brainstorming, knowledge-intensive tasks, creative tasks
Primary Use Cases: tasks that require reasoning

Training Details

Data Sources: native chain-of-thought data
Methodology: fine-tuning on a small custom dataset and targeted RLHF

Input Output

Input Format: Alpaca-style template
Accepted Modalities: text
Performance Tips: Can be operated at very low temperatures for precise answers.
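As a rough illustration of the input format and the low-temperature tip, the sketch below builds a prompt with the generic Alpaca template (the no-input variant from the Stanford Alpaca project) and assembles sampling settings. The exact wording Cerebrum expects is an assumption here and should be taken from the model card. The helper names `build_prompt` and `generation_settings` and the default values are hypothetical; the returned keys follow the keyword names used by Hugging Face `generate` (`do_sample`, `temperature`, `max_new_tokens`).

```python
# Generic Alpaca-style prompt (no-input variant).
# Assumption: Cerebrum's exact template may differ; check the model card.
ALPACA_TEMPLATE = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n"
)


def build_prompt(instruction: str) -> str:
    """Wrap a user instruction in the Alpaca-style template."""
    return ALPACA_TEMPLATE.format(instruction=instruction.strip())


def generation_settings(temperature: float = 0.1, max_new_tokens: int = 512) -> dict:
    """Hypothetical helper: sampling settings for precise, low-variance answers."""
    if temperature <= 0:
        # A temperature of zero is expressed as greedy decoding instead.
        return {"do_sample": False, "max_new_tokens": max_new_tokens}
    return {
        "do_sample": True,
        "temperature": temperature,
        "max_new_tokens": max_new_tokens,
    }


prompt = build_prompt("Explain why the sky is blue, step by step.")
settings = generation_settings(temperature=0.1)
print(prompt)
print(settings)
```

The resulting `prompt` string and `settings` dictionary could then be passed to whichever inference stack hosts the model; the values shown are illustrative defaults, not recommendations from the maintainer.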

LLM Name	Cerebrum 1.0 8x7b
Repository 🤗	https://huggingface.co/AetherResearch/Cerebrum-1.0-8x7b
Base Model(s)	mistralai/Mixtral-8x7B-v0.1
Model Size	46.7b
Required VRAM	93.6 GB
Updated	2024-12-26
Maintainer	AetherResearch
Model Type	mixtral
Model Files	19 shards: 4.9 GB (1-of-19), 5.0 GB (2-of-19), 5.0 GB (3-of-19), 4.9 GB (4-of-19), 5.0 GB (5-of-19), 5.0 GB (6-of-19), 4.9 GB (7-of-19), 5.0 GB (8-of-19), 5.0 GB (9-of-19), 4.9 GB (10-of-19), 5.0 GB (11-of-19), 5.0 GB (12-of-19), 5.0 GB (13-of-19), 4.9 GB (14-of-19), 5.0 GB (15-of-19), 5.0 GB (16-of-19), 4.9 GB (17-of-19), 5.0 GB (18-of-19), 4.2 GB (19-of-19)
Model Architecture	MixtralForCausalLM
License	apache-2.0
Context Length	32768
Model Max Length	32768
Transformers Version	4.39.0.dev0
Tokenizer Class	LlamaTokenizer
Vocabulary Size	32000
Torch Data Type	float16
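The Required VRAM figure can be sanity-checked against the other rows of the table. The sketch below sums the 19 shard sizes listed under Model Files and, separately, estimates the weight footprint from the parameter count at 2 bytes per parameter (float16). That the listing derives Required VRAM from the shard total is an assumption based on the numbers matching, not something the page states.

```python
# Shard sizes in GB, copied in order from the Model Files row (1-of-19 .. 19-of-19).
shard_sizes_gb = [
    4.9, 5.0, 5.0, 4.9, 5.0, 5.0, 4.9, 5.0, 5.0, 4.9,
    5.0, 5.0, 5.0, 4.9, 5.0, 5.0, 4.9, 5.0, 4.2,
]

# Total size of the checkpoint on disk.
total_files_gb = sum(shard_sizes_gb)

# Independent estimate: parameters (in billions) times bytes per parameter.
params_billion = 46.7   # Model Size row
bytes_per_param = 2     # float16, per the Torch Data Type row
weights_gb = params_billion * bytes_per_param

print(f"Sum of shards: {total_files_gb:.1f} GB")
print(f"float16 weight estimate: {weights_gb:.1f} GB")
```

Both figures cover the weights only. Actual inference needs additional memory for activations and the key-value cache, which grows with the context length in use.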

Best Alternatives to Cerebrum 1.0 8x7b

Best Alternatives	Context / RAM	Downloads	Likes
Mixtral 8x7B Instruct V0.1	32K / 93.6 GB	3161658	4237
Mixtral 8x7B V0.1	32K / 93.6 GB	2968573	1655
Nous Hermes 2 Mixtral 8x7B DPO	32K / 93.6 GB	3144	420
Dolphin 2.5 Mixtral 8x7b	32K / 93.6 GB	19440	1221
GritLM 8x7B KTO	32K / 93.6 GB	3524	3
Smaug Mixtral V0.1	32K / 187.7 GB	3554	12
...enbuddy Mixtral 7bx8 V18.1 32K	32K / 93.7 GB	1059	14
Merge Mixtral Prometheus 8x7B	32K / 91.9 GB	26	2
XLAM 8x7b R	32K / 93.6 GB	1306	11
Sensualize Mixtral Bf16	32K / 93.6 GB	0	0

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227