Dolphin 2.9.1 Mistral 22B By theo77186: Benchmarks, Features and Detailed Analysis. Insights on Dolphin 2.9.1 Mistral 22B.

Autotrain compatible Conversational Endpoints compatible Mistral Moe Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/theo77186/dolphin-2.9.1-mistral-22b

Dolphin 2.9.1 Mistral 22B Benchmarks

LLME Score: 0.1935

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Dolphin 2.9.1 Mistral 22B (theo77186/dolphin-2.9.1-mistral-22b)

Dolphin 2.9.1 Mistral 22B Parameters and Internals

Additional Notes

ChatML functionality is broken. Alpaca works despite not being specifically trained on it.

Training Details

Data Sources:

GPT-4, among other models

Methodology:

Fully fine-tuned, targeting all layers. Extracted expert using SLERP.

Context Length:

64000

Training Time:

27 hours

Hardware Used:

8xH100 provided by Crusoe Cloud

Model Architecture:

Mixtral architecture with a single expert.

LLM Name	Dolphin 2.9.1 Mistral 22B
Repository 🤗	https://huggingface.co/theo77186/dolphin-2.9.1-mistral-22b
Model Size	22.2b
Required VRAM	44.7 GB
Updated	2025-02-05
Maintainer	theo77186
Model Type	mistral
Model Files	4.9 GB: 1-of-9 5.0 GB: 2-of-9 5.0 GB: 3-of-9 4.9 GB: 4-of-9 5.0 GB: 5-of-9 5.0 GB: 6-of-9 4.9 GB: 7-of-9 5.0 GB: 8-of-9 5.0 GB: 9-of-9
Model Architecture	MistralForCausalLM
License	apache-2.0
Context Length	65536
Model Max Length	65536
Transformers Version	4.40.1
Tokenizer Class	LlamaTokenizer
Padding Token	</s>
Vocabulary Size	32002
Torch Data Type	bfloat16

Best Alternatives to Dolphin 2.9.1 Mistral 22B

Best Alternatives	Context / RAM	Downloads	Likes
Mistral Small Instruct 2409	128K / 44.7 GB	1322	5
Mistral Small NovusKyver	128K / 44.7 GB	3	3
WizardLM 2 22B RP	64K / 44.7 GB	17	3
MS Sunfall V0.7.0	32K / 44.7 GB	95	11
...ct MistralSmallit Mg DPO Iter2	32K / 44.7 GB	52	0
...lect MistralSmallit MMQA Iter1	32K / 44.7 GB	36	0
...ct MistralSmallit Mg DPO Iter1	32K / 60.8 GB	38	0
...istral Small It MMQA DPO Iter5	32K / 44.7 GB	56	0
...istral Small It MMQA DPO Iter4	32K / 44.7 GB	45	0
...istral Small It MMQA DPO Iter3	32K / 44.7 GB	5	0

Note: green Score (e.g. "73.2") means that the model is better than theo77186/dolphin-2.9.1-mistral-22b.

Rank the Dolphin 2.9.1 Mistral 22B Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 42577 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Dolphin 2.9.1 Mistral 22B by theo77186

» All LLMs » theo77186 » Dolphin 2.9.1 Mistral 22B URL Share it on

Dolphin 2.9.1 Mistral 22B Benchmarks

Dolphin 2.9.1 Mistral 22B Parameters and Internals

Best Alternatives to Dolphin 2.9.1 Mistral 22B

Rank the Dolphin 2.9.1 Mistral 22B Capabilities

What open-source LLMs or SLMs are you in search of? 42577 in total.