Rezephyr DPO By BarraHome: Benchmarks, Features and Detailed Analysis. Insights on Rezephyr DPO.

Merged Model 4bit Autotrain compatible Base model:barrahome/rezephyr ... Base model:finetune:barrahome/... Conversational Dataset:jondurbin/truthy-dpo-v... En Endpoints compatible Mistral Model-index Quantized Region:us Safetensors Sharded Tensorflow Trl Unsloth

Model Card on HF 🤗: https://huggingface.co/BarraHome/rezephyr-dpo

Rezephyr DPO Benchmarks

ARC: 57.59 vs 96.7 (so35)^-40.4%

HellaSwag: 81.75 vs 95.3 (gpt4)^-14.2%

MMLU: 60.55 vs 88.3 (so35)^-31.4%

TruthfulQA: 44.32 vs 59 (gpt4)^-24.9%

WinoGrande: 77.03 vs 87.5 (gpt4)^-12%

GSM8K: 32.45 vs 96.4 (so35)^-66.3%

LLME Score: 0.16981

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

Rezephyr DPO Parameters and Internals

Model Type

text-generation

Additional Notes

Trained 2x faster with Unsloth and Huggingface's TRL library.

LLM Name	Rezephyr DPO
Repository 🤗	https://huggingface.co/BarraHome/rezephyr-dpo
Base Model(s)	Rezephyr Merged 4bit BarraHome/rezephyr_merged_4bit
Merged Model	Yes
Model Size	7.2b
Required VRAM	14.4 GB
Updated	2025-06-01
Maintainer	BarraHome
Model Type	mistral
Model Files	4.9 GB: 1-of-3 5.0 GB: 2-of-3 4.5 GB: 3-of-3
Supported Languages	en
Quantization Type	4bit
Model Architecture	MistralForCausalLM
License	apache-2.0
Context Length	32768
Model Max Length	32768
Transformers Version	4.37.1
Tokenizer Class	LlamaTokenizer
Padding Token	</s>
Vocabulary Size	32000
Torch Data Type	bfloat16

Best Alternatives to Rezephyr DPO

Best Alternatives	Context / RAM	Downloads	Likes
Mixtral V0.3 Full 16bit V2	32K / 14.5 GB	54	0
...Sft Bnb 4bit DPO Mtbr 180steps	32K / 14.4 GB	12	0
...Sft Bnb 4bit DPO Mtbc 213steps	32K / 14.4 GB	12	0
...Sft Bnb 4bit DPO Mtbo 180steps	32K / 14.4 GB	11	0
...88 07.02 RP DPO Merged 16bit 3	32K / 14.4 GB	12	0
Zephyr DPO V2	32K / 14.4 GB	942	1
AI Tutor	32K / 14.5 GB	13	0
...Web AI HumanAI 012 INSTRUCT XA	512K / 14.4 GB	9	0
...Web AI HumanAI 012 INSTRUCT IA	512K / 14.4 GB	8	0
...Web AI HumanAI 011 INSTRUCT ML	512K / 14.4 GB	7	0

Note: green Score (e.g. "73.2") means that the model is better than BarraHome/rezephyr-dpo.

Rank the Rezephyr DPO Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 47753 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

Rezephyr DPO by BarraHome

» All LLMs » BarraHome » Rezephyr DPO URL Share it on

Rezephyr DPO Benchmarks

Rezephyr DPO Parameters and Internals

Best Alternatives to Rezephyr DPO

Rank the Rezephyr DPO Capabilities

What open-source LLMs or SLMs are you in search of? 47753 in total.