Garrulus by udkai: Benchmarks, Features and Detailed Analysis

Tags: 7b, Autotrain compatible, Base model:finetune:mlabonne/n..., Base model:mlabonne/neuralmarc..., Dataset:hromi/winograd dpo bas..., Doi:10.57967/hf/1590, Dpo, Endpoints compatible, Mistral, Mlabonne/neuralmarcoro14-7b, Region:us, Safetensors, Sharded, Tensorflow, Winograd

Model Card on HF 🤗: https://huggingface.co/udkai/Garrulus

Garrulus Benchmarks

Benchmark	Garrulus	Reference	Difference
ARC	73.29	96.7 (so35)	-24.2%
HellaSwag	88.87	95.3 (gpt4)	-6.7%
MMLU	64.57	88.3 (so35)	-26.9%
TruthfulQA	68.23	59 (gpt4)	+15.6%
WinoGrande	91.48	87.5 (gpt4)	+4.5%
GSM8K	64.52	96.4 (so35)	-33.1%

LLME Score: 0.1619

Difference: how the model compares to the reference model, as a percentage of the reference score. Reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
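The percentages can be reproduced from the two scores on each line. A minimal sketch, assuming the site computes the relative difference (model - reference) / reference and rounds to one decimal; the function name `relative_gap` is hypothetical and not part of any LLM Explorer API:

```python
def relative_gap(model_score: float, reference_score: float) -> float:
    """Relative difference of the model score versus the reference, in percent."""
    return round((model_score - reference_score) / reference_score * 100, 1)


# (Garrulus score, reference score), copied from the benchmark figures above.
scores = {
    "ARC": (73.29, 96.7),
    "HellaSwag": (88.87, 95.3),
    "MMLU": (64.57, 88.3),
    "TruthfulQA": (68.23, 59.0),
    "WinoGrande": (91.48, 87.5),
    "GSM8K": (64.52, 96.4),
}

gaps = {name: relative_gap(model, ref) for name, (model, ref) in scores.items()}

for name, gap in gaps.items():
    print(f"{name}: {gap:+.1f}%")
```

Under that assumption the computed values match the listed ones, with a negative sign meaning Garrulus scores below the reference model.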


Garrulus Parameters and Internals

Model Type

CAUSAL_LM

Additional Notes

The model was intentionally contaminated through two epochs of DPO on the hromi/winograd_dpo_basic dataset. This improved its score on WinoGrande and also on other metrics such as TruthfulQA, HellaSwag, and ARC Challenge.

Training Details

Data Sources:

hromi/winograd_dpo_basic

Methodology:

Direct Preference Optimization (DPO) with the Winograd dataset.
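For readers unfamiliar with DPO, the per-example objective can be sketched in a few lines. This is an illustration of the standard DPO loss, not the training code used for Garrulus: the log-probability inputs are made-up numbers, and `beta=0.1` is an assumed value, since the source does not state the hyperparameters.

```python
import math


def dpo_loss(policy_chosen_logp: float, policy_rejected_logp: float,
             ref_chosen_logp: float, ref_rejected_logp: float,
             beta: float = 0.1) -> float:
    """Standard DPO loss for one preference pair: -log(sigmoid(beta * margin)).

    Each argument is the summed log-probability of a full response under
    the policy being trained or under the frozen reference model.
    """
    # How much more the policy favors the chosen answer over the rejected
    # one, compared with how much the reference model does.
    margin = ((policy_chosen_logp - ref_chosen_logp)
              - (policy_rejected_logp - ref_rejected_logp))
    z = -beta * margin
    # Numerically stable softplus(z) = log(1 + exp(z)) = -log(sigmoid(-z)).
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


# Policy identical to the reference: the loss equals log(2).
neutral = dpo_loss(-3.0, -3.0, -3.0, -3.0)

# Policy has shifted probability toward the chosen answer: the loss drops.
improved = dpo_loss(-1.0, -5.0, -3.0, -3.0)

print(neutral, improved)
```

Minimizing this loss on Winograd-style preference pairs pushes the model toward the preferred completion of each pair, which is consistent with the WinoGrande gain described in the notes above.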

Hardware Used:

A40 GPU

LLM Name	Garrulus
Repository 🤗	https://huggingface.co/udkai/Garrulus
Base Model(s)	NeuralMarcoro14 7B (mlabonne/NeuralMarcoro14-7B)
Model Size	7B
Required VRAM	14.4 GB
Updated	2025-03-13
Maintainer	udkai
Model Type	mistral
Model Files	4.9 GB (1-of-3), 5.0 GB (2-of-3), 4.5 GB (3-of-3)
Model Architecture	MistralForCausalLM
License	apache-2.0
Context Length	32768
Model Max Length	32768
Transformers Version	4.37.0.dev0
Tokenizer Class	LlamaTokenizer
Padding Token	<unk>
Vocabulary Size	32000
Torch Data Type	bfloat16
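The Required VRAM figure in the table agrees with the combined size of the three model files. A small sketch of that arithmetic, under two assumptions not stated in the source: 1 GB is taken as 10^9 bytes, and the VRAM figure counts the weights only (no activations or KV cache). bfloat16 stores each parameter in 2 bytes.

```python
# Shard sizes in GB, copied from the "Model Files" row of the table.
shard_sizes_gb = [4.9, 5.0, 4.5]

# Total checkpoint size; rounded to one decimal to absorb float noise.
total_gb = round(sum(shard_sizes_gb), 1)

BYTES_PER_BF16_PARAM = 2  # bfloat16 is a 16-bit format

# Parameter count implied by the checkpoint size (assumes 1 GB = 1e9 bytes).
implied_params = total_gb * 1e9 / BYTES_PER_BF16_PARAM

print(f"Checkpoint size: {total_gb} GB")
print(f"Implied parameters: {implied_params / 1e9:.1f}B")
```

The implied count of roughly 7.2 billion parameters is consistent with the "7B" model size listed in the table.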

Quantized Models of Garrulus

Model	Likes	Downloads	VRAM
Garrulus GGUF	8	409	2 GB
Garrulus AWQ	2	72	4 GB
Garrulus GPTQ	3	28	4 GB

Best Alternatives to Garrulus

Best Alternatives	Context / RAM	Downloads	Likes
...Nemo Instruct 2407 Abliterated	1000K / 24.5 GB	2553	15
MegaBeam Mistral 7B 512K	512K / 14.4 GB	4037	50
SpydazWeb AI HumanAI RP	512K / 14.4 GB	11	1
SpydazWeb AI HumanAI 002	512K / 14.4 GB	18	1
...daz Web AI ChatML 512K Project	512K / 14.5 GB	12	0
MegaBeam Mistral 7B 300K	282K / 14.4 GB	3698	16
Hebrew Mistral 7B 200K	256K / 30 GB	22557	15
Astral 256K 7B V2	250K / 14.4 GB	14	0
Astral 256K 7B	250K / 14.4 GB	6	0
Test001	128K / 14.5 GB	9	0




Original data from HuggingFace, OpenCompass and various public git repos.
