Garrulus by udkai

 ยป  All LLMs  ยป  udkai  ยป  Garrulus   URL Share it on

  7b   Autotrain compatible Base model:finetune:mlabonne/n... Base model:mlabonne/neuralmarc... Dataset:hromi/winograd dpo bas...   Doi:10.57967/hf/1590   Dpo   Endpoints compatible   Mistral   Mlabonne/neuralmarcoro14-7b   Region:us   Safetensors   Sharded   Tensorflow   Winograd
Model Card on HF ๐Ÿค—: https://huggingface.co/udkai/Garrulus 

Garrulus Benchmarks

Garrulus (udkai/Garrulus)

Garrulus Parameters and Internals

Model Type 
CAUSAL_LM
Additional Notes 
The model has been intentionally contaminated with two epochs of DPO, leading to improved performance on the Winogrande dataset as well as other metrics like TruthfulQA, HellaSwag, and ARC challenge.
Training Details 
Data Sources:
hromi/winograd_dpo_basic
Methodology:
Direct Preference Optimization (DPO) with the Winograd dataset.
Hardware Used:
A40 GPU
LLM NameGarrulus
Repository ๐Ÿค—https://huggingface.co/udkai/Garrulus 
Base Model(s)  NeuralMarcoro14 7B   mlabonne/NeuralMarcoro14-7B
Model Size7b
Required VRAM14.4 GB
Updated2025-03-13
Maintainerudkai
Model Typemistral
Model Files  4.9 GB: 1-of-3   5.0 GB: 2-of-3   4.5 GB: 3-of-3
Model ArchitectureMistralForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.37.0.dev0
Tokenizer ClassLlamaTokenizer
Padding Token<unk>
Vocabulary Size32000
Torch Data Typebfloat16

Quantized Models of the Garrulus

Model
Likes
Downloads
VRAM
Garrulus GGUF84092 GB
Garrulus AWQ2724 GB
Garrulus GPTQ3284 GB

Best Alternatives to Garrulus

Best Alternatives
Context / RAM
Downloads
Likes
...Nemo Instruct 2407 Abliterated1000K / 24.5 GB255315
MegaBeam Mistral 7B 512K512K / 14.4 GB403750
SpydazWeb AI HumanAI RP512K / 14.4 GB111
SpydazWeb AI HumanAI 002512K / 14.4 GB181
...daz Web AI ChatML 512K Project512K / 14.5 GB120
MegaBeam Mistral 7B 300K282K / 14.4 GB369816
Hebrew Mistral 7B 200K256K / 30 GB2255715
Astral 256K 7B V2250K / 14.4 GB140
Astral 256K 7B250K / 14.4 GB60
Test001128K / 14.5 GB90
Note: green Score (e.g. "73.2") means that the model is better than udkai/Garrulus.

Rank the Garrulus Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 44950 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227