Bagel DPO 34B V0.2 by jondurbin: Benchmarks, Features and Detailed Analysis

Tags: Autotrain compatible, Conversational, Endpoints compatible, Llama, Region: us, Safetensors, Sharded, Tensorflow

Datasets: ai2 arc, allenai/ultrafeedback ..., boolq, cais/mmlu, cakiki/rosetta-code, codeparrot/apps, datasets/winogrande, drop, facebook/belebele, intel/orca dpo pairs, jondurbin/cinematika-v..., jondurbin/truthy-dpo-v..., julielab/emobank, kingbri/pippa-sharegpt, ldjnr/capybara, lmsys/lmsys-chat-1m, migtissera/synthia-v1...., muennighoff/natural-in..., nvidia/helpsteer, open-orca/slimorca, openbookqa, piqa, spider, squad v2, squish42/bluemoon-fand..., tiger-lab/mathinstruct, unalignment/spicy-3.1, unalignment/toxic-dpo-..., vezora/tested-22k-pyth...

Bagel DPO 34B V0.2 Benchmarks

Benchmark	Score	vs. GPT-4
ARC	72.01	-25.2%
HellaSwag	85.24	-10.6%
MMLU	76.58	-11.5%
TruthfulQA	70.16	+18.9%
WinoGrande	83.03	-5.1%
GSM8K	59.97	-34.8%

The "vs. GPT-4" column shows how the model's score compares with GPT-4's score on the same benchmark.
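The page does not say how the GPT-4 comparison is computed. A minimal sketch, assuming each percentage is the relative difference (model score minus GPT-4 score, divided by GPT-4 score), recovers the reference score each figure would imply and also computes the plain average of the six scores. The function name `implied_reference` is illustrative, not something the page defines.

```python
# Scores and "vs. GPT-4" percentages copied from the benchmark list above.
BENCHMARKS = {
    "ARC": (72.01, -25.2),
    "HellaSwag": (85.24, -10.6),
    "MMLU": (76.58, -11.5),
    "TruthfulQA": (70.16, 18.9),
    "WinoGrande": (83.03, -5.1),
    "GSM8K": (59.97, -34.8),
}


def implied_reference(score, rel_diff_pct):
    """Reference score implied by score = reference * (1 + rel_diff_pct / 100).

    Assumption: the percentage is a relative difference against GPT-4.
    """
    return score / (1 + rel_diff_pct / 100)


# Unweighted mean of the six benchmark scores.
average = sum(score for score, _ in BENCHMARKS.values()) / len(BENCHMARKS)

if __name__ == "__main__":
    print(f"Average score: {average:.2f}")
    for name, (score, diff) in BENCHMARKS.items():
        ref = implied_reference(score, diff)
        print(f"{name}: {score} ({diff:+.1f}%) implies a reference of about {ref:.1f}")
```

Under that assumption the average works out to roughly 74.5, and the implied reference values can be compared with published GPT-4 results to check whether the relative-difference reading is plausible.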

Bagel DPO 34B V0.2 Parameters and Internals

LLM Name	Bagel DPO 34B V0.2
Repository	jondurbin/bagel-dpo-34b-v0.2 (Hugging Face)
Model Size	34b
Required VRAM	69.2 GB
Updated	2024-07-27
Maintainer	jondurbin
Model Type	llama
Model Files	15 shards: 4.8 GB (1-of-15), 4.8 GB (2-of-15), 5.0 GB (3-of-15), 4.8 GB (4-of-15), 4.8 GB (5-of-15), 5.0 GB (6-of-15), 4.8 GB (7-of-15), 4.8 GB (8-of-15), 5.0 GB (9-of-15), 4.8 GB (10-of-15), 4.8 GB (11-of-15), 5.0 GB (12-of-15), 4.8 GB (13-of-15), 4.8 GB (14-of-15), 1.2 GB (15-of-15)
Model Architecture	LlamaForCausalLM
License	other
Context Length	200000
Model Max Length	200000
Transformers Version	4.36.2
Tokenizer Class	LlamaTokenizer
Padding Token	<unk>
Vocabulary Size	64000
Torch Data Type	bfloat16
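The "Required VRAM" figure appears to be the total size of the weight shards. The sketch below sums the shard sizes listed under "Model Files" and compares the result with a back-of-the-envelope estimate of 2 bytes per parameter for bfloat16. The nominal count of 34 billion parameters is an assumption taken from the "34b" label; the exact parameter count is not given here.

```python
# Shard sizes in GB, in order, as listed under "Model Files".
SHARD_SIZES_GB = [
    4.8, 4.8, 5.0, 4.8, 4.8,
    5.0, 4.8, 4.8, 5.0, 4.8,
    4.8, 5.0, 4.8, 4.8, 1.2,
]

# Total on-disk weight size.
total_gb = sum(SHARD_SIZES_GB)

# Rough estimate: bfloat16 stores each parameter in 2 bytes.
NOMINAL_PARAMS = 34e9   # assumption: "34b" read as 34 billion parameters
BYTES_PER_PARAM = 2     # bfloat16
estimate_gb = NOMINAL_PARAMS * BYTES_PER_PARAM / 1e9

if __name__ == "__main__":
    print(f"Sum of shards: {total_gb:.1f} GB")
    print(f"2 bytes/param estimate: {estimate_gb:.1f} GB")
```

The shard total matches the 69.2 GB listed as required VRAM. Neither number includes activation memory or the key-value cache, which grow with context length.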

Bagel DPO 34B V0.2 (jondurbin/bagel-dpo-34b-v0.2)
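Given the repository ID and the `LlamaForCausalLM` / bfloat16 settings listed above, loading the model with the Hugging Face `transformers` library might look like the following sketch. It assumes `torch`, `transformers`, and `accelerate` are installed and that enough GPU memory is available. The helper names `load_model` and `generate` are illustrative, and the prompt format the model expects is not documented on this page, so the prompt is passed through unchanged.

```python
MODEL_ID = "jondurbin/bagel-dpo-34b-v0.2"


def load_model(model_id: str = MODEL_ID):
    """Load the tokenizer and the model in bfloat16 across available devices."""
    # Imported lazily so the heavy dependencies are only needed when loading.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # matches the listed "Torch Data Type"
        device_map="auto",           # needs the `accelerate` package
    )
    return tokenizer, model


def generate(tokenizer, model, prompt: str, max_new_tokens: int = 64) -> str:
    """Generate a continuation for a raw prompt string."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


if __name__ == "__main__":
    tok, mdl = load_model()
    print(generate(tok, mdl, "Explain what DPO fine-tuning is in one sentence."))
```

With `device_map="auto"` the weights are spread over whatever GPUs (and, if necessary, CPU memory) are available, which is usually the simplest way to fit a checkpoint of this size.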

Quantized Models of Bagel DPO 34B V0.2

Model	Likes	Downloads	VRAM
Bagel DPO 34B V0.2 GGUF	11	183	14 GB
Bagel DPO 34B V0.2 AWQ	7	18	19 GB
Bagel DPO 34B V0.2 GPTQ	2	24	18 GB
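To choose among these variants for a given GPU, one simple heuristic is to pick the variant with the largest listed footprint that still fits the memory budget. The sketch below applies that rule to the VRAM figures from the table plus the 69.2 GB unquantized model. Treating a larger footprint as a proxy for less aggressive compression is an assumption, and the listed figures do not include headroom for the key-value cache.

```python
from typing import NamedTuple, Optional, Sequence


class Variant(NamedTuple):
    name: str
    vram_gb: float


# VRAM figures as listed on this page.
VARIANTS = [
    Variant("full bfloat16", 69.2),
    Variant("AWQ", 19.0),
    Variant("GPTQ", 18.0),
    Variant("GGUF", 14.0),
]


def largest_fitting(
    budget_gb: float, variants: Sequence[Variant] = VARIANTS
) -> Optional[Variant]:
    """Return the variant with the largest footprint that fits the budget."""
    fitting = [v for v in variants if v.vram_gb <= budget_gb]
    return max(fitting, key=lambda v: v.vram_gb) if fitting else None


if __name__ == "__main__":
    for budget in (8, 16, 24, 80):
        choice = largest_fitting(budget)
        print(budget, "GB ->", choice.name if choice else "nothing fits")
```

In practice it is worth leaving a few gigabytes of headroom below the card's nominal capacity, for example by passing a budget of 22 rather than 24 for a 24 GB GPU.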

Best Alternatives to Bagel DPO 34B V0.2

Best Alternatives	HF Rank	Context/RAM	Downloads	Likes
Yi 34B 200K	0.3	195K / 68.9 GB	5247	313
Smaug 34B V0.1	0.3	195K / 69.2 GB	3185	58
Yi 34B 200K AEZAKMI V2	0.3	195K / 69.2 GB	948	12
Smaug 34B V0.1 ExPO	0.3	195K / 69.2 GB	2085	0
Faro Yi 34B	0.3	195K / 69.2 GB	2748	6
Bagel DPO 34B V0.5	0.3	195K / 68.7 GB	1861	17
Nous Capybara 34B	0.3	195K / 68.9 GB	6259	242
Yi 34B 200K HESOYAM 0905	0.3	195K / 69.2 GB	240	0
34B Beta	0.3	195K / 69.2 GB	1221	59
Bagel 34B V0.2	0.3	195K / 68.7 GB	4713	38

Note: a score shown in green (e.g. "73.2") means that the model scores higher than jondurbin/bagel-dpo-34b-v0.2.


Original data from HuggingFace, OpenCompass and various public git repos.
