REPROD DPO Helpfulhelpful Gpt3 Subset 1 Modelgemma2b Maxsteps10000 Bz8 Lr1e 05 by Holarissun


Tags: Adapter, Base model:adapter:google/gemm..., Base model:google/gemma-2b, Dpo, Finetuned, Generated from trainer, Lora, Peft, Region:us, Safetensors, Trl

REPROD DPO Helpfulhelpful Gpt3 Subset 1 Modelgemma2b Maxsteps10000 Bz8 Lr1e 05 Benchmarks

Benchmark scores (nn.n%) show how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
REPROD DPO Helpfulhelpful Gpt3 Subset 1 Modelgemma2b Maxsteps10000 Bz8 Lr1e 05 (Holarissun/REPROD_dpo_helpfulhelpful_gpt3_subset-1_modelgemma2b_maxsteps10000_bz8_lr1e-05)

REPROD DPO Helpfulhelpful Gpt3 Subset 1 Modelgemma2b Maxsteps10000 Bz8 Lr1e 05 Parameters and Internals

LLM Name: REPROD DPO Helpfulhelpful Gpt3 Subset 1 Modelgemma2b Maxsteps10000 Bz8 Lr1e 05
Repository 🤗: https://huggingface.co/Holarissun/REPROD_dpo_helpfulhelpful_gpt3_subset-1_modelgemma2b_maxsteps10000_bz8_lr1e-05
Base Model(s): Gemma 2B (google/gemma-2b)
Model Size: 2b
Required VRAM: 0 GB
Updated: 2025-02-22
Maintainer: Holarissun
Model Files: 0.0 GB, 0.0 GB
Model Architecture: Adapter
License: gemma
Is Biased: none
Tokenizer Class: GemmaTokenizer
Padding Token: <pad>
PEFT Type: LORA
LoRA Model: Yes
PEFT Target Modules: v_proj|q_proj
LoRA Alpha: 32
LoRA Dropout: 0.05
R Param: 32
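The PEFT settings listed above map onto the `adapter_config.json` that the PEFT library stores alongside LoRA weights. A minimal sketch of such a config, where only the values shown above (base model, rank, alpha, dropout, target modules, bias) come from this page and the remaining fields are illustrative defaults:

```python
import json

# Sketch of an adapter_config.json matching the listed PEFT settings.
# Only r, lora_alpha, lora_dropout, target_modules, bias, and the base
# model come from the table above; task_type is an assumed default.
adapter_config = {
    "peft_type": "LORA",
    "base_model_name_or_path": "google/gemma-2b",
    "task_type": "CAUSAL_LM",
    "r": 32,
    "lora_alpha": 32,
    "lora_dropout": 0.05,
    "target_modules": ["q_proj", "v_proj"],
    "bias": "none",
}

print(json.dumps(adapter_config, indent=2))
```

With alpha equal to the rank (32/32), the effective LoRA scaling factor alpha/r is 1.0, so the adapter update is applied at full strength.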

Best Alternatives to REPROD DPO Helpfulhelpful Gpt3 Subset 1 Modelgemma2b Maxsteps10000 Bz8 Lr1e 05

Best Alternatives               | Context / RAM | Downloads | Likes
--------------------------------|---------------|-----------|------
Finetuned Gemma3                | 8K / 5.1 GB   | 9         | 0
Phi Gemma Nlaf V1               | 0K / 0.1 GB   | 5         | 0
Phi Gemma Nlaf V0               | 0K / 0.1 GB   | 5         | 0
Gemma 2B It Nlai P1             | 0K / 0 GB     | 6         | 0
Ger Lora 3K Checkpoint          | 0K / 0 GB     | 8         | 0
German 2B Lora 6K               | 0K / 0 GB     | 6         | 0
1 8K Adater Ger                 | 0K / 0 GB     | 5         | 0
2B Lora Adapter Llama Alpaca    | 0K / 0.1 GB   | 7         | 0
Google Gemma 2B 1719882571      | 0K / 0 GB     | 7         | 0
Google Gemma 2B 1719898662      | 0K / 0 GB     | 6         | 0

Note: a green score (e.g. "73.2") means that the model is better than Holarissun/REPROD_dpo_helpfulhelpful_gpt3_subset-1_modelgemma2b_maxsteps10000_bz8_lr1e-05.

Rank the REPROD DPO Helpfulhelpful Gpt3 Subset 1 Modelgemma2b Maxsteps10000 Bz8 Lr1e 05 Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist the ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227