Training Details | |
LLM Name | Phi 2 Gpo Renew2 B0.001 0.5ultrafeedback I1 |
Repository 🤗 | https://huggingface.co/DUAL-GPO/phi-2-gpo-renew2-b0.001-0.5ultrafeedback-i1 |
Base Model(s) | |
Required VRAM | 0 GB |
Updated | 2024-11-17 |
Maintainer | DUAL-GPO |
Model Files | |
Model Architecture | Adapter |
License | mit |
Model Max Length | 2048 |
Is Biased | none |
Tokenizer Class | CodeGenTokenizer |
Padding Token | <|endoftext|> |
PEFT Type | LORA |
LoRA Model | Yes |
PEFT Target Modules | v_proj, q_proj, dense, k_proj |
LoRA Alpha | 128 |
LoRA Dropout | 0.05 |
R Param | 128 |
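
The LoRA settings listed above can be read together as a small configuration. The following is a minimal sketch, not the maintainer's training code: the dictionary name and both helper functions are illustrative, the scaling formula assumes standard LoRA (alpha divided by rank, not the rank-stabilized variant), and the layer width of 2560 in the usage lines is a hypothetical value that does not come from the table.

```python
# Values copied from the table above; the dict name and helpers are illustrative.
LORA_CONFIG = {
    "peft_type": "LORA",
    "r": 128,                # "R Param"
    "lora_alpha": 128,       # "LoRA Alpha"
    "lora_dropout": 0.05,    # "LoRA Dropout"
    "target_modules": ["v_proj", "q_proj", "dense", "k_proj"],
}


def lora_scaling(cfg):
    """Scaling applied to the low-rank update in standard LoRA: alpha / r."""
    return cfg["lora_alpha"] / cfg["r"]


def lora_params_per_layer(d_in, d_out, r):
    """Trainable parameters LoRA adds to one linear layer.

    The update is B @ A, where A has shape (r, d_in) and B has shape (d_out, r).
    """
    return r * (d_in + d_out)


if __name__ == "__main__":
    print("scaling:", lora_scaling(LORA_CONFIG))
    # Hypothetical square projection of width 2560 (an assumption, not from the table).
    print("extra params per layer:", lora_params_per_layer(2560, 2560, LORA_CONFIG["r"]))
```

With alpha equal to the rank, the scaling factor is 1.0, so under this assumption the low-rank update is added to the frozen weights without rescaling.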
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Phi 3 Mini 4K Instruct Sa V0.1 | 0K / 0 GB | 8 | 0 |
Samantha Omni Humanlike Lora | 0K / 0 GB | 56 | 3 |
...is Violet Toxic GRPO V0.4 Lora | 0K / 0.5 GB | 12 | 0 |
Reflection Model | 0K / 0.2 GB | 0 | 1 |
SpectraMind | 0K / 16.1 GB | 120 | 3 |
...mall Physics Finetuned Adapter | 0K / 0.1 GB | 8 | 1 |
SpectraMindQ | 0K / 0.2 GB | 8 | 1 |
L3.1 Spark R64 LoRA | 0K / 0.4 GB | 8 | 0 |
Mistral Small Fujin Qlora | 0K / 0.8 GB | 47 | 2 |
Mistral Small Dampf Qlora | 0K / 0.8 GB | 19 | 0 |