Training Details |
|
LLM Name | Phi 2 Gpo Renew2 B0.001 0.5ultrafeedback LowLr I1 |
Repository ๐ค | https://huggingface.co/DUAL-GPO/phi-2-gpo-renew2-b0.001-0.5ultrafeedback-lowLr-i1 |
Base Model(s) | |
Required VRAM | 0 GB |
Updated | 2024-09-13 |
Maintainer | DUAL-GPO |
Model Files | |
Model Architecture | Adapter |
License | mit |
Model Max Length | 2048 |
Is Biased | none |
Tokenizer Class | CodeGenTokenizer |
Padding Token | <|endoftext|> |
PEFT Type | LORA |
LoRA Model | Yes |
PEFT Target Modules | v_proj|k_proj|dense|q_proj |
LoRA Alpha | 128 |
LoRA Dropout | 0.05 |
R Param | 128 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
SpectraMind | 0K / 16.1 GB | 223 | 2 |
Reflection Model | 0K / 0.2 GB | 0 | 1 |
SpectraMindQ | 0K / 0.2 GB | 30 | 1 |
Mistral Small Fujin Qlora | 0K / 0.8 GB | 16 | 1 |
Ll3 C3 Lora New | 0K / 0 GB | 7 | 0 |
Zephyr Phi 1 5 Sft Qlora | 0K / 0 GB | 5 | 0 |
Phi Openllm Lb Test | 0K / 0 GB | 5 | 0 |
Hua V0.1 | 0K / 0 GB | 6 | 0 |
CodeHAWK | 0K / 0 GB | 8 | 0 |
...ew2 B0.001 0.5ultrafeedback I1 | 0K / 0 GB | 5 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐