LLM Name | 7b DPO Iter1 4e7 Bz32 Step200 Only Onpolicy |
Repository 🤗 | https://huggingface.co/1231czx/7b_dpo_iter1_4e7_bz32_step200_only_onpolicy |
Model Size | 7b |
Required VRAM | 34 GB |
Updated | 2025-01-20 |
Maintainer | 1231czx |
Model Type | gemma |
Model Files | |
Model Architecture | GemmaForCausalLM |
Context Length | 8192 |
Model Max Length | 8192 |
Transformers Version | 4.41.1 |
Tokenizer Class | GemmaTokenizer |
Padding Token | <pad> |
Vocabulary Size | 256000 |
Torch Data Type | float32 |
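
The Required VRAM figure above is consistent with holding the weights in float32 at 4 bytes per parameter. The sketch below works through that arithmetic. It assumes the listed 34 GB counts weights only (no activations or KV cache) and uses decimal gigabytes. Neither assumption is stated in the listing, and the helper names are illustrative. The implied parameter count is inferred from the table, not reported by it.

```python
# Bytes per parameter for common torch dtypes.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "float16": 2, "int8": 1}


def weight_memory_gb(num_params: float, dtype: str) -> float:
    """Approximate memory for the weights alone, in decimal GB (1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def params_from_memory(memory_gb: float, dtype: str) -> float:
    """Invert the estimate: parameter count implied by a weights-only footprint."""
    return memory_gb * 1e9 / BYTES_PER_PARAM[dtype]


# Assumption: the listed 34 GB covers only the float32 weights.
implied_params = params_from_memory(34, "float32")

# The same weights stored in a 2-byte dtype would need about half as much memory.
half_precision_gb = weight_memory_gb(implied_params, "bfloat16")

print(f"implied parameters: {implied_params:.2e}")
print(f"bfloat16 weights:   {half_precision_gb:.1f} GB")
```

Under these assumptions the implied count is about 8.5 billion parameters, and a 2-byte dtype brings the weights to roughly 17 GB, close to the 17.1 GB listed for the alternative models.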
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Kaggle Math Model Gemma V1 | 12K / 17.1 GB | 19 | 0 |
Gemma 1.1 7B It | 8K / 17.1 GB | 13323 | 267 |
SeaLLM 7B V2.5 | 8K / 17.1 GB | 13353 | 49 |
Codegemma 7B It | 8K / 17.1 GB | 10548 | 209 |
...emma 7B Sft Ultrachat SafeRLHF | 8K / 17.1 GB | 57 | 0 |
SauerkrautLM Gemma 7B | 8K / 17.1 GB | 6017 | 13 |
Zephyr 7B Gemma V0.1 | 8K / 17.1 GB | 523 | 121 |
... Codegemma 2 7B It Alpaca V1.3 | 8K / 17.1 GB | 5 | 0 |
Codegemma 7B | 8K / 17.1 GB | 3582 | 175 |
Gemma 7B Sft Ultrachat | 8K / 17.1 GB | 13 | 0 |