LLM Name | 7b DPO Iter1 4e7 Bz32 Step200 Only Onpolicy |
Repository 🤗 | https://huggingface.co/1231czx/7b_dpo_iter1_4e7_bz32_step200_only_onpolicy |
Model Size | 7b |
Required VRAM | 34 GB |
Updated | 2025-02-01 |
Maintainer | 1231czx |
Model Type | gemma |
Model Architecture | GemmaForCausalLM |
Context Length | 8192 |
Model Max Length | 8192 |
Transformers Version | 4.41.1 |
Tokenizer Class | GemmaTokenizer |
Padding Token | <pad> |
Vocabulary Size | 256000 |
Torch Data Type | float32 |
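
The Required VRAM figure above is consistent with storing every weight in float32, and the 17.1 GB listed for the alternatives below is consistent with 16-bit weights. A minimal sketch of that arithmetic follows. The parameter count `ASSUMED_PARAM_COUNT` is an assumption (roughly what Gemma 7B checkpoints contain, not a value stated in this listing), and the estimate covers weights only, not activations or KV cache.

```python
# Bytes needed to store one parameter for common torch dtypes.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "float16": 2, "int8": 1}

# Assumption: approximate parameter count of a Gemma 7B checkpoint.
# This number does not appear in the listing; it is used for illustration.
ASSUMED_PARAM_COUNT = 8.54e9


def weight_memory_gb(param_count: float, dtype: str) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes)."""
    return param_count * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    # Compare the dtype listed for this model with a 16-bit alternative.
    for dtype in ("float32", "bfloat16"):
        gb = weight_memory_gb(ASSUMED_PARAM_COUNT, dtype)
        print(f"{dtype}: about {gb:.1f} GB for weights")
```

Under this assumption, loading the checkpoint in a 16-bit dtype would roughly halve the weight footprint relative to the float32 figure listed here.
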
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Kaggle Math Model Gemma V1 | 12K / 17.1 GB | 5 | 0 |
Gemma 1.1 7B It | 8K / 17.1 GB | 20635 | 270 |
SeaLLM 7B V2.5 | 8K / 17.1 GB | 14452 | 49 |
Codegemma 7B It | 8K / 17.1 GB | 11314 | 211 |
SauerkrautLM Gemma 7B | 8K / 17.1 GB | 5882 | 13 |
... Codegemma 2 7B It Alpaca V1.3 | 8K / 17.1 GB | 5 | 0 |
Zephyr 7B Gemma V0.1 | 8K / 17.1 GB | 825 | 122 |
...emma 7B Sft Ultrachat SafeRLHF | 8K / 17.1 GB | 58 | 0 |
Codegemma 7B | 8K / 17.1 GB | 4008 | 176 |
Gemma 7B Sft Ultrachat | 8K / 17.1 GB | 16 | 0 |