| LLM Name | Mistral 7B ORPO After SFT With 16 Epochs |
|---|---|
| Repository 🤗 | https://huggingface.co/avemio-digital/Mistral_7B_ORPO_after_SFT_with_16_epochs |
| Model Size | 7B |
| Required VRAM | 14.5 GB |
| Updated | 2024-12-19 |
| Maintainer | avemio-digital |
| Model Type | mistral |
| Model Architecture | MistralForCausalLM |
| Context Length | 32768 |
| Model Max Length | 32768 |
| Transformers Version | 4.40.1 |
| Tokenizer Class | LlamaTokenizer |
| Padding Token | </s> |
| Vocabulary Size | 32832 |
| Torch Data Type | bfloat16 |
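
A minimal loading sketch follows, assuming the standard `transformers` Auto classes resolve this repository correctly; the repo id, dtype, tokenizer class, and VRAM figure come from the table above, while the prompt and generation settings are purely illustrative.

```python
# Minimal sketch for loading the model described above.
# Assumes transformers >= 4.40.1 (the listed version) and accelerate for device_map.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "avemio-digital/Mistral_7B_ORPO_after_SFT_with_16_epochs"

tokenizer = AutoTokenizer.from_pretrained(model_id)  # resolves to LlamaTokenizer
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the listed Torch Data Type
    device_map="auto",           # expect roughly 14.5 GB of VRAM per the table
)

prompt = "Explain ORPO fine-tuning in one sentence."  # illustrative prompt
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```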
| Best Alternatives | Context / RAM | Downloads | Likes |
|---|---|---|---|
| ...Nemo Instruct 2407 Abliterated | 1000K / 24.5 GB | 2192 | 15 |
| MegaBeam Mistral 7B 512K | 512K / 14.4 GB | 3882 | 50 |
| SpydazWeb AI HumanAI RP | 512K / 14.4 GB | 13 | 1 |
| SpydazWeb AI HumanAI 002 | 512K / 14.4 GB | 18 | 1 |
| ...daz Web AI ChatML 512K Project | 512K / 14.5 GB | 12 | 0 |
| MegaBeam Mistral 7B 300K | 282K / 14.4 GB | 3779 | 16 |
| MegaBeam Mistral 7B 300K | 282K / 14.4 GB | 3471 | 16 |
| Hebrew Mistral 7B 200K | 256K / 30 GB | 28399 | 15 |
| Astral 256K 7B V2 | 250K / 14.4 GB | 9 | 0 |
| Astral 256K 7B | 250K / 14.4 GB | 6 | 0 |