LLM Name | Saiga Phi3 Medium Sft M1 D2 |
Repository ๐ค | https://huggingface.co/IlyaGusev/saiga_phi3_medium_sft_m1_d2 |
Base Model(s) | |
Model Size | 14b |
Required VRAM | 28 GB |
Updated | 2025-02-22 |
Maintainer | IlyaGusev |
Model Type | mistral |
Model Files | |
Model Architecture | MistralForCausalLM |
Context Length | 4096 |
Model Max Length | 4096 |
Transformers Version | 4.42.0.dev0 |
Tokenizer Class | LlamaTokenizer |
Padding Token | <s> |
Vocabulary Size | 32064 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
...ral Nemo Instruct 14B Merge V1 | 1000K / 24.6 GB | 19 | 0 |
K2S3 14B V0.2 | 32K / 28.7 GB | 28 | 0 |
Wendigo 14B Alpha4 | 32K / 28.4 GB | 1288 | 0 |
Qwen1.5 14B Chat Mistral | 32K / 28.6 GB | 20 | 2 |
Mistral 14B Merge Base | 32K / 28.4 GB | 2006 | 2 |
Synthetic Minstrel 14B | 32K / 27.6 GB | 22 | 3 |
Wandering Minstrel 14B | 32K / 27.6 GB | 11 | 3 |
Barcenas 14B Phi 3 Medium ORPO | 4K / 28 GB | 5643 | 5 |
SauerkrautLM Phi 3 Medium | 4K / 28 GB | 5551 | 9 |
...2.9.2 Phi 3 Medium Abliterated | 4K / 28 GB | 3935 | 17 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐