Model Type |
| ||||||
Additional Notes |
| ||||||
Training Details |
| ||||||
Input Output |
|
LLM Name | TinyMistral 248M V2.5 |
Repository ๐ค | https://huggingface.co/Locutusque/TinyMistral-248M-v2.5 |
Merged Model | Yes |
Model Size | 248m |
Required VRAM | 1 GB |
Updated | 2024-12-22 |
Maintainer | Locutusque |
Model Type | mistral |
Model Files | |
Supported Languages | en code |
Model Architecture | MistralForCausalLM |
License | apache-2.0 |
Context Length | 32768 |
Model Max Length | 32768 |
Transformers Version | 4.36.2 |
Tokenizer Class | LlamaTokenizer |
Padding Token | <|endoftext|> |
Vocabulary Size | 32005 |
Torch Data Type | float16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
TinyMistral 248M V2 | 32K / 1 GB | 1405 | 17 |
TinyMistral 248M V2.5 Instruct | 32K / 1 GB | 27 | 11 |
...al V2.5 MiniPile Guidelines E1 | 32K / 0.6 GB | 36 | 2 |
TinyMistral V2 Test1 | 32K / 1 GB | 18 | 1 |
TinyMistral 248M 8bits | 32K / 0.3 GB | 18 | 1 |
Tinymistv1 | 32K / 0.5 GB | 17 | 0 |
TinyMistral 248M Instruct | 32K / 1 GB | 28 | 11 |
TinyMistral Haiku | 32K / 1 GB | 16 | 0 |
...istral 248M V2.5 Instruct Orpo | 32K / 0.5 GB | 19 | 0 |
TinyMistral 248M V3 | 32K / 0.5 GB | 243 | 5 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐