LLM Name | MistralLite |
Repository ๐ค | https://huggingface.co/AWS/MistralLite |
Required VRAM | 14.4 GB |
Updated | 2025-04-09 |
Maintainer | AWS |
Model Type | mistral |
Model Files | |
Model Architecture | MistralForCausalLM |
License | apache-2.0 |
Context Length | 32768 |
Model Max Length | 32768 |
Transformers Version | 4.34.0 |
Tokenizer Class | LlamaTokenizer |
Padding Token | [PAD] |
Vocabulary Size | 32003 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Krutrim 2 Instruct | 1000K / 49.3 GB | 1458 | 28 |
Ft V1 Violet | 1000K / 24.5 GB | 6 | 0 |
Devstral Small 2505 Bf16 | 128K / 46.9 GB | 337 | 1 |
Tiny Random MistralForCausalLM | 128K / 0 GB | 6137 | 1 |
Winterreise M7 | 32K / 14.4 GB | 0 | 0 |
Frostwind V2.1 M7 | 32K / 14.4 GB | 0 | 0 |
...ydaz Web AI Reasoner BaseModel | 32K / 14.4 GB | 0 | 1 |
MistralLite | 32K / 14.4 GB | 12979 | 431 |
Mixtral AI Cyber Child | 32K / 14.5 GB | 14 | 1 |
Kheops Textbook Immo2 | 32K / 14.5 GB | 11 | 0 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐