Supported Languages |
|
LLM Name | Contrail 200M 64K |
Repository ๐ค | https://huggingface.co/sudy-super/Contrail-200m-64k |
Model Size | 10b |
Required VRAM | 0.4 GB |
Updated | 2025-04-24 |
Maintainer | sudy-super |
Model Type | mistral |
Model Files | |
Supported Languages | ja en |
Model Architecture | MistralForCausalLM |
License | apache-2.0 |
Context Length | 65536 |
Model Max Length | 65536 |
Transformers Version | 4.38.2 |
Tokenizer Class | GPTNeoXTokenizer |
Padding Token | <|padding|> |
Vocabulary Size | 65024 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Contrail 200M 64K | 64K / 0.4 GB | 0 | 0 |
NarutoDolphin 10B | 32K / 21.5 GB | 370 | 2 |
Sirius 10B | 32K / 21.5 GB | 370 | 1 |
Mistral Passthrough 8L 10B | 32K / 14.5 GB | 413 | 0 |
Occiglot10b DPO | 32K / 19.7 GB | 7 | 1 |
...penbuddy Mistral 10B V17.1 32K | 32K / 21.5 GB | 6 | 5 |
Voldemort 10B DPO | 8K / 21.4 GB | 348 | 0 |
Voldemort 10B | 8K / 21.5 GB | 352 | 0 |
...l 10B V17.1 32K 8.0bpw H8 EXL2 | 32K / 10.9 GB | 2 | 1 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐