| LLM Name | Mos Mamba 6x130m Hf |
|---|---|
| Repository 🤗 | https://huggingface.co/jonathanjordan21/mos-mamba-6x130m-hf |
| Model Size | 144M |
| Required VRAM | 0.6 GB |
| Updated | 2025-02-22 |
| Maintainer | jonathanjordan21 |
| Model Type | MoSMamba |
| Model Files | |
| Model Architecture | MoSMambaForCausalLM |
| Transformers Version | 4.41.2 |
| Tokenizer Class | GPTNeoXTokenizer |
| Padding Token | `<\|endoftext\|>` |
| Vocabulary Size | 50280 |
| Torch Data Type | float32 |
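A minimal loading sketch based on the metadata above. Because `MoSMambaForCausalLM` is a custom architecture rather than a class bundled with transformers 4.41.2, this assumes the repository ships its own modeling code and must be loaded with `trust_remote_code=True`; the prompt and generation settings are placeholders.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "jonathanjordan21/mos-mamba-6x130m-hf"

# AutoTokenizer resolves to the GPTNeoX tokenizer declared in the repo config.
tokenizer = AutoTokenizer.from_pretrained(repo_id)

model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.float32,  # matches the checkpoint's declared dtype
    trust_remote_code=True,     # assumption: custom MoSMamba code lives in the repo
)

# Placeholder prompt for a quick smoke test.
inputs = tokenizer("The quick brown fox", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

At roughly 144M parameters in float32, the checkpoint should fit within the listed 0.6 GB of VRAM, so no quantization or device offloading is needed for inference.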