Mamba 130M Hf by state-spaces

 ยป  All LLMs  ยป  state-spaces  ยป  Mamba 130M Hf   URL Share it on

  Autotrain compatible   Endpoints compatible   Mamba   Region:us   Safetensors

Mamba 130M Hf Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Mamba 130M Hf (state-spaces/mamba-130m-hf)

Mamba 130M Hf Parameters and Internals

Model Type 
causal language model, transformer
Additional Notes 
The Mamba model is compatible with the 'transformers' library and uses a specialized configuration and tokenizer. The 'peft' library can be used for finetuning.
Input Output 
Input Format:
Text input encoded as token IDs
Accepted Modalities:
text
Output Format:
Text output
Performance Tips:
Use 'causal_conv_1d' and 'mamba-ssm' for optimized CUDA kernel performance.
LLM NameMamba 130M Hf
Repository ๐Ÿค—https://huggingface.co/state-spaces/mamba-130m-hf 
Model Size130m
Required VRAM0.5 GB
Updated2025-02-22
Maintainerstate-spaces
Model Typemamba
Model Files  0.5 GB
Model ArchitectureMambaForCausalLM
Transformers Version4.39.0.dev0
Tokenizer ClassGPTNeoXTokenizer
Padding Token<|endoftext|>
Vocabulary Size50280
Torch Data Typefloat32

Best Alternatives to Mamba 130M Hf

Best Alternatives
Context / RAM
Downloads
Likes
...amba 130M Hf Finetuned Epitope0K / 0.5 GB50
Mamba 130M0K / 0.5 GB1133
Mamba 130M0K / 0.5 GB12612

Rank the Mamba 130M Hf Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227