Mamba 2.8B Ultrachat Hf by han1997

  Autotrain compatible   Endpoints compatible   Mamba   Region:us   Safetensors   Sharded   Tensorflow

Mamba 2.8B Ultrachat Hf Parameters and Internals

Model Type: Causal LM
Additional Notes: This model has been prepared for use with the 'transformers' library. Until 'transformers' version 4.39.0 is released, the library must be installed from the main branch. The additional packages 'causal-conv1d' and 'mamba-ssm' provide optimized CUDA kernel support.
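The notes above can be turned into a short loading sketch. This is a minimal example, assuming a 'transformers' build that ships `MambaForCausalLM` and enough memory for the float32 weights. The `generate_reply` helper and the prompt text are illustrative names, not part of the listing. If 'causal-conv1d' and 'mamba-ssm' are not installed, 'transformers' should fall back to a slower implementation without the optimized CUDA kernels.

```python
MODEL_ID = "han1997/mamba-2.8b-ultrachat-hf"  # repository name from the listing


def generate_reply(prompt: str, max_new_tokens: int = 64) -> str:
    """Load the checkpoint and generate a continuation for `prompt`.

    Hypothetical helper for illustration only. Calling it downloads
    roughly 11 GB of sharded safetensors weights.
    """
    # Imported lazily so this file can be loaded without the heavy dependencies.
    from transformers import AutoTokenizer, MambaForCausalLM

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = MambaForCausalLM.from_pretrained(MODEL_ID)

    # Tokenize the prompt into PyTorch tensors and generate new tokens.
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(inputs["input_ids"], max_new_tokens=max_new_tokens)

    # Decode the first (and only) sequence in the batch back to text.
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (commented out because it triggers the download):
# print(generate_reply("Hey, how are you doing?"))
```

The lazy import keeps the heavy dependency out of module load time, which is a convenience choice for this sketch rather than a requirement of the library.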
LLM Name: Mamba 2.8B Ultrachat Hf
Repository: https://huggingface.co/han1997/mamba-2.8b-ultrachat-hf
Model Size: 2.8b
Required VRAM: 11.1 GB
Updated: 2025-02-22
Maintainer: han1997
Model Type: mamba
Model Files: 5.0 GB (1-of-3), 5.0 GB (2-of-3), 1.1 GB (3-of-3)
Model Architecture: MambaForCausalLM
License: apache-2.0
Transformers Version: 4.40.0.dev0
Tokenizer Class: GPTNeoXTokenizer
Vocabulary Size: 50280
Torch Data Type: float32
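The VRAM and file-size figures above are roughly what the parameter count and data type imply. The sketch below checks that arithmetic under stated assumptions: a nominal 2.8 billion parameters (the listing gives only "2.8b"), 1 GB taken as 10^9 bytes, and weights only, with no allowance for activations or recurrent state. The listing does not say how its "Required VRAM" value is computed, so this is a sanity check and not the site's formula.

```python
# Bytes used to store one parameter for common torch data types.
BYTES_PER_DTYPE = {"float32": 4, "float16": 2, "bfloat16": 2}


def weight_size_gb(num_params: float, dtype: str) -> float:
    """Approximate size of the weights in GB (10^9 bytes)."""
    return num_params * BYTES_PER_DTYPE[dtype] / 1e9


# Shard sizes as listed under "Model Files".
shard_sizes_gb = [5.0, 5.0, 1.1]
total_shards_gb = round(sum(shard_sizes_gb), 1)

# Nominal parameter count; the exact count is not given in the listing.
estimate_fp32_gb = weight_size_gb(2.8e9, "float32")
estimate_fp16_gb = weight_size_gb(2.8e9, "float16")

print(f"Sum of shards:    {total_shards_gb} GB")
print(f"float32 estimate: {estimate_fp32_gb:.1f} GB")
print(f"float16 estimate: {estimate_fp16_gb:.1f} GB")
```

The shard total matches the listed 11.1 GB, and the float32 estimate lands within about 0.1 GB of it. The small gap is expected because "2.8b" is a rounded figure.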

Best Alternatives to Mamba 2.8B Ultrachat Hf

Best Alternatives        Context / RAM    Downloads / Likes
Mamba 2.8B Hf            0K / 11.1 GB     478398
Clinicalmamba 2.8B Hf    0K / 11.1 GB     742
Mamba 2.8B Slimpj Hf     0K / 11.1 GB     220
Mamba 2.8B Zephyr Hf     0K / 11.1 GB     190
Mamba 2.8B               0K / 5.6 GB      1211
Mamba 2.8B Slimpj        0K / 5.6 GB      820
Mamba Ko 2.8B            0K / 5.8 GB      4318
Mamba 2.8B Hf GGUF       0K / 1.4 GB      160

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227