LongMamba 16384 Bs128 Step400 by PY007

Tags: Endpoints compatible, PyTorch, Region: us

LongMamba 16384 Bs128 Step400 Parameters and Internals

LLM Name: LongMamba 16384 Bs128 Step400
Repository: PY007/LongMamba_16384_bs128_step400 (Hugging Face)
Required VRAM: 11.6 GB
Updated: 2024-07-27
Maintainer: PY007
Model Files: 11.6 GB
Model Architecture: AutoModel
Vocabulary Size: 50277

Best Alternatives to LongMamba 16384 Bs128 Step400

Model | HF Rank | Context / RAM | Downloads | Likes
Distil Longformer Base 4096 | 0.1 | 4K / 0.4 GB | 23 | 0
Daedalus 1 | 0.2 | 1K / GB | 13 | 1
Tiny Random Detr | | 1K / 0.2 GB | 7 | 0
Opengpt2 Pytorch Backward | | 1K / 6 GB | 52 | 1
Opengpt2 Pytorch Forward | | 1K / 6 GB | 7 | 1
Finsent Transformer | | 0.5K / 0.4 GB | 7 | 1
Coref Roberta Large | | 0.5K / 1.4 GB | 6 | 1
Simbert Chinese Tiny | 0.2 | 0.5K / 0 GB | 17 | 0
Bert Chinese L 12 H 768 A 12 | 0.2 | 0.5K / 0.4 GB | 7 | 1
Simbert Chinese Base | 0.2 | 0.5K / 0.4 GB | 6 | 0

Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024072501