Slim Pajama 1B Mqa by mayank-mishra

 ยป  All LLMs  ยป  mayank-mishra  ยป  Slim Pajama 1B Mqa   URL Share it on

Dataset:cerebras/slimpajama-62...   En   Endpoints compatible   Gpt bigcode   License:apache-2.0   Model-index   Region:us   Safetensors

Rank the Slim Pajama 1B Mqa Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  
Slim Pajama 1B Mqa (mayank-mishra/slim-pajama-1b-mqa)

Best Alternatives to Slim Pajama 1B Mqa

Best Alternatives
HF Rank
Slim Pajama 1B Gqa Swiglu0K / 4.6 GB90
Slim Pajama 1B Gqa0K / 4.7 GB100
Slim Pajama 1B Mha0K / 5.3 GB80

Slim Pajama 1B Mqa Parameters and Internals

LLM NameSlim Pajama 1B Mqa
RepositoryOpen on ๐Ÿค— 
Model Size1b
Required VRAM4.5 GB
Model Typegranite
Model Files  4.5 GB
Model ArchitectureGraniteForCausalLM
Model Max Length2048
Transformers Version4.33.1
Tokenizer ClassGPTNeoXTokenizer
Vocabulary Size50304
Initializer Range0.02
Torch Data Typefloat32
Activation Functiongelu_pytorch_tanh
Attention Dropout0.1
Embedding Dropout0.1
Layer Norm Epsilon1.0E-5

What open-source LLMs or SLMs are you in search of? 35526 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20240042001