Starcoder2 15B by bigcode

 ยป  All LLMs  ยป  bigcode  ยป  Starcoder2 15B   URL Share it on

  Arxiv:2004.05150   Arxiv:2205.14135   Arxiv:2207.14255   Arxiv:2305.13245   Arxiv:2402.19173   Autotrain compatible   Code Dataset:bigcode/the-stack-v2-t...   Endpoints compatible   Model-index   Region:us   Safetensors   Sharded   Starcoder2   Tensorflow
Model Card on HF ๐Ÿค—: https://huggingface.co/bigcode/starcoder2-15b 

Starcoder2 15B Benchmarks

Starcoder2 15B (bigcode/starcoder2-15b)

Starcoder2 15B Parameters and Internals

Model Type 
text-generation
Use Cases 
Areas:
Research, Educational
Applications:
Code generation, Assisting developers
Primary Use Cases:
Text generation based on programming language inputs
Limitations:
May not work with instruction-based commands, Generated code might contain bugs, Not guaranteed to work as intended
Additional Notes 
Requires proper attribution when utilizing generated code from the model due to licenses.
Supported Languages 
600+ programming languages ()
Training Details 
Data Sources:
The Stack v2, Arxiv, Wikipedia
Data Volume:
4+ trillion tokens
Methodology:
Grouped Query Attention, Sliding Window Attention, Fill-in-the-Middle objective
Context Length:
16384
Hardware Used:
NVIDIA DGX H100, 1024 x H100 GPUs
Model Architecture:
Transformer decoder
Input Output 
Input Format:
Text prompt
Output Format:
Generated text/code
LLM NameStarcoder2 15B
Repository ๐Ÿค—https://huggingface.co/bigcode/starcoder2-15b 
Model Size15b
Required VRAM63.8 GB
Updated2025-04-19
Maintainerbigcode
Model Typestarcoder2
Model Files  4.6 GB: 1-of-14   4.6 GB: 2-of-14   4.6 GB: 3-of-14   4.6 GB: 4-of-14   4.6 GB: 5-of-14   4.6 GB: 6-of-14   4.6 GB: 7-of-14   4.6 GB: 8-of-14   4.6 GB: 9-of-14   4.6 GB: 10-of-14   4.6 GB: 11-of-14   4.6 GB: 12-of-14   4.6 GB: 13-of-14   4.0 GB: 14-of-14
Model ArchitectureStarcoder2ForCausalLM
Licensebigcode-openrail-m
Context Length16384
Model Max Length16384
Transformers Version4.37.0.dev0
Tokenizer ClassGPT2Tokenizer
Vocabulary Size49152
Torch Data Typefloat32

Quantized Models of the Starcoder2 15B

Model
Likes
Downloads
VRAM
Starcoder2 15B AWQ129 GB
StarCoder2 15B GGUF257746 GB

Best Alternatives to Starcoder2 15B

Best Alternatives
Context / RAM
Downloads
Likes
Starchat2 15B V0.116K / 31.9 GB6218110
Starcoder2 15B Instruct V0.116K / 31.9 GB762101
CodeFuse StarCoder2 15B16K / 31.9 GB122
Starcoder2 15B Finetuned Drake16K / 63.8 GB50
...aceH4 Starchat2 15B V0.1 4bits16K / 9.9 GB50
Starcoder2 15B Instruct V0.116K / 53.4 GB50
Dolphincoder Starcoder2 15B16K / 31.9 GB12769
Starchat2 15B Sft V0.116K / 31.9 GB25
Starcoder2 15B Instruct16K / 31.9 GB47
OpenCodeInterpreter SC2 15B16K / 31.9 GB34

Rank the Starcoder2 15B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 46490 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227