Starcoder2 15B by bigcode


Tags: Arxiv:2004.05150, Arxiv:2205.14135, Arxiv:2207.14255, Arxiv:2305.13245, Arxiv:2402.19173, Autotrain compatible, Code, Dataset:bigcode/the-stack-v2-t..., Endpoints compatible, Model-index, Region:us, Safetensors, Sharded, Starcoder2, Tensorflow
Model Card on HF: https://huggingface.co/bigcode/starcoder2-15b

Starcoder2 15B Parameters and Internals

Model Type 
text-generation
Use Cases 
Areas:
Research, Educational
Applications:
Code generation, Assisting developers
Primary Use Cases:
Text generation based on programming language inputs
Limitations:
May not respond well to instruction-style prompts; generated code may contain bugs and is not guaranteed to work as intended
Additional Notes 
Using code generated by the model requires proper attribution because of the licenses involved.
Supported Languages 
600+ programming languages
Training Details 
Data Sources:
The Stack v2, Arxiv, Wikipedia
Data Volume:
4+ trillion tokens
Methodology:
Grouped Query Attention, Sliding Window Attention, Fill-in-the-Middle objective
Context Length:
16384
Hardware Used:
NVIDIA DGX H100, 1024 x H100 GPUs
Model Architecture:
Transformer decoder
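
The sliding-window attention listed under Methodology limits each token to a fixed number of recent positions instead of the full preceding context. The sketch below builds such a mask in plain Python. The helper name `sliding_window_mask` is hypothetical, the window size is a toy value rather than the model's configured one, and conventions differ on whether the window counts the current token (here it does).

```python
def sliding_window_mask(seq_len: int, window: int) -> list[list[bool]]:
    """Return mask[i][j] == True when query position i may attend to key position j.

    Causal: a position never sees later positions (j <= i).
    Sliding window: only the most recent `window` positions, counting
    position i itself, are visible.
    """
    if seq_len < 0 or window < 1:
        raise ValueError("seq_len must be >= 0 and window must be >= 1")
    return [
        [(j <= i) and (i - j < window) for j in range(seq_len)]
        for i in range(seq_len)
    ]


if __name__ == "__main__":
    # Print a small mask; "x" marks a visible position, "." a masked one.
    for row in sliding_window_mask(seq_len=6, window=3):
        print("".join("x" if visible else "." for visible in row))
```

Each row contains at most `window` visible positions, so attention cost per token stays bounded even as the sequence approaches the 16384-token context length.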
Input Output 
Input Format:
Text prompt
Output Format:
Generated text/code
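
Because the model was trained with a Fill-in-the-Middle objective, a text prompt can in principle describe a gap between two pieces of code rather than only a prefix. The sketch below assembles such a prompt in prefix-suffix-middle order. The sentinel strings and both helper names are assumptions made for illustration; check the tokenizer's special tokens before relying on them.

```python
# Sentinel strings are assumed for illustration; confirm them against the
# tokenizer's special tokens before use.
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Arrange the code before and after a gap in prefix-suffix-middle order.

    The model is expected to continue after the final sentinel with the
    text that belongs in the gap.
    """
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"


def splice_completion(prefix: str, middle: str, suffix: str) -> str:
    """Reassemble the full source once a middle section has been generated."""
    return prefix + middle + suffix


if __name__ == "__main__":
    before = "def add(a, b):\n    return "
    after = "\n\nprint(add(1, 2))\n"
    print(build_fim_prompt(before, after))
```

The prompt string would be passed to the model as ordinary text input; the generated continuation is then spliced back between the prefix and suffix.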
LLM Name: Starcoder2 15B
Repository: https://huggingface.co/bigcode/starcoder2-15b
Model Size: 15b
Required VRAM: 63.8 GB
Updated: 2025-02-22
Maintainer: bigcode
Model Type: starcoder2
Model Files: 14 shards; 1-of-14 through 13-of-14 are 4.6 GB each, 14-of-14 is 4.0 GB
Model Architecture: Starcoder2ForCausalLM
License: bigcode-openrail-m
Context Length: 16384
Model Max Length: 16384
Transformers Version: 4.37.0.dev0
Tokenizer Class: GPT2Tokenizer
Vocabulary Size: 49152
Torch Data Type: float32
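
The Required VRAM figure can be cross-checked against the Model Files sizes and the float32 data type. The sketch below sums the listed shards, backs out an approximate parameter count, and estimates weight memory at other precisions. It assumes 1 GB = 10^9 bytes and that the shards hold only weights; the function names are hypothetical, and the estimate ignores activations and the attention cache.

```python
# Shard sizes as listed under Model Files: thirteen 4.6 GB shards and one 4.0 GB shard.
SHARD_SIZES_GB = [4.6] * 13 + [4.0]

# Storage cost per parameter for common data types.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "bfloat16": 2, "int8": 1}


def total_checkpoint_gb(shards: list[float]) -> float:
    """Sum the shard sizes, rounded to one decimal like the listing."""
    return round(sum(shards), 1)


def approx_param_count(checkpoint_gb: float, dtype: str = "float32") -> float:
    """Estimate the parameter count from checkpoint size (assumes 1 GB = 1e9 bytes)."""
    return checkpoint_gb * 1e9 / BYTES_PER_PARAM[dtype]


def approx_weights_gb(params: float, dtype: str) -> float:
    """Estimate the memory needed for the weights alone at a given precision."""
    return params * BYTES_PER_PARAM[dtype] / 1e9


if __name__ == "__main__":
    total = total_checkpoint_gb(SHARD_SIZES_GB)
    params = approx_param_count(total, "float32")
    print(f"checkpoint size: {total} GB")
    print(f"approx. parameters: {params / 1e9:.2f} billion")
    for dtype in ("float32", "bfloat16", "int8"):
        print(f"{dtype}: {approx_weights_gb(params, dtype):.1f} GB")
```

Under these assumptions the shards add up to the listed 63.8 GB, and halving the bytes per parameter gives about 31.9 GB, the figure shown for several derived models in the alternatives table. That those models store 16-bit weights is an inference, not something the listing states.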

Quantized Models of the Starcoder2 15B

Model   Likes / Downloads / VRAM
Starcoder2 15B AWQ   1419 GB
StarCoder2 15B GGUF   249556 GB

Best Alternatives to Starcoder2 15B

Best Alternatives   Context / RAM   Downloads / Likes
Starchat2 15B V0.1   16K / 31.9 GB   15029112
Starcoder2 15B Instruct V0.1   16K / 31.9 GB   1273101
CodeFuse StarCoder2 15B   16K / 31.9 GB   112
Starcoder2 15B Finetuned Drake   16K / 63.8 GB   50
...aceH4 Starchat2 15B V0.1 4bits   16K / 9.9 GB   70
Starcoder2 15B Instruct V0.1   16K / 53.4 GB   110
Dolphincoder Starcoder2 15B   16K / 31.9 GB   14869
Starchat2 15B Sft V0.1   16K / 31.9 GB   185
Starcoder2 15B Instruct   16K / 31.9 GB   227
Opencsg Starcoder2 15B V0.1   16K / 31.9 GB   312

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227