Starcoder GPTQ by TheBloke


  Arxiv:1911.02150   Arxiv:2205.14135   Arxiv:2207.14255   Arxiv:2305.06161   4-bit   Autotrain compatible   Code   Codegen Dataset:bigcode/the-stack-dedu...   Gpt bigcode   Gptq   Model-index   Quantized   Region:us   Safetensors
Model Card on HF: https://huggingface.co/TheBloke/starcoder-GPTQ

Starcoder GPTQ Parameters and Internals

Model Type 
text-generation
Use Cases 
Primary Use Cases:
Technical assistant, when used with the Tech Assistant prompt
Limitations:
The model is not an instruction model; commands like 'Write a function that computes the square root.' do not work well.
Additional Notes 
The model was trained on GitHub code and is primarily designed for code generation, not general text generation.
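
Because the model continues code rather than following commands, a request is usually better phrased as the start of a source file. The sketch below is one hypothetical way to do that: the helper name `completion_prompt` and the prompt layout are illustrative assumptions, not part of the model card.

```python
def completion_prompt(signature: str, docstring: str, indent: str = "    ") -> str:
    """Rephrase a task as a function header plus docstring for the model to continue.

    Hypothetical helper: the layout is an assumption, chosen because a
    code-completion model tends to continue code that looks like real source.
    """
    # The prompt ends on an indented, empty line so the model writes the body next.
    return f'{signature}\n{indent}"""{docstring}"""\n{indent}'


if __name__ == "__main__":
    # Instead of "Write a function that computes the square root."
    print(completion_prompt(
        "def square_root(x: float) -> float:",
        "Return the square root of x.",
    ))
```

The resulting string would be passed to the model as the text to continue; the generated tokens are then the function body.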
Supported Languages 
More than 80 programming languages
Training Details 
Data Sources:
The Stack (v1.2) with opt-out requests excluded
Data Volume:
1 trillion tokens
Methodology:
Fill-in-the-Middle objective, Multi Query Attention
Context Length:
8192 tokens
Training Time:
24 days
Hardware Used:
512 Tesla A100 GPUs
Model Architecture:
GPT-2 model with multi-query attention and Fill-in-the-Middle objective
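
The Fill-in-the-Middle objective listed above means the model can be asked to fill a gap between a known prefix and suffix. A minimal sketch of how such a prompt might be assembled follows. The sentinel strings `<fim_prefix>`, `<fim_suffix>`, `<fim_middle>` and `<|endoftext|>` are assumed from the upstream StarCoder documentation and should be checked against the tokenizer shipped with this repository; the helper names are hypothetical.

```python
# Sentinel tokens assumed from the upstream StarCoder tokenizer (verify before use).
FIM_PREFIX = "<fim_prefix>"
FIM_SUFFIX = "<fim_suffix>"
FIM_MIDDLE = "<fim_middle>"
END_OF_TEXT = "<|endoftext|>"


def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Arrange prefix and suffix so the model generates the missing middle."""
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"


def extract_middle(generated: str) -> str:
    """Pull the infilled text out of a decoded generation that echoes the prompt."""
    # Everything after the middle sentinel is the model's proposal for the gap.
    middle = generated.split(FIM_MIDDLE, 1)[-1]
    # Stop at the end-of-text marker if the model emitted one.
    return middle.split(END_OF_TEXT, 1)[0]


if __name__ == "__main__":
    prompt = build_fim_prompt("def add(a, b):\n    ", "\n    return result")
    print(prompt)
```

The prefix-suffix-middle ordering shown here is only one possible arrangement; it is used in this sketch because it keeps the generated span at the end of the sequence.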
Input Output 
Input Format:
Token sequence input
Accepted Modalities:
text
Output Format:
Token sequence output
Performance Tips:
Use the Tech Assistant prompt for better results.
LLM Name: Starcoder GPTQ
Repository: https://huggingface.co/TheBloke/starcoder-GPTQ
Model Size: 2.6b
Required VRAM: 8.9 GB
Updated: 2024-12-14
Maintainer: TheBloke
Model Type: gpt_bigcode
Model Files: 8.9 GB
GPTQ Quantization: Yes
Quantization Type: gptq
Generates Code: Yes
Model Architecture: GPTBigCodeForCausalLM
License: bigcode-openrail-m
Transformers Version: 4.28.1
Tokenizer Class: GPT2Tokenizer
Vocabulary Size: 49152
Torch Data Type: float32
Activation Function: gelu
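
The memory figures above can be sanity-checked with back-of-the-envelope arithmetic. The sketch below estimates the size of 4-bit weights and compares the key/value cache under multi-query attention with a conventional multi-head layout. Only the 4-bit width and the 8192-token context come from this page. The parameter count (15.5 billion), layer count (40), head count (48) and head dimension (128) are assumptions taken from the upstream StarCoder release, and the estimate ignores quantization scales, unquantized layers and activation memory.

```python
def weight_bytes(n_params: float, bits_per_weight: int) -> float:
    """Raw storage for the weights alone, ignoring quantization metadata."""
    return n_params * bits_per_weight / 8


def kv_cache_bytes(seq_len: int, n_layers: int, n_kv_heads: int,
                   head_dim: int, bytes_per_value: int = 2) -> int:
    """Key plus value cache for one sequence (factor 2 = keys and values)."""
    return 2 * seq_len * n_layers * n_kv_heads * head_dim * bytes_per_value


# Assumed upstream StarCoder configuration (not stated on this page).
N_PARAMS = 15.5e9
N_LAYERS = 40
N_HEADS = 48
HEAD_DIM = 128
CONTEXT = 8192  # from the Context Length field

if __name__ == "__main__":
    gb = weight_bytes(N_PARAMS, 4) / 1e9
    mqa = kv_cache_bytes(CONTEXT, N_LAYERS, 1, HEAD_DIM)        # one shared K/V head
    mha = kv_cache_bytes(CONTEXT, N_LAYERS, N_HEADS, HEAD_DIM)  # one K/V head per query head
    print(f"4-bit weights: ~{gb:.2f} GB")
    print(f"KV cache at full context, multi-query: {mqa / 2**20:.0f} MiB")
    print(f"KV cache at full context, multi-head:  {mha / 2**30:.1f} GiB")
```

Under these assumptions the raw 4-bit weights come to roughly 7.75 GB, which is in the same range as the 8.9 GB file size listed above, and sharing a single key/value head shrinks the cache by a factor equal to the number of query heads.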

Best Alternatives to Starcoder GPTQ

Best Alternatives     Context / RAM     Downloads / Likes
Starchat Beta GPTQ    0K / 8.9 GB       1727
Starcoderplus GPTQ    0K / 8.9 GB       1625

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124