Deepseek Coder 1.3B Base GPTQ by TheBloke

 »  All LLMs  »  TheBloke  »  Deepseek Coder 1.3B Base GPTQ   URL Share it on

  4-bit   Autotrain compatible Base model:deepseek-ai/deepsee... Base model:quantized:deepseek-...   Codegen   Gptq   Llama   Quantized   Region:us   Safetensors

Deepseek Coder 1.3B Base GPTQ Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Deepseek Coder 1.3B Base GPTQ (TheBloke/deepseek-coder-1.3b-base-GPTQ)

Deepseek Coder 1.3B Base GPTQ Parameters and Internals

Model Type 
deepseek
Use Cases 
Areas:
research, commercial applications
Applications:
code completion, project-level code completion, code infilling
Primary Use Cases:
Advanced Code Completion, Repository Level Code Completion
Additional Notes 
Quantization performed by TheBloke with multiple GPTQ parameter permutations provided
Supported Languages 
English (High proficiency), Chinese (High proficiency)
Training Details 
Data Sources:
project-level code corpus
Data Volume:
2 trillion tokens
Context Length:
16000
Model Architecture:
Multi-Head Attention
LLM NameDeepseek Coder 1.3B Base GPTQ
Repository 🤗https://huggingface.co/TheBloke/deepseek-coder-1.3b-base-GPTQ 
Model NameDeepseek Coder 1.3B Base
Model CreatorDeepSeek
Base Model(s)  deepseek-ai/deepseek-coder-1.3b-base   deepseek-ai/deepseek-coder-1.3b-base
Model Size1.3b
Required VRAM0.9 GB
Updated2025-02-05
MaintainerTheBloke
Model Typedeepseek
Model Files  0.9 GB
GPTQ QuantizationYes
Quantization Typegptq
Generates CodeYes
Model ArchitectureLlamaForCausalLM
Licenseother
Context Length16384
Model Max Length16384
Transformers Version4.35.0
Tokenizer ClassLlamaTokenizerFast
Beginning of Sentence Token<|begin▁of▁sentence|>
End of Sentence Token<|end▁of▁sentence|>
Vocabulary Size32256
Torch Data Typebfloat16

Best Alternatives to Deepseek Coder 1.3B Base GPTQ

Best Alternatives
Context / RAM
Downloads
Likes
...pseek Coder 1.3B Instruct GPTQ16K / 0.9 GB1566
Deepseek Coder 1.3B Instruct16K / 2.7 GB34556107
...c Deepseek Coder 1.3B Instruct16K / 5.4 GB1320
CursorCore DS 1.3B LC16K / 2.7 GB1200
CursorCore DS 1.3B SR16K / 2.7 GB1200
CursorCore DS 1.3B16K / 2.7 GB1180
Llm4decompile 1.3B V216K / 2.7 GB4386
Speechless Coder Ds 1.3B16K / 2.7 GB12430
Deepseek Coder 1.3B Base16K / 2.7 GB5242577
Hpc Coder V2.1.3B16K / 2.7 GB984
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/deepseek-coder-1.3b-base-GPTQ.

Rank the Deepseek Coder 1.3B Base GPTQ Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42577 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227