DeepSeek Coder V2 Lite Instruct AWQ By TechxGenus: Benchmarks, Features and Detailed Analysis. Insights on DeepSeek Coder V2 Lite Instruct AWQ.

Arxiv:2401.06066 4-bit Autotrain compatible Awq Codegen Conversational Custom code Deepseek v2 Endpoints compatible Instruct Quantized Region:us Safetensors Sharded Tensorflow

Model Card on HF 🤗: https://huggingface.co/TechxGenus/DeepSeek-Coder-V2-Lite-Instruct-AWQ

DeepSeek Coder V2 Lite Instruct AWQ Benchmarks

LLME Score: 0.19445

^nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").

What is the LLM Explorer Rank (Score)

DeepSeek Coder V2 Lite Instruct AWQ (TechxGenus/DeepSeek-Coder-V2-Lite-Instruct-AWQ)

DeepSeek Coder V2 Lite Instruct AWQ Parameters and Internals

Model Type

Mixture-of-Experts (MoE) code language model

Use Cases

Areas:

code intelligence, mathematical reasoning

Primary Use Cases:

coding tasks, general language tasks

Additional Notes

AWQ quantized version available for DeepSeek-Coder-V2-Lite-Instruct model.

Supported Languages

number_of_languages (,), languages_typical_comments (Expands its support for programming languages from 86 to 338.)

Training Details

Data Sources:

high-quality, multi-source corpus

Data Volume:

6 trillion tokens

Methodology:

Mixture of Experts (MoE) approach

Context Length:

128000

Model Architecture:

Mixture-of-Experts (MoE)

Input Output

Input Format:

Chat completion and code completion

Accepted Modalities:

text

Output Format:

Generated code

LLM Name	DeepSeek Coder V2 Lite Instruct AWQ
Repository 🤗	https://huggingface.co/TechxGenus/DeepSeek-Coder-V2-Lite-Instruct-AWQ
Model Size	2.6b
Required VRAM	9.1 GB
Updated	2025-06-01
Maintainer	TechxGenus
Model Type	deepseek_v2
Instruction-Based	Yes
Model Files	5.0 GB: 1-of-2 4.1 GB: 2-of-2
AWQ Quantization	Yes
Quantization Type	awq
Generates Code	Yes
Model Architecture	DeepseekV2ForCausalLM
License	other
Context Length	163840
Model Max Length	163840
Transformers Version	4.41.2
Tokenizer Class	LlamaTokenizer
Padding Token	<｜end▁of▁sentence｜>
Vocabulary Size	102400
Torch Data Type	float16

Rank the DeepSeek Coder V2 Lite Instruct AWQ Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation
Factuality and Completeness of Knowledge
Censorship and Alignment
Data Analysis and Insight Generation
Text Generation
Text Summarization and Feature Extraction
Code Generation
Multi-Language Support and Translation

What open-source LLMs or SLMs are you in search of? 47770 in total.

Email us: info@extractum.io. Our Privacy Policy | Terms and Conditions | Suggest an improvement.

Our Social Media →

Original data from HuggingFace, OpenCompass and various public git repos.

Release v20241227

Support LLM Explorer

DeepSeek Coder V2 Lite Instruct AWQ by TechxGenus

» All LLMs » TechxGenus » DeepSeek Coder V2 Lite Instruct AWQ URL Share it on

DeepSeek Coder V2 Lite Instruct AWQ Benchmarks

DeepSeek Coder V2 Lite Instruct AWQ Parameters and Internals

Rank the DeepSeek Coder V2 Lite Instruct AWQ Capabilities

What open-source LLMs or SLMs are you in search of? 47770 in total.