CodeFuse DeepSeek 33B by codefuse-ai

 »  All LLMs  »  codefuse-ai  »  CodeFuse DeepSeek 33B   URL Share it on

  Autotrain compatible   Conversational   Endpoints compatible   Llama   Pytorch   Region:us   Sharded

CodeFuse DeepSeek 33B Benchmarks

CodeFuse DeepSeek 33B (codefuse-ai/CodeFuse-DeepSeek-33B)

CodeFuse DeepSeek 33B Parameters and Internals

Model Type 
code-generation
Training Details 
Methodology:
QLoRA
Input Output 
Input Format:
Concatenated string in training data format
Accepted Modalities:
text
Output Format:
Generated code
Performance Tips:
Ensure input string ends with '\ bot' for generating answers.
Release Notes 
Date:
2024-01-12
Notes:
Released with pass@1 score of 78.65% on HumanEval.
LLM NameCodeFuse DeepSeek 33B
Repository 🤗https://huggingface.co/codefuse-ai/CodeFuse-DeepSeek-33B 
Model Size33b
Required VRAM66.5 GB
Updated2025-02-05
Maintainercodefuse-ai
Model Typellama
Model Files  9.7 GB: 1-of-7   9.9 GB: 2-of-7   9.9 GB: 3-of-7   9.8 GB: 4-of-7   9.9 GB: 5-of-7   9.9 GB: 6-of-7   7.4 GB: 7-of-7
Model ArchitectureLlamaForCausalLM
Licenseother
Context Length16384
Model Max Length16384
Transformers Version4.34.1
Tokenizer ClassLlamaTokenizerFast
Beginning of Sentence Token<|begin▁of▁sentence|>
End of Sentence Token<|end▁of▁sentence|>
Vocabulary Size32256
Torch Data Typebfloat16

Quantized Models of the CodeFuse DeepSeek 33B

Model
Likes
Downloads
VRAM
CodeFuse DeepSeek 33B 4bits102518 GB

Best Alternatives to CodeFuse DeepSeek 33B

Best Alternatives
Context / RAM
Downloads
Likes
...angled Llama 33M 32K Base V0.132K / 0.1 GB201
ReflectionCoder DS 33B16K / 67 GB41624
Chronos Divergence 33B16K / 65 GB2629
Deepseek Wizard 33B Slerp16K / 35.3 GB90
ValidateAI 33B Slerp16K / 35.4 GB110
Deepseek Coder 33B Instruct16K / 66.5 GB14194483
WhiteRabbitNeo 33B V116K / 67 GB128684
ValidateAI 3 33B Ties16K / 66.5 GB70
ValidateAI 2 33B AT16K / 66.5 GB100
...dy Deepseekcoder 33B V16.1 32K16K / 67.1 GB13580
Note: green Score (e.g. "73.2") means that the model is better than codefuse-ai/CodeFuse-DeepSeek-33B.

Rank the CodeFuse DeepSeek 33B Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42625 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227