CodeFuse DeepSeek 33B by codefuse-ai

 »  All LLMs  »  codefuse-ai  »  CodeFuse DeepSeek 33B   URL Share it on

  Autotrain compatible   Conversational   Endpoints compatible   License:other   Llama   Pytorch   Region:us   Sharded

CodeFuse DeepSeek 33B Benchmarks

Rank the CodeFuse DeepSeek 33B Capabilities

🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  
CodeFuse DeepSeek 33B (codefuse-ai/CodeFuse-DeepSeek-33B)

Quantized Models of the CodeFuse DeepSeek 33B

CodeFuse DeepSeek 33B 4bits107518 GB

Best Alternatives to CodeFuse DeepSeek 33B

Best Alternatives
HF Rank
Deepseek Wizard 33B Slerp16K / 35.3 GB17210
ValidateAI 33B Slerp16K / 35.4 GB17050
Deepseek Coder 33B Instruct16K / 66.5 GB23849414
OpenCodeInterpreter DS 33B16K / 66.5 GB756110
Deepseek Coder 33B Base16K / 66.5 GB1029263
Everyone Coder 33B Base16K / 66.5 GB172817
Llm4decompile 33B16K / 66.5 GB27
Everyone Coder 33B V2 Base16K / 66.5 GB15
ValidateAI 3 33B Ties16K / 66.5 GB17210
ValidateAI 2 33B AT16K / 66.5 GB17210

CodeFuse DeepSeek 33B Parameters and Internals

LLM NameCodeFuse DeepSeek 33B
RepositoryOpen on 🤗 
Model Size33b
Required VRAM66.5 GB
Model Typellama
Model Files  9.7 GB: 1-of-7   9.9 GB: 2-of-7   9.9 GB: 3-of-7   9.8 GB: 4-of-7   9.9 GB: 5-of-7   9.9 GB: 6-of-7   7.4 GB: 7-of-7
Model ArchitectureLlamaForCausalLM
Context Length16384
Model Max Length16384
Transformers Version4.34.1
Tokenizer ClassLlamaTokenizerFast
Beginning of Sentence Token<|begin▁of▁sentence|>
End of Sentence Token<|end▁of▁sentence|>
Vocabulary Size32256
Initializer Range0.02
Torch Data Typebfloat16

What open-source LLMs or SLMs are you in search of? 35130 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v2024042801