Deepseek Qwen2.5 7B Redistil by jan-hq


Tags: Arxiv:1910.09700, Autotrain compatible, Conversational, Endpoints compatible, Gguf, Quantized, Qwen2, Region:us, Safetensors, Sharded, Tensorflow

Deepseek Qwen2.5 7B Redistil Benchmarks

Scores (nn.n%) show how the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o"), or GPT-4 ("gpt4").
Model: Deepseek Qwen2.5 7B Redistil (jan-hq/Deepseek-Qwen2.5-7B-Redistil)

Deepseek Qwen2.5 7B Redistil Parameters and Internals

LLM Name: Deepseek Qwen2.5 7B Redistil
Repository: https://huggingface.co/jan-hq/Deepseek-Qwen2.5-7B-Redistil
Model Size: 7b
Required VRAM: 15.2 GB
Updated: 2025-02-22
Maintainer: jan-hq
Model Type: qwen2
Model Files: 8.1 GB, 8.1 GB, 4.9 GB (1-of-4), 4.9 GB (2-of-4), 4.3 GB (3-of-4), 1.1 GB (4-of-4)
GGUF Quantization: Yes
Quantization Type: gguf
Model Architecture: Qwen2ForCausalLM
Context Length: 131072
Model Max Length: 131072
Transformers Version: 4.48.2
Tokenizer Class: LlamaTokenizer
Padding Token: <|end▁of▁sentence|>
Vocabulary Size: 152064
Torch Data Type: bfloat16
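
The Required VRAM figure appears to match the combined size of the four sharded weight files listed under Model Files. The following is a minimal sketch of that arithmetic, not a statement of how the listing computes the figure. It assumes that 1 GB means 10**9 bytes (the listing does not say), that the shards hold only bfloat16 weights at 2 bytes per parameter, and it ignores the two unsharded 8.1 GB entries. The variable names are illustrative.

```python
# Shard sizes in GB, taken from the "N-of-4" entries under Model Files.
shard_sizes_gb = [4.9, 4.9, 4.3, 1.1]

# bfloat16 stores each parameter in 2 bytes.
BYTES_PER_PARAM_BF16 = 2

# Round to one decimal place to match the precision of the listed sizes.
total_gb = round(sum(shard_sizes_gb), 1)

# Assumption: 1 GB = 10**9 bytes, so GB / bytes-per-parameter gives
# the parameter count in billions.
approx_params_billions = round(total_gb / BYTES_PER_PARAM_BF16, 1)

print(f"Total sharded weights: {total_gb} GB")
print(f"Implied parameter count: ~{approx_params_billions} billion")
```

Under these assumptions the shard total comes to 15.2 GB, the same as the listed Required VRAM, which suggests the figure covers the weights alone. Actual memory use at inference time would also include activations and the key-value cache, which grow with context length.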

Best Alternatives to Deepseek Qwen2.5 7B Redistil

Model                                   Context / RAM     Downloads+Likes
Pathumma Llm Text 1.0.0                 128K / 30.5 GB    4518
SvelteCodeQwen1.5 7B Chat               64K / 14.5 GB     4600
CodeQwen1.5 7B Chat GGUF                64K / 3 GB        1322
Qwen2 Cantonese 7B Instruct             32K / 15.4 GB     1303
Openthaigpt1.5 7B Instruct              32K / 15.2 GB     205315
Qwen 2.5 7B Threatflux                  32K / 15.5 GB     725
...der 7B Instruct Abliterated V1       32K / 15.2 GB     531
Qwen2 7B Instruct GGUF                  32K / 3 GB        1051
Qwen2 7B Instruct GGUF                  32K / 3 GB        990
Qwen1.5 7B Chat GGUF                    32K / 3.1 GB      1231


Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227