LLM Name | Deepseek Qwen2.5 7B Redistil |
Repository 🤗 | https://huggingface.co/jan-hq/Deepseek-Qwen2.5-7B-Redistil |
Model Size | 7b |
Required VRAM | 15.2 GB |
Updated | 2025-04-30 |
Maintainer | jan-hq |
Model Type | qwen2 |
Model Files | |
GGUF Quantization | Yes |
Quantization Type | gguf |
Model Architecture | Qwen2ForCausalLM |
Context Length | 131072 |
Model Max Length | 131072 |
Transformers Version | 4.48.2 |
Tokenizer Class | LlamaTokenizer |
Padding Token | <|end▁of▁sentence|> |
Vocabulary Size | 152064 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Pathumma Llm Text 1.0.0 | 128K / 30.5 GB | 309 | 9 |
SvelteCodeQwen1.5 7B Chat | 64K / 14.5 GB | 460 | 0 |
CodeQwen1.5 7B Chat GGUF | 64K / 3 GB | 119 | 2 |
Qwen2 Cantonese 7B Instruct | 32K / 15.4 GB | 78 | 3 |
Openthaigpt1.5 7B Instruct | 32K / 15.2 GB | 1227 | 15 |
Qwen 2.5 7B Threatflux | 32K / 15.5 GB | 11 | 6 |
...der 7B Instruct Abliterated V1 | 32K / 15.2 GB | 201 | 1 |
Qwen2 7B Instruct GGUF | 32K / 3 GB | 147 | 1 |
Qwen2 7B Instruct GGUF | 32K / 3 GB | 41 | 0 |
Qwen1.5 7B Chat GGUF | 32K / 3.1 GB | 145 | 1 |
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟