LLM Name | Deepseek Qwen2.5 7B Redistil |
Repository 🤗 | https://huggingface.co/jan-hq/Deepseek-Qwen2.5-7B-Redistil |
Model Size | 7b |
Required VRAM | 15.2 GB |
Updated | 2025-02-22 |
Maintainer | jan-hq |
Model Type | qwen2 |
Model Files | |
GGUF Quantization | Yes |
Quantization Type | gguf |
Model Architecture | Qwen2ForCausalLM |
Context Length | 131072 |
Model Max Length | 131072 |
Transformers Version | 4.48.2 |
Tokenizer Class | LlamaTokenizer |
Padding Token | <|end▁of▁sentence|> |
Vocabulary Size | 152064 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Pathumma Llm Text 1.0.0 | 128K / 30.5 GB | 451 | 8 |
SvelteCodeQwen1.5 7B Chat | 64K / 14.5 GB | 460 | 0 |
CodeQwen1.5 7B Chat GGUF | 64K / 3 GB | 132 | 2 |
Qwen2 Cantonese 7B Instruct | 32K / 15.4 GB | 130 | 3 |
Openthaigpt1.5 7B Instruct | 32K / 15.2 GB | 2053 | 15 |
Qwen 2.5 7B Threatflux | 32K / 15.5 GB | 72 | 5 |
...der 7B Instruct Abliterated V1 | 32K / 15.2 GB | 53 | 1 |
Qwen2 7B Instruct GGUF | 32K / 3 GB | 105 | 1 |
Qwen2 7B Instruct GGUF | 32K / 3 GB | 99 | 0 |
Qwen1.5 7B Chat GGUF | 32K / 3.1 GB | 123 | 1 |
🆘 Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! 🌟