QwQ 32B Preview by Qwen

 ยป  All LLMs  ยป  Qwen  ยป  QwQ 32B Preview   URL Share it on

  Arxiv:2407.10671   Autotrain compatible Base model:finetune:qwen/qwen2... Base model:qwen/qwen2.5-32b-in...   Chat   Conversational   En   Endpoints compatible   Instruct   Qwen2   Region:us   Safetensors   Sharded   Tensorflow
Model Card on HF ๐Ÿค—: https://huggingface.co/Qwen/QwQ-32B-Preview 

QwQ 32B Preview Benchmarks

QwQ 32B Preview (Qwen/QwQ-32B-Preview)

QwQ 32B Preview Parameters and Internals

Model Type 
Causal Language Models
Use Cases 
Limitations:
Language Mixing and Code-Switching, Recursive Reasoning Loops, Safety and Ethical Considerations, Performance and Benchmark Limitations
Additional Notes 
As a preview release, it demonstrates promising analytical abilities while having several important limitations.
Supported Languages 
en (Native)
Training Details 
Context Length:
32768
Model Architecture:
transformers with RoPE, SwiGLU, RMSNorm, and Attention QKV bias
Responsible Ai Considerations 
Mitigation Strategies:
Requires enhanced safety measures to ensure reliable and secure performance.
Input Output 
Input Format:
text input with chat template
Accepted Modalities:
text
Output Format:
text
LLM NameQwQ 32B Preview
Repository ๐Ÿค—https://huggingface.co/Qwen/QwQ-32B-Preview 
Base Model(s)  Qwen/Qwen2.5-32B-Instruct   Qwen/Qwen2.5-32B-Instruct
Model Size32b
Required VRAM65.5 GB
Updated2025-02-05
MaintainerQwen
Model Typeqwen2
Instruction-BasedYes
Model Files  3.9 GB: 1-of-17   3.9 GB: 2-of-17   3.9 GB: 3-of-17   3.9 GB: 4-of-17   3.9 GB: 5-of-17   3.9 GB: 6-of-17   3.9 GB: 7-of-17   3.9 GB: 8-of-17   3.9 GB: 9-of-17   3.9 GB: 10-of-17   3.9 GB: 11-of-17   3.9 GB: 12-of-17   3.9 GB: 13-of-17   3.9 GB: 14-of-17   3.9 GB: 15-of-17   3.9 GB: 16-of-17   3.1 GB: 17-of-17
Supported Languagesen
Model ArchitectureQwen2ForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.43.1
Tokenizer ClassQwen2Tokenizer
Padding Token<|endoftext|>
Vocabulary Size152064
Torch Data Typebfloat16
Errorsreplace

Quantized Models of the QwQ 32B Preview

Model
Likes
Downloads
VRAM
PathfinderAI05265 GB
...eview Gptqmodel 4bit Vortex V216123215 GB
QwQ 32B Preview AWQ2340757419 GB
QwQ 32B Preview 6bit410026 GB
QwQ 32B Preview Bnb 4bit3214519 GB
...Q 32B Preview Unsloth Bnb 4bit1892925 GB
...eview Gptqmodel 4bit Vortex V1513416 GB
QwQ 32B Preview GPTQ 4bit330616 GB
QwQ 32B Preview 8bit510734 GB
QwQ 32B Preview 4bit39518 GB

Best Alternatives to QwQ 32B Preview

Best Alternatives
Context / RAM
Downloads
Likes
...y Qwen2.5coder 32B V24.1q 200K195K / 65.8 GB122
...wen2.5 32B Inst BaseMerge TIES128K / 65.8 GB3619
...wen2.5 32B Inst BaseMerge TIES128K / 65.8 GB181
Franqwenstein 35B128K / 69.8 GB2578
EVA Qwen2.5 32B V0.2128K / 65.8 GB345048
...1 Qwen2.5 Instruct 32B Preview128K / 65.8 GB1747
QwQenSeek Coder128K / 65.7 GB584
Qwenstein2.5 32B Instruct128K / 65.5 GB942
EVA Qwen2.5 32B V0.0128K / 65.8 GB104826
EVA Qwen2.5 32B V0.1128K / 65.8 GB99514

Rank the QwQ 32B Preview Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42577 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227