PARM V1.5 Base QwQ Qwen 2.5 O1 3B by Pinkstack

 ยป  All LLMs  ยป  Pinkstack  ยป  PARM V1.5 Base QwQ Qwen 2.5 O1 3B   URL Share it on

  Autotrain compatible Base model:finetune:qwen/qwen2... Base model:qwen/qwen2.5-3b-ins...   Conversational   En   Endpoints compatible   Instruct   O1   Pytorch   Qwen2   Region:us   Safetensors   Sft   Sharded   Trl   Unsloth   Zh

PARM V1.5 Base QwQ Qwen 2.5 O1 3B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
PARM V1.5 Base QwQ Qwen 2.5 O1 3B (Pinkstack/PARM-V1.5-base-QwQ-Qwen-2.5-o1-3B)

PARM V1.5 Base QwQ Qwen 2.5 O1 3B Parameters and Internals

LLM NamePARM V1.5 Base QwQ Qwen 2.5 O1 3B
Repository ๐Ÿค—https://huggingface.co/Pinkstack/PARM-V1.5-base-QwQ-Qwen-2.5-o1-3B 
Base Model(s)  Qwen/Qwen2.5-3B-Instruct   Qwen/Qwen2.5-3B-Instruct
Model Size3b
Required VRAM6.2 GB
Updated2025-03-24
MaintainerPinkstack
Model Typeqwen2
Instruction-BasedYes
Model Files  5.0 GB: 1-of-2   1.2 GB: 2-of-2
Supported Languagesen zh
Model ArchitectureQwen2ForCausalLM
Licenseapache-2.0
Context Length32768
Model Max Length32768
Transformers Version4.46.3
Tokenizer ClassQwen2Tokenizer
Padding Token<|PAD_TOKEN|>
Vocabulary Size151936
Torch Data Typefloat16
Errorsreplace

Best Alternatives to PARM V1.5 Base QwQ Qwen 2.5 O1 3B

Best Alternatives
Context / RAM
Downloads
Likes
Saba2 3B128K / 6.2 GB100
SmallThinker 3B Preview32K / 6.8 GB114270391
Qwen2.5 3B Instruct32K / 6.2 GB688389221
Chirp 0132K / 6.2 GB100013
Qwen2.5 3B Model Stock V3.132K / 6.8 GB1043
Qwen2.5 3B Model Stock V3.232K / 6.8 GB632
Calme 3.2 Baguette 3B32K / 11 GB30421
Qwen2.5 3B Model Stock V4.132K / 6.8 GB382
Menda 3B 75032K / 6.2 GB201
Calme 3.1 Baguette 3B32K / 6.2 GB30541
Note: green Score (e.g. "73.2") means that the model is better than Pinkstack/PARM-V1.5-base-QwQ-Qwen-2.5-o1-3B.

Rank the PARM V1.5 Base QwQ Qwen 2.5 O1 3B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 45546 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227