Kyro N1 3B by open-neo

 ยป  All LLMs  ยป  open-neo  ยป  Kyro N1 3B   URL Share it on

  Arxiv:2407.10671   4bit   Agentic   Ar   Autotrain compatible Base model:finetune:qwen/qwen2... Base model:qwen/qwen2.5-3b-ins...   Bn   Ceb   Conversational   Cs   De   Deepseek-r1   En   Endpoints compatible   Es   Fa   Finetuned   Fr   He   Hi   Id   Instruct   It   Ja   Km   Ko   Lo   Ms   My   Nl   Open-llm   Pl   Pt   Quantized   Qwen2   Qwen2.5   Reasoning   Region:us   Ru   Safetensors   Sharded   Synthetic-data   Tensorflow   Th   Tl   Tr   Trl   Unsloth   Ur   Vi   Zh
Model Card on HF ๐Ÿค—: https://huggingface.co/open-neo/Kyro-n1-3B 

Kyro N1 3B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Kyro N1 3B (open-neo/Kyro-n1-3B)

Kyro N1 3B Parameters and Internals

LLM NameKyro N1 3B
Repository ๐Ÿค—https://huggingface.co/open-neo/Kyro-n1-3B 
Base Model(s)  Qwen/Qwen2.5-3B-Instruct   Qwen/Qwen2.5-3B-Instruct
Model Size3b
Required VRAM6.2 GB
Updated2025-02-22
Maintaineropen-neo
Model Typeqwen2
Instruction-BasedYes
Model Files  5.0 GB: 1-of-2   1.2 GB: 2-of-2
Supported Languagesen zh fr es pt de it ru ja ko vi th ar fa he tr cs pl hi bn ur id ms lo my km tl nl
Quantization Type4bit
Model ArchitectureQwen2ForCausalLM
Licensemit
Context Length32768
Model Max Length32768
Transformers Version4.48.2
Tokenizer ClassQwen2Tokenizer
Padding Token<|vision_pad|>
Vocabulary Size151936
Torch Data Typefloat16
Errorsreplace

Best Alternatives to Kyro N1 3B

Best Alternatives
Context / RAM
Downloads
Likes
...5 3B Instruct Unsloth Bnb 4bit32K / 2.4 GB917923
QwQ LCoT 3B Instruct32K / 6.2 GB14210
Qwen2.5 3B Reasoner32K / 6.2 GB210
Athena 1 3B32K / 6.2 GB380
Test StealthThinker32K / 6.2 GB80
Qwen2.5 3B Instruct Bnb 4bit32K / 2 GB195967
...2.5 Coder 3B Instruct Bnb 4bit32K / 2 GB31292
...5 3B Instruct Lora Sex Float1632K / 6.2 GB220
Qwen3b Lora Finetune32K / 6.2 GB91
Saba2 3B128K / 6.2 GB560
Note: green Score (e.g. "73.2") means that the model is better than open-neo/Kyro-n1-3B.

Rank the Kyro N1 3B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227