Yarn Solar 10B 64K by NousResearch

 ยป  All LLMs  ยป  NousResearch  ยป  Yarn Solar 10B 64K   URL Share it on

  Arxiv:2309.00071   Autotrain compatible   Custom code Dataset:emozilla/yarn-train-to...   En   Endpoints compatible   Llama   Pytorch   Region:us   Sharded   Yarn

Yarn Solar 10B 64K Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Yarn Solar 10B 64K (NousResearch/Yarn-Solar-10b-64k)

Yarn Solar 10B 64K Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
research, commercial applications
Additional Notes 
Authors thank LAION AI for compute support.
Supported Languages 
en (proficient)
Training Details 
Data Sources:
emoZilla/yarn-train-tokenized-32k-mistral
Data Volume:
two billion long context tokens
Methodology:
extension method using YaRN
Context Length:
64000
Hardware Used:
JUWELS supercomputer
Input Output 
Accepted Modalities:
text
Output Format:
text
Performance Tips:
pass 'trust_remote_code=True' when loading the model
LLM NameYarn Solar 10B 64K
Repository ๐Ÿค—https://huggingface.co/NousResearch/Yarn-Solar-10b-64k 
Model Size10b
Required VRAM21.4 GB
Updated2025-02-05
MaintainerNousResearch
Model Typellama
Model Files  4.9 GB: 1-of-5   5.0 GB: 2-of-5   4.9 GB: 3-of-5   4.9 GB: 4-of-5   1.7 GB: 5-of-5
Supported Languagesen
Model ArchitectureLlamaForCausalLM
Licenseapache-2.0
Context Length65536
Model Max Length65536
Transformers Version4.35.2
Tokenizer ClassLlamaTokenizer
Vocabulary Size32000
Torch Data Typebfloat16

Quantized Models of the Yarn Solar 10B 64K

Model
Likes
Downloads
VRAM
Yarn Solar 10B 64K GGUF1734 GB

Best Alternatives to Yarn Solar 10B 64K

Best Alternatives
Context / RAM
Downloads
Likes
...enbuddy Falcon3 10B V24.2 131K128K / 20.7 GB80
Priya 10B128K / 20.5 GB371
HelpingAI2.5 10B128K / 20.5 GB662
HelpingAI2.5 10B128K / 20.5 GB6923
L3.1 Mochav2 10B128K / 42.8 GB210
HELVETE X128K / 20.5 GB784
StoryTeller 10B 2e V258K / 21.4 GB51
Falcon3 10B Instruct32K / 20.5 GB3794787
Virtuoso Lite32K / 20.5 GB51226
Falcon3 10B Base32K / 20.5 GB1080832
Note: green Score (e.g. "73.2") means that the model is better than NousResearch/Yarn-Solar-10b-64k.

Rank the Yarn Solar 10B 64K Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42577 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227