Solar Pro Preview Instruct by upstage

 ยป  All LLMs  ยป  upstage  ยป  Solar Pro Preview Instruct   URL Share it on

  Autotrain compatible   Conversational   Custom code   En   Instruct   Region:us   Safetensors   Sharded   Solar   Tensorflow

Solar Pro Preview Instruct Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Solar Pro Preview Instruct (upstage/solar-pro-preview-instruct)

Solar Pro Preview Instruct Parameters and Internals

Model Type 
text-generation, instruction-following
Use Cases 
Areas:
research, commercial applications
Applications:
conversational tasks, instruction-following tasks
Primary Use Cases:
instruction-tuned tasks, chat interactions
Limitations:
Limited language coverage (English), maximum context length of 4K
Additional Notes 
Solar Pro Preview is a pre-release with an official version expected in November 2024, featuring expanded language support and longer context windows.
Supported Languages 
en (high)
Training Details 
Methodology:
Enhanced depth up-scaling method, training strategy on MMLU-Pro and IFEval benchmarks
Context Length:
4000
Hardware Used:
GPU with 80GB of VRAM
Input Output 
Accepted Modalities:
text
Output Format:
text
Performance Tips:
Use the ChatML template for optimal performance in conversational and instruction-following tasks.
LLM NameSolar Pro Preview Instruct
Repository ๐Ÿค—https://huggingface.co/upstage/solar-pro-preview-instruct 
Model Size22.1b
Required VRAM44.5 GB
Updated2025-02-05
Maintainerupstage
Model Typesolar
Instruction-BasedYes
Model Files  4.9 GB: 1-of-9   5.0 GB: 2-of-9   4.9 GB: 3-of-9   5.0 GB: 4-of-9   5.0 GB: 5-of-9   5.0 GB: 6-of-9   4.9 GB: 7-of-9   5.0 GB: 8-of-9   4.8 GB: 9-of-9
Supported Languagesen
Model ArchitectureSolarForCausalLM
Licensemit
Context Length4096
Model Max Length4096
Transformers Version4.44.2
Tokenizer ClassLlamaTokenizer
Padding Token<|im_end|>
Vocabulary Size32128
Torch Data Typebfloat16

Quantized Models of the Solar Pro Preview Instruct

Model
Likes
Downloads
VRAM
...olar Pro Preview Instruct GGUF2412003924 GB

Rank the Solar Pro Preview Instruct Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 42577 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227