Starchat2 15B Sft V0.1 by HuggingFaceH4

 ยป  All LLMs  ยป  HuggingFaceH4  ยป  Starchat2 15B Sft V0.1   URL Share it on

  Alignment-handbook   Autotrain compatible Base model:bigcode/starcoder2-... Base model:finetune:bigcode/st...   Conversational Dataset:huggingfaceh4/airoboro...   Dataset:huggingfaceh4/capybara Dataset:huggingfaceh4/code-fee... Dataset:huggingfaceh4/orca-mat... Dataset:huggingfaceh4/systemch...   Endpoints compatible   Generated from trainer   Region:us   Safetensors   Sharded   Starcoder2   Tensorboard   Tensorflow

Starchat2 15B Sft V0.1 Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Starchat2 15B Sft V0.1 (HuggingFaceH4/starchat2-15b-sft-v0.1)

Starchat2 15B Sft V0.1 Parameters and Internals

Training Details 
Data Sources:
HuggingFaceH4/airoboros-3.2, HuggingFaceH4/Code-Feedback, HuggingFaceH4/orca-math-word-problems-200k, HuggingFaceH4/SystemChat, HuggingFaceH4/capybara
Hardware Used:
16 multi-GPU devices
LLM NameStarchat2 15B Sft V0.1
Repository ๐Ÿค—https://huggingface.co/HuggingFaceH4/starchat2-15b-sft-v0.1 
Base Model(s)  Starcoder2 15B   bigcode/starcoder2-15b
Model Size15b
Required VRAM31.9 GB
Updated2025-02-22
MaintainerHuggingFaceH4
Model Typestarcoder2
Model Files  4.9 GB: 1-of-7   5.0 GB: 2-of-7   5.0 GB: 3-of-7   5.0 GB: 4-of-7   5.0 GB: 5-of-7   5.0 GB: 6-of-7   2.0 GB: 7-of-7   0.0 GB
Model ArchitectureStarcoder2ForCausalLM
Licensebigcode-openrail-m
Context Length16384
Model Max Length16384
Transformers Version4.39.0.dev0
Tokenizer ClassGPT2Tokenizer
Padding Token<|im_end|>
Vocabulary Size49154
Torch Data Typebfloat16

Quantized Models of the Starchat2 15B Sft V0.1

Model
Likes
Downloads
VRAM
Starchat2 15B V0.1 AWQ069 GB

Best Alternatives to Starchat2 15B Sft V0.1

Best Alternatives
Context / RAM
Downloads
Likes
Starcoder2 15B16K / 63.8 GB25873587
Starchat2 15B V0.116K / 31.9 GB15029112
Starcoder2 15B Instruct V0.116K / 31.9 GB1273101
CodeFuse StarCoder2 15B16K / 31.9 GB112
Starcoder2 15B Finetuned Drake16K / 63.8 GB50
...aceH4 Starchat2 15B V0.1 4bits16K / 9.9 GB70
Starcoder2 15B Instruct V0.116K / 53.4 GB110
Dolphincoder Starcoder2 15B16K / 31.9 GB14869
Starcoder2 15B Instruct16K / 31.9 GB227
Opencsg Starcoder2 15B V0.116K / 31.9 GB312
Note: green Score (e.g. "73.2") means that the model is better than HuggingFaceH4/starchat2-15b-sft-v0.1.

Rank the Starchat2 15B Sft V0.1 Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227