Starchat2 15B V0.1 AWQ by stelterlab


Tags: arxiv:2311.07911 · arxiv:2402.19173 · 4-bit · AWQ · autotrain compatible · conversational · endpoints compatible · quantized · region:us · safetensors · starcoder2
Base model (quantized from): HuggingFaceH4/starchat2-15b-sft-v0.1
Datasets: HuggingFaceH4/orca_dpo_pairs, HuggingFaceH4/ultrafeedback_binarized

Starchat2 15B V0.1 AWQ Benchmarks

Benchmark scores (percentages) indicate how the model compares to the reference models: Anthropic Claude 3.5 Sonnet ("so35"), GPT-4o ("gpt4o"), and GPT-4 ("gpt4").
Starchat2 15B V0.1 AWQ (stelterlab/starchat2-15b-v0.1-AWQ)

Starchat2 15B V0.1 AWQ Parameters and Internals

Model Type:
GPT-like, fine-tuned
Use Cases:
Areas: chat, programming
Applications: coding assistant
Primary Use Cases: code completion, programming help
Limitations: may produce syntactically valid but semantically incorrect code; exhibits demographic biases reflecting the GitHub developer community; may generate non-existent URLs
Considerations: review output for accuracy and potential biases
Additional Notes:
The model has not been aligned with RLHF and may produce problematic outputs.
Supported Languages:
English (primary), plus 600+ programming languages
Training Details:
Data Sources: HuggingFaceH4/ultrafeedback_binarized, HuggingFaceH4/orca_dpo_pairs
Methodology: fine-tuned from StarCoder2 using SFT and DPO
Hardware Used: multi-GPU, 8 devices
Model Architecture: GPT-like
Input/Output:
Input Format: messages in chat format
Accepted Modalities: text
Output Format: generated text responses
Performance Tips: use the transformers text-generation pipeline for best performance (see the sketch below)
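
The chat input format and the pipeline tip above can be combined as in the following minimal sketch. It assumes a recent transformers release (the card lists 4.40.1) whose text-generation pipeline accepts chat messages; the prompt text and sampling parameters are illustrative, not taken from the model card:

```python
# Minimal sketch: chat-format inference through the transformers pipeline.
# Assumes transformers >= 4.40 (chat-aware text-generation pipeline) and
# enough GPU memory for the ~9.2 GB AWQ weights; prompt text is illustrative.
import torch
from transformers import pipeline

pipe = pipeline(
    "text-generation",
    model="stelterlab/starchat2-15b-v0.1-AWQ",
    device_map="auto",
    torch_dtype=torch.float16,  # matches the float16 torch dtype listed below
)

# "Messages in chat format", as described under Input Format.
messages = [
    {"role": "system", "content": "You are StarChat2, a helpful coding assistant."},
    {"role": "user", "content": "Write a Python function that checks for palindromes."},
]

outputs = pipe(messages, max_new_tokens=256, do_sample=True, temperature=0.7, top_p=0.95)
# For chat input, the pipeline returns the conversation with the assistant reply appended.
print(outputs[0]["generated_text"][-1]["content"])
```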
LLM Name: Starchat2 15B V0.1 AWQ
Repository: https://huggingface.co/stelterlab/starchat2-15b-v0.1-AWQ
Base Model(s): HuggingFaceH4/starchat2-15b-sft-v0.1
Model Size: 15B
Required VRAM: 9.2 GB
Updated: 2025-02-22
Maintainer: stelterlab
Model Type: starcoder2
Model Files: 9.2 GB
AWQ Quantization: Yes
Quantization Type: awq
Model Architecture: Starcoder2ForCausalLM
License: bigcode-openrail-m
Context Length: 16384
Model Max Length: 16384
Transformers Version: 4.40.1
Tokenizer Class: GPT2Tokenizer
Padding Token: <|im_end|>
Vocabulary Size: 49154
Torch Data Type: float16
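
Given the specs above (Starcoder2ForCausalLM architecture, a 16384-token context, and <|im_end|> as the end/padding token), a direct load without the pipeline wrapper might look like the sketch below. This assumes the AWQ checkpoint loads through the standard from_pretrained path (which requires the autoawq package) and that the repo ships a chat template; the prompt is illustrative only:

```python
# Minimal sketch: loading the AWQ checkpoint directly.
# Assumes the `autoawq` package is installed so transformers can load the
# 4-bit AWQ weights; the user prompt below is illustrative only.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "stelterlab/starchat2-15b-v0.1-AWQ"
tokenizer = AutoTokenizer.from_pretrained(model_id)  # GPT2Tokenizer, vocab size 49154
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Build the prompt via the repo's chat template (assumed present); generation
# is expected to stop at the <|im_end|> token listed above.
prompt = tokenizer.apply_chat_template(
    [{"role": "user", "content": "Explain Python list comprehensions briefly."}],
    tokenize=False,
    add_generation_prompt=True,
)
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=200)  # stays well under the 16K context
print(tokenizer.decode(output[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```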

Best Alternatives to Starchat2 15B V0.1 AWQ

Best Alternatives                      Context / RAM    Downloads  Likes
Starcoder2 15B AWQ                     16K / 9.2 GB            41      1
Speechless Starcoder2 15B              16K / 31.9 GB            9      3
Starcoder2 15B                         16K / 63.8 GB        25873    587
Starchat2 15B V0.1                     16K / 31.9 GB        15029    112
Starcoder2 15B Instruct V0.1           16K / 31.9 GB         1273    101
CodeFuse StarCoder2 15B                16K / 31.9 GB           11      2
StarCoder2 15B Instruct V0.1 GGUF      16K / 6.2 GB           109      0
Starcoder2 15B Finetuned Drake         16K / 63.8 GB            5      0
StarCoder2 15B GGUF                    16K / 6.2 GB          9552      4
StarCoder2 15B Instruct V0.1 GGUF      16K / 6.2 GB            71      0

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227