Baichuan 7B Sft by hiyouga


Tags: Autotrain compatible, Baichuan, Custom code, Dataset:sahil2801/codealpaca-2..., Dataset:tatsu-lab/alpaca, En, Endpoints compatible, Lora, Pytorch, Region:us, Sharded, Zh
Model Card on HF 🤗: https://huggingface.co/hiyouga/Baichuan-7B-sft

Baichuan 7B Sft Parameters and Internals

Model Type: text-generation
Additional Notes: Bilingual instruction-tuned LoRA model with training scripts provided for reproduction using LLaMA-Factory.
Supported Languages: zh (unknown), en (unknown)
Training Details
Data Sources: tatsu-lab/alpaca, sahil2801/CodeAlpaca-20k
Methodology: LoRA (Low-Rank Adaptation)
Input Output
Accepted Modalities: text
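
A minimal loading sketch, assuming the Hugging Face `transformers` and `torch` packages are installed and that the repository can be loaded through the generic `AutoTokenizer` and `AutoModelForCausalLM` classes. The `trust_remote_code=True` flag is an assumption based on the "Custom code" tag on this listing, and `float16` follows the Torch Data Type listed for this model. The helper names `load_model_and_tokenizer` and `generate` are hypothetical, and no particular prompt template is assumed.

```python
MODEL_ID = "hiyouga/Baichuan-7B-sft"  # repository named in this listing


def load_model_and_tokenizer(model_id: str = MODEL_ID):
    """Load the tokenizer and model; the first call downloads the sharded weights."""
    # Imported lazily so this file can be read or imported without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Assumption: the repo ships its own model and tokenizer classes ("Custom code" tag),
    # so remote code has to be trusted explicitly.
    tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        trust_remote_code=True,
        torch_dtype=torch.float16,  # matches the listed Torch Data Type
    )
    return tokenizer, model


def generate(tokenizer, model, prompt: str, max_new_tokens: int = 128) -> str:
    """Generate a continuation for a plain-text prompt and return the decoded text."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `load_model_and_tokenizer()` and then `generate(tokenizer, model, "...")` would be the typical usage. Review the repository's code before enabling `trust_remote_code`, since it executes Python from the model repo.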
LLM Name: Baichuan 7B Sft
Repository 🤗: https://huggingface.co/hiyouga/Baichuan-7B-sft
Model Size: 7b
Required VRAM: 14 GB
Updated: 2024-12-26
Maintainer: hiyouga
Model Type: baichuan
Model Files: 10.0 GB (1-of-2), 4.0 GB (2-of-2)
Supported Languages: zh, en
Model Architecture: BaiChuanForCausalLM
License: apache-2.0
Context Length: 4096
Model Max Length: 4096
Transformers Version: 4.30.1
Tokenizer Class: BaiChuanTokenizer
Beginning of Sentence Token: <s>
End of Sentence Token: </s>
Unk Token: <unk>
Vocabulary Size: 64000
LoRA Model: Yes
Torch Data Type: float16
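
The Required VRAM figure can be sanity-checked with simple arithmetic. The sketch below assumes the nominal "7b" size means exactly 7 billion parameters (the true count is not given in this listing) and counts only the raw weights, ignoring activations and the KV cache. The function name `weight_memory_gb` is hypothetical.

```python
# Assumption: "Model Size 7b" is treated as exactly 7e9 parameters.
NOMINAL_PARAMS = 7_000_000_000

# Storage cost per parameter for common data types, in bytes.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def weight_memory_gb(num_params: int, dtype: str) -> float:
    """Return the size of the raw weights in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


fp16_gb = weight_memory_gb(NOMINAL_PARAMS, "float16")
shard_total_gb = 10.0 + 4.0  # the two shard sizes listed under Model Files

print(f"float16 weights: {fp16_gb:.1f} GB")
print(f"sum of shards:   {shard_total_gb:.1f} GB")
```

Under these assumptions the float16 estimate and the sum of the two listed shards both come to 14 GB, which agrees with the listed Required VRAM. The same arithmetic gives 28 GB for float32, which would be consistent with the 28 GB entries among the alternatives, although this listing does not state their data type.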

Best Alternatives to Baichuan 7B Sft

Best Alternatives          Context / RAM    Downloads Likes
Baichuan 7B                4K / 14 GB       15952834
WisdomInterrogatory        4K / 13.9 GB     144
MedChatZH                  4K / 14 GB       796
HuatuoGPT 7B               4K / 28 GB       46622
Qiaoban Bc                 4K / 28 GB       137
Baichuan 7B Instruction    4K / 14 GB       172
Baichuan 7B Sft 001        4K / 14 GB       143
Baichuan 7B                4K / 13.8 GB     160
Baichuan 7B Chat           4K / 14 GB       3425
Baichuan 7B Sharded        4K / 13.9 GB     181

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227