Baichuan 13B Chat by baichuan-inc

 ยป  All LLMs  ยป  baichuan-inc  ยป  Baichuan 13B Chat   URL Share it on

  Arxiv:2009.03300   Arxiv:2104.09864   Arxiv:2108.12409   Autotrain compatible   Baichuan   Custom code   En   Endpoints compatible   Pytorch   Region:us   Sharded   Zh

Baichuan 13B Chat Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Baichuan 13B Chat (baichuan-inc/Baichuan-13B-Chat)

Baichuan 13B Chat Parameters and Internals

Model Type 
text generation
Use Cases 
Areas:
research, commercial applications
Considerations:
Commercial use must be authorized.
Supported Languages 
Chinese (high), English (high)
Training Details 
Data Sources:
high-quality corpora
Data Volume:
1.4 trillion tokens
Methodology:
Uses ALiBi position encoding
Context Length:
4096
Model Architecture:
Based on Baichuan-7B
Safety Evaluation 
Ethical Considerations:
Users must ensure technology is developed in a regulated and legal environment.
Input Output 
Input Format:
textual prompts
Accepted Modalities:
text
Output Format:
text
Performance Tips:
Supports INT8 and INT4 quantization for efficient inference.
LLM NameBaichuan 13B Chat
Repository ๐Ÿค—https://huggingface.co/baichuan-inc/Baichuan-13B-Chat 
Model Size13b
Required VRAM26.5 GB
Updated2025-01-14
Maintainerbaichuan-inc
Model Typebaichuan
Model Files  10.0 GB: 1-of-3   9.9 GB: 2-of-3   6.6 GB: 3-of-3
Supported Languageszh en
Model ArchitectureBaichuanForCausalLM
Model Max Length4096
Transformers Version4.29.2
Tokenizer ClassBaichuanTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size64000
Torch Data Typebfloat16

Best Alternatives to Baichuan 13B Chat

Best Alternatives
Context / RAM
Downloads
Likes
Tiny Random Baichuan2 13B0K / 0.1 GB1264020
Baichuan2 13B Chat0K / 27.8 GB72022426
ShieldLM 13B Baichuan20K / 27.8 GB413
Baichuan2 13B Base0K / 27.8 GB102678
Blossom V3.1 Baichuan2 13B0K / 27.8 GB121
HuatuoGPT2 13B0K / 29.1 GB316
Buffer Baichuan2 13B Rag 4bits0K / 9.9 GB160
Buffer Baichuan2 13B Rag0K / 27.8 GB91
... Efficient Training Of LLMs V10K / 29.1 GB231
Sakura 13B LNovel V0.80K / 27.8 GB195
Note: green Score (e.g. "73.2") means that the model is better than baichuan-inc/Baichuan-13B-Chat.

Rank the Baichuan 13B Chat Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 41301 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227