Nanbeige 16B Chat by Nanbeige

 ยป  All LLMs  ยป  Nanbeige  ยป  Nanbeige 16B Chat   URL Share it on

  Autotrain compatible   Custom code   En   Endpoints compatible   Llama   Nanbeige   Region:us   Safetensors   Sharded   Tensorflow   Zh

Nanbeige 16B Chat Benchmarks

Nanbeige 16B Chat (Nanbeige/Nanbeige-16B-Chat)

Nanbeige 16B Chat Parameters and Internals

Model Type 
text-generation, chat
Use Cases 
Areas:
Research, Commercial applications (with license)
Applications:
Text generation, Dialogue systems
Primary Use Cases:
Chat dialogue, possibly others based on fine-tuning and continuation work
Limitations:
Potential for biased or unexpected outputs
Considerations:
Model's size and probabilistic nature.
Additional Notes 
Contact via the provided email to apply for commercial usage licensing.
Supported Languages 
en (English), zh (Chinese)
Training Details 
Data Sources:
Large amount of high-quality internet corpus, Various books, Code and other desensitized text
Data Volume:
2.5T Tokens
Methodology:
YaRN interpolation method, human-aligned training
Responsible Ai Considerations 
Mitigation Strategies:
Ensuring outputs meet ethical and legal requirements to minimize bias and harmful content.
Input Output 
Input Format:
Expected input for chat dialogue
Accepted Modalities:
text
Output Format:
Text output
LLM NameNanbeige 16B Chat
Repository ๐Ÿค—https://huggingface.co/Nanbeige/Nanbeige-16B-Chat 
Model Size16b
Required VRAM31.6 GB
Updated2024-12-26
MaintainerNanbeige
Model Typellama
Model Files  4.9 GB: 1-of-7   4.9 GB: 2-of-7   4.9 GB: 3-of-7   5.0 GB: 4-of-7   5.0 GB: 5-of-7   4.9 GB: 6-of-7   2.0 GB: 7-of-7
Supported Languagesen zh
Model ArchitectureLlamaForCausalLM
Licenseother
Context Length4096
Model Max Length4096
Transformers Version4.35.0
Tokenizer ClassNanbeigeTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size59392
Torch Data Typebfloat16

Quantized Models of the Nanbeige 16B Chat

Model
Likes
Downloads
VRAM
Nanbeige 16B Chat GGUF11026 GB
Nanbeige 16B Chat GPTQ1189 GB

Best Alternatives to Nanbeige 16B Chat

Best Alternatives
Context / RAM
Downloads
Likes
Phi 3.5 Mini Investigator 16B128K / 7.6 GB280
...aid Blackroot Grand HORROR 16B8K / 33.8 GB310
...o Sft Ties Post Merge Auto DPO8K / 141.2 GB150
Nanbeige2 16B Chat4K / 31.6 GB9630
...ALAXY V03 Slimorca 1 Epoch 50k4K / 31.8 GB2000
...ca 1 Epoch 50k DPO 1 Epoch 30k4K / 31.8 GB870
Nanbeige 16B Base Llama4K / 31.6 GB10463
GALAXY XB V.034K / 31.9 GB750
FusionNet SOLAR4K / 31.9 GB10661
Llama 2 16B Nastychat4K / 32.4 GB7608
Note: green Score (e.g. "73.2") means that the model is better than Nanbeige/Nanbeige-16B-Chat.

Rank the Nanbeige 16B Chat Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40248 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217