Yi 34B 200K by 01-ai


Tags: Arxiv:2311.16502, Arxiv:2401.11944, Arxiv:2403.04652, Autotrain compatible, Endpoints compatible, Llama, Pytorch, Region:us, Safetensors, Sharded, Tensorflow
Model Card on HF: https://huggingface.co/01-ai/Yi-34B-200K

Yi 34B 200K Parameters and Internals

Model Type 
Chat model, Text generation
Use Cases 
Areas:
Chat applications, Creative content generation
Applications:
Commercial applications, Research, Educational tools
Primary Use Cases:
Chatbots, Virtual assistants, Story generation
Limitations:
Potential for hallucination, May produce inconsistent outputs
Considerations:
Adjust generation parameters for desired output qualities.
Additional Notes 
Yi models do not use Llama's weights. They were trained independently on 01.AI's own datasets and training infrastructure.
Supported Languages 
English (Fluent), Chinese (Fluent)
Training Details 
Data Sources:
Multilingual corpora, 3T tokens
Data Volume:
3T Multilingual Corpus
Methodology:
Transformer-based architecture
Context Length:
200,000 tokens
Training Time:
Not specified
Hardware Used:
NVIDIA A800 (80GB), 4090 GPU
Model Architecture:
Based on Llama's architecture
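To give a sense of what a 200,000-token context costs at inference time, the sketch below estimates the size of the key/value cache for a single sequence. The layer count, number of KV heads, and head dimension are assumptions that do not appear in this listing; check them against the model's `config.json` before relying on the result. Only the 2-byte element size follows from the bfloat16 data type listed for the model.

```python
def kv_cache_bytes(
    context_tokens,
    num_layers=60,      # assumption: not stated in this listing
    num_kv_heads=8,     # assumption: grouped-query attention with 8 KV heads
    head_dim=128,       # assumption: not stated in this listing
    bytes_per_value=2,  # bfloat16 stores each value in 2 bytes
):
    """Estimate bytes needed to cache keys and values for one sequence."""
    # The leading 2 accounts for storing both a key and a value per position.
    per_token = 2 * num_layers * num_kv_heads * head_dim * bytes_per_value
    return context_tokens * per_token


if __name__ == "__main__":
    full_context = kv_cache_bytes(200_000)
    print(f"KV cache at 200,000 tokens: {full_context / 1e9:.1f} GB")
```

Under these assumed dimensions the cache grows linearly with context length, so filling the whole window needs tens of gigabytes on top of the weights.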
Responsible AI Considerations
Fairness:
Addressed during model development.
Transparency:
Standard Transformer architecture; detailed in tech report.
Accountability:
01.AI
Mitigation Strategies:
Use of Supervised Fine-Tuning for better accuracy.
Input Output 
Input Format:
Interactive prompt conversation
Accepted Modalities:
Text
Output Format:
Text responses or follow-ups
Performance Tips:
Calibrate temperature, top_p, top_k settings for desired response diversity.
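As an illustration of what these three settings do, here is a minimal pure-Python sketch of sampling one token from raw logits. It is a simplified teaching example, not the implementation used by any particular inference library; the function name `sample_next_token` and its defaults are hypothetical.

```python
import math
import random


def sample_next_token(logits, temperature=1.0, top_k=0, top_p=1.0, rng=None):
    """Pick a token index from raw logits using temperature, top-k and top-p."""
    rng = rng or random.Random()

    # Temperature rescales logits: values below 1 sharpen the distribution,
    # values above 1 flatten it.
    scaled = [x / temperature for x in logits]

    # Softmax, subtracting the maximum for numerical stability.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    ranked = sorted(((e / total, i) for i, e in enumerate(exps)), reverse=True)

    # top_k keeps only the k most likely tokens (0 disables the filter).
    if top_k > 0:
        ranked = ranked[:top_k]
        mass = sum(p for p, _ in ranked)
        ranked = [(p / mass, i) for p, i in ranked]

    # top_p keeps the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for p, i in ranked:
        kept.append((p, i))
        cumulative += p
        if cumulative >= top_p:
            break

    # Draw from the surviving tokens in proportion to their probabilities.
    threshold = rng.random() * sum(p for p, _ in kept)
    running = 0.0
    for p, i in kept:
        running += p
        if threshold <= running:
            return i
    return kept[-1][1]
```

Lower temperature, smaller `top_k`, and smaller `top_p` all narrow the set of likely tokens and make output more deterministic; raising them increases diversity.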
Release Notes 
Version:
1.0
Date:
2023-11-23
Notes:
Initial open-source release of chat model, supporting both 4-bit and 8-bit quantizations.
Version:
2.0
Date:
2023-12-19
Notes:
Improved performance in coding, math, and reasoning with larger context capabilities.
LLM Name: Yi 34B 200K
Repository: https://huggingface.co/01-ai/Yi-34B-200K
Model Size: 34b
Required VRAM: 68.9 GB
Updated: 2024-12-14
Maintainer: 01-ai
Model Type: llama
Model Files: 10.0 GB (1 of 7), 9.9 GB (2 of 7), 9.8 GB (3 of 7), 9.8 GB (4 of 7), 9.8 GB (5 of 7), 9.9 GB (6 of 7), 9.7 GB (7 of 7)
Model Architecture: LlamaForCausalLM
License: apache-2.0
Context Length: 200000
Model Max Length: 200000
Transformers Version: 4.34.0
Tokenizer Class: LlamaTokenizer
Padding Token: <unk>
Vocabulary Size: 64000
Torch Data Type: bfloat16
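The fields above map fairly directly onto a Hugging Face Transformers loading call. The following is a minimal sketch, assuming `torch`, `transformers`, and `accelerate` are installed and that enough GPU memory is available. The `device_map="auto"` setting and the helper names `load_yi` and `complete` are illustrative choices, not something this listing prescribes.

```python
MODEL_ID = "01-ai/Yi-34B-200K"  # repository name from the listing


def load_yi(model_id=MODEL_ID):
    """Load tokenizer and model in bfloat16 (imports are deferred on purpose)."""
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        torch_dtype=torch.bfloat16,  # matches the listed Torch data type
        device_map="auto",           # assumption: let accelerate place the shards
    )
    return tokenizer, model


def complete(prompt, max_new_tokens=64):
    """Generate a continuation for a plain-text prompt."""
    tokenizer, model = load_yi()
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `complete("Hello")` downloads all seven weight shards on first use, so it is only practical on a machine sized for the VRAM figure given above.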

Quantized Models of the Yi 34B 200K

Model
Likes
Downloads
VRAM
Yi 34B 200K GGUF  29548  14 GB
Yi 34B 200K AWQ  948  19 GB
Yi 34B 200K GPTQ  328  18 GB
Yi 34B 200K AWQ  113  19 GB
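The memory figures in this listing can be sanity-checked with simple arithmetic. The sketch below sums the seven shard sizes given under Model Files and estimates weight memory from parameter count and bits per weight. Treating the model as exactly 34 billion parameters is an approximation, and real quantized files add overhead (scales, unquantized layers), so the estimates are rough.

```python
# Shard sizes in GB, as listed under Model Files.
SHARD_SIZES_GB = [10.0, 9.9, 9.8, 9.8, 9.8, 9.9, 9.7]


def weights_gb(params_billion, bits_per_weight):
    """Approximate weight storage in GB: parameters times bytes per parameter."""
    return params_billion * bits_per_weight / 8


total_shards_gb = round(sum(SHARD_SIZES_GB), 1)

if __name__ == "__main__":
    print(f"Sum of shards:   {total_shards_gb} GB")
    print(f"34B at 16 bits:  {weights_gb(34, 16)} GB")
    print(f"34B at 4 bits:   {weights_gb(34, 4)} GB")
```

The shards sum to 68.9 GB, matching the Required VRAM entry, and the 4-bit estimate of about 17 GB sits within the 14 to 19 GB range listed for the quantized variants.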

Best Alternatives to Yi 34B 200K

Best Alternatives
Context / RAM
Downloads
Likes
34B Beta  195K / 69.2 GB  371262
Bagel Hermes 34B Slerp  195K / 68.9 GB  40881
Smaug 34B V0.1  195K / 69.2 GB  371760
Bagel 34B V0.2  195K / 68.7 GB  578339
Yi 34B 200K AEZAKMI V2  195K / 69.2 GB  129312
Smaug 34B V0.1 ExPO  195K / 69.2 GB  30170
Faro Yi 34B  195K / 69.2 GB  38426
Mergekit Slerp Anaazls  195K / 69.2 GB  70
Bagel DPO 34B V0.5  195K / 68.7 GB  306117
Dolphin 2.2 Yi 34B 200K  195K / 69.2 GB  174636

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241124