Yi 34B 200K GPTQ by TheBloke

 ยป  All LLMs  ยป  TheBloke  ยป  Yi 34B 200K GPTQ   URL Share it on

  4-bit   Autotrain compatible   Base model:01-ai/yi-34b-200k Base model:quantized:01-ai/yi-...   Custom code   Gptq   Quantized   Region:us   Safetensors   Yi

Yi 34B 200K GPTQ Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Yi 34B 200K GPTQ (TheBloke/Yi-34B-200K-GPTQ)

Yi 34B 200K GPTQ Parameters and Internals

Model Type 
yi
Use Cases 
Considerations:
Risk of generating problematic outputs remains. Usage comes with no responsibility from creators for misuse or associated data security issues.
Additional Notes 
Compatible with multiple GPU and CPU quantization formats provided by TheBloke. Multiple quantization branches available for different hardware requirements.
Supported Languages 
English (bilingual), Chinese (bilingual)
Training Details 
Data Volume:
200K context length
Context Length:
200000
Input Output 
Input Format:
{prompt}
Performance Tips:
Use consistent prompts and post-processing strategies for better performance.
Release Notes 
Version:
01-ai/Yi-6B-200K
Date:
2023-11-06
Notes:
Base model of Yi-6B-200K with 200K context length released.
Version:
01-ai/Yi-6B
Date:
2023-11-02
Notes:
Base model Yi-6B and Yi-34B released.
LLM NameYi 34B 200K GPTQ
Repository ๐Ÿค—https://huggingface.co/TheBloke/Yi-34B-200K-GPTQ 
Model NameYi 34B 200K
Model Creator01-ai
Base Model(s)  Yi 34B 200K   01-ai/Yi-34B-200K
Model Size34b
Required VRAM18.6 GB
Updated2024-12-26
MaintainerTheBloke
Model Typeyi
Model Files  18.6 GB
GPTQ QuantizationYes
Quantization Typegptq
Model ArchitectureYiForCausalLM
Licenseother
Context Length200000
Model Max Length200000
Transformers Version4.35.0
Tokenizer ClassYiTokenizer
Vocabulary Size64000
Torch Data Typebfloat16

Best Alternatives to Yi 34B 200K GPTQ

Best Alternatives
Context / RAM
Downloads
Likes
Yi 34B GPTQ4K / 18.7 GB6833
...lpaca Rpv3 Scipy 4bpw Hb6 EXL2195K / 18.7 GB152
...200K Alpaca Rpv3 4bpw Hb6 EXL2195K / 18.7 GB202
Yi 34B 200K 4.65bpw H6 EXL2195K / 20.8 GB116
Yi 34B 200K 4.0bpw H6 EXL2195K / 18.1 GB114
Yi 34B 200K 8.0bpw H8 EXL2195K / 34.8 GB113
Yi 34B 200K 5.0bpw H6 EXL2195K / 22.3 GB101
Deepsex 34B 4bpw H8 EXL24K / 18.1 GB254
Deepsex 34B 4bpw H6 EXL24K / 18.1 GB113
Yi 34B 5.0bpw H6 EXL24K / 22.2 GB113
Note: green Score (e.g. "73.2") means that the model is better than TheBloke/Yi-34B-200K-GPTQ.

Rank the Yi 34B 200K GPTQ Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 40248 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241217