LLM Name | Yi 34B Chat GPTQ |
Repository 🤗 | https://huggingface.co/TheBloke/Yi-34B-Chat-GPTQ |
Model Name | Yi 34B Chat |
Base Model(s) | |
Model Size | 34B |
Required VRAM | 18.6 GB |
Updated | 2024-12-22 |
Maintainer | TheBloke |
Model Type | llama |
Model Files | |
GPTQ Quantization | Yes |
Quantization Type | gptq |
Model Architecture | LlamaForCausalLM |
License | other |
Context Length | 4096 |
Model Max Length | 4096 |
Transformers Version | 4.35.2 |
Tokenizer Class | LlamaTokenizer |
Padding Token | <unk> |
Vocabulary Size | 64000 |
Torch Data Type | bfloat16 |
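
As a rough sanity check on the Required VRAM figure above, the sketch below estimates the memory taken by the weights alone. It assumes 4-bit GPTQ weights and exactly 34e9 parameters, neither of which the table states, and the helper name `estimate_weight_memory_gb` is hypothetical. Treat the result as an approximation, not the maintainer's calculation.

```python
def estimate_weight_memory_gb(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in decimal gigabytes."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumptions (not stated in the table): 34e9 parameters, 4-bit quantized weights.
NUM_PARAMS = 34e9
LISTED_VRAM_GB = 18.6  # "Required VRAM" value from the table

quantized_gb = estimate_weight_memory_gb(NUM_PARAMS, 4)
bf16_gb = estimate_weight_memory_gb(NUM_PARAMS, 16)

# Whatever is left over would cover quantization scales/zero-points,
# unquantized layers, and similar overhead (an interpretation, not a measurement).
overhead_gb = LISTED_VRAM_GB - quantized_gb

print(f"4-bit weights:  {quantized_gb:.1f} GB")
print(f"bfloat16 weights: {bf16_gb:.1f} GB")
print(f"Gap to listed VRAM: {overhead_gb:.1f} GB")
```

Under these assumptions the quantized weights come to about 17 GB, which is in line with the listed 18.6 GB once overhead is included. Activations and the KV cache for a 4096-token context would add to this at inference time.
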
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Smaug 34B V0.1 GPTQ | 195K / 21.2 GB | 15 | 1 |
Yi 34B 200K RPMerge GPTQ | 195K / 21.2 GB | 10 | 3 |
Tess 34B V1.5B GPTQ | 195K / 18.6 GB | 31 | 7 |
...4B 200K DARE Megamerge V8 GPTQ | 195K / 18.6 GB | 97 | 3 |
...ous Capybara Limarpv3 34B GPTQ | 195K / 18.6 GB | 33 | 4 |
Deepmoney 34B 200K Base GPTQ | 195K / 18.6 GB | 22 | 3 |
...y 34B 200K Chat Evaluator GPTQ | 195K / 18.6 GB | 20 | 3 |
Bagel 34B V0.2 GPTQ | 195K / 18.6 GB | 69 | 2 |
Bagel DPO 34B V0.2 GPTQ | 195K / 18.6 GB | 42 | 2 |
Nontoxic Bagel 34B V0.2 GPTQ | 195K / 18.6 GB | 34 | 1 |