LLM Name | Yi 34B Chat AWQ |
Repository | https://huggingface.co/TheBloke/Yi-34B-Chat-AWQ |
Model Name | Yi 34B Chat |
Base Model(s) | |
Model Size | 34b |
Required VRAM | 19.3 GB |
Updated | 2024-12-22 |
Maintainer | TheBloke |
Model Type | llama |
Model Files | |
AWQ Quantization | Yes |
Quantization Type | awq |
Model Architecture | LlamaForCausalLM |
License | other |
Context Length | 4096 |
Model Max Length | 4096 |
Transformers Version | 4.35.2 |
Tokenizer Class | LlamaTokenizer |
Padding Token | <unk> |
Vocabulary Size | 64000 |
Torch Data Type | float16 |
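
As a rough sanity check on the Required VRAM figure above, the weight footprint can be estimated from the parameter count and the bits stored per weight. The sketch below assumes a round 34e9 parameters (from "Model Size | 34b") and 4-bit AWQ weights. The bit width is an assumption: this listing does not state it. The helper name `weight_memory_gb` is hypothetical.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in decimal gigabytes."""
    return n_params * bits_per_weight / 8 / 1e9


# Round figure taken from "Model Size | 34b"; the exact count is not listed.
N_PARAMS = 34e9

# Unquantized footprint, matching "Torch Data Type | float16" (16 bits/weight).
fp16_gb = weight_memory_gb(N_PARAMS, 16)

# Assumption: AWQ stores weights at 4 bits each (not stated in this listing).
awq4_gb = weight_memory_gb(N_PARAMS, 4)

print(f"float16 weights: ~{fp16_gb:.0f} GB")
print(f"4-bit weights:   ~{awq4_gb:.0f} GB")
```

Under these assumptions the 4-bit estimate falls somewhat below the listed 19.3 GB. The listing does not say how its figure was computed, so the gap may reflect quantization metadata, layers kept at higher precision, or runtime overhead.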
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Opus V1 34B AWQ | 195K / 19.2 GB | 33 | 1 |
Yi 34B 200K RPMerge AWQ | 195K / 19.2 GB | 20 | 1 |
Smaug 34B V0.1 AWQ | 195K / 19.2 GB | 10 | 2 |
Tess 34B V1.5B AWQ | 195K / 19.3 GB | 24 | 3 |
...34B 200K DARE Megamerge V8 AWQ | 195K / 19.3 GB | 42 | 2 |
...ey 34B 200K Chat Evaluator AWQ | 195K / 19.3 GB | 22 | 5 |
Deepmoney 34B 200K Base AWQ | 195K / 19.3 GB | 27 | 1 |
Nous Capybara Limarpv3 34B AWQ | 195K / 19.3 GB | 23 | 1 |
Bagel DPO 34B V0.2 AWQ | 195K / 19.3 GB | 17 | 7 |
Nontoxic Bagel 34B V0.2 AWQ | 195K / 19.3 GB | 18 | 2 |