LLM Name | Qwen 7B Chat Int8 |
Repository 🤗 | https://huggingface.co/Qwen/Qwen-7B-Chat-Int8 |
Model Size | 7B |
Required VRAM | 9 GB |
Updated | 2024-09-16 |
Maintainer | Qwen |
Model Type | qwen |
Model Files | |
Supported Languages | zh en |
Model Architecture | QWenLMHeadModel |
Context Length | 32768 |
Model Max Length | 32768 |
Transformers Version | 4.32.0 |
Tokenizer Class | QWenTokenizer |
Vocabulary Size | 151936 |
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Qwen 7B Chat | 32K / 15.3 GB | 32126 | 745 |
Qwen 7B | 32K / 15.3 GB | 25379 | 361 |
Qwen 7B Chat Int4 | 32K / 5.8 GB | 4115 | 67 |
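
As a rough sanity check on the memory figures listed here, the weights-only footprint of a model can be estimated as parameter count times bits per weight. The sketch below is illustrative only: the parameter count of about 7.7 billion is an assumption (the listing only says 7B), the 16-bit precision for the unquantized variants is also an assumption, and `weight_memory_gb` is a hypothetical helper, not part of any library. Actual requirements will be higher, since the estimate ignores activations, the KV cache, and any layers kept at higher precision.

```python
def weight_memory_gb(num_params: float, bits_per_weight: int) -> float:
    """Estimate weights-only memory in GB (10^9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: approximate parameter count; the listing itself only states "7B".
NUM_PARAMS = 7.7e9

# Assumption: the unquantized variants are stored at 16 bits per weight.
for name, bits in [("16-bit", 16), ("Int8", 8), ("Int4", 4)]:
    print(f"{name}: ~{weight_memory_gb(NUM_PARAMS, bits):.2f} GB for weights alone")
```

Under these assumptions the 16-bit estimate lands close to the 15.3 GB listed for the unquantized models, while the Int8 and Int4 estimates fall below the listed 9 GB and 5.8 GB, which is the expected direction once runtime overhead is included.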