LLM Name | Vicuna 33B 1 3 SuperHOT 8K GPTQ |
Repository | Open on ๐ค |
Model Size | 33b |
Required VRAM | 16.9 GB |
Updated | 2024-07-26 |
Maintainer | TheBloke |
Model Type | llama |
Model Files | |
GPTQ Quantization | Yes |
Context Length | 8k |
Quantization Type | gptq |
Model Architecture | LlamaForCausalLM |
License | other |
Context Length | 8192 |
Model Max Length | 8192 |
Transformers Version | 4.30.0.dev0 |
Tokenizer Class | LlamaTokenizer |
Beginning of Sentence Token | <s> |
End of Sentence Token | </s> |
Unk Token | <unk> |
Vocabulary Size | 32000 |
Torch Data Type | float16 |
Best Alternatives |
HF Rank |
Context/RAM |
Downloads |
Likes |
---|---|---|---|---|
WizardCoder 33B V1.1 GPTQ | 0.2 | 16K / 17.4 GB | 88 | 11 |
Everyone Coder 33B Base GPTQ | 0.2 | 16K / 17.4 GB | 12 | 2 |
CodeFuse DeepSeek 33B 4bits | 0.2 | 16K / 18.7 GB | 14 | 10 |
WhiteRabbitNeo 33B V1 GPTQ | 0.2 | 16K / 17.4 GB | 12 | 4 |
...epseek Coder 33B Instruct GPTQ | 0.2 | 16K / 17.4 GB | 83 | 26 |
Deepseek Coder 33B Base GPTQ | 0.2 | 16K / 17.4 GB | 23 | 2 |
... 33B Gpt4 1 4 SuperHOT 8K GPTQ | 0.2 | 8K / 16.9 GB | 11 | 27 |
Sorceroboros 33B S2a4 Gptq | 0.1 | 8K / 17.6 GB | 10 | 3 |
...Combined Data SuperHOT 8K GPTQ | 0.1 | 8K / 18.1 GB | 12 | 4 |
Guanaco 33B SuperHOT 8K GPTQ | 0.1 | 8K / 16.9 GB | 24 | 19 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐