LLM Name | Pallas 0.5 GPTQ |
Repository ๐ค | https://huggingface.co/TheBloke/Pallas-0.5-GPTQ |
Model Name | Pallas 0.5 |
Model Creator | Mihai |
Base Model(s) | |
Model Size | 5.1b |
Required VRAM | 18.6 GB |
Updated | 2024-09-16 |
Maintainer | TheBloke |
Model Type | llama |
Model Files | |
GPTQ Quantization | Yes |
Quantization Type | gptq |
Model Architecture | LlamaForCausalLM |
License | other |
Context Length | 200000 |
Model Max Length | 200000 |
Transformers Version | 4.37.0.dev0 |
Tokenizer Class | LlamaTokenizer |
Padding Token | <unk> |
Vocabulary Size | 64002 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Pallas 0.5 LASER 0.6 GPTQ | 195K / 18.6 GB | 5 | 1 |
Pallas 0.4 GPTQ | 195K / 18.6 GB | 10 | 1 |
Pallas 0.3 GPTQ | 195K / 18.6 GB | 7 | 1 |
Tess M V1.3 GPTQ | 195K / 18.6 GB | 8 | 2 |
Tess M V1.2 GPTQ | 195K / 18.6 GB | 7 | 2 |
Tess M V1.1 GPTQ | 195K / 18.6 GB | 5 | 1 |
Tess M Creative V1.0 GPTQ | 195K / 18.6 GB | 8 | 5 |
PiVoT SUS RP GPTQ | 8K / 18.6 GB | 7 | 1 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐