LLM Name | PULI LlumiX 32K Instruct Q4 K M GGUF |
Repository ๐ค | https://huggingface.co/fragata/PULI-LlumiX-32K-Instruct-Q4_K_M-GGUF |
Model Size | 7b |
Required VRAM | 4.1 GB |
Updated | 2024-12-22 |
Maintainer | fragata |
Model Type | llama |
Instruction-Based | Yes |
Model Files | |
GGML Quantization | Yes |
GGUF Quantization | Yes |
Quantization Type | gguf|ggml|q4|q4_k |
Model Architecture | LlamaForCausalLM |
Context Length | 32768 |
Model Max Length | 32768 |
Transformers Version | 4.36.0.dev0 |
Tokenizer Class | LlamaTokenizer |
Padding Token | <unk> |
Vocabulary Size | 32000 |
Torch Data Type | bfloat16 |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Sqlcoder 7B 2 | 16K / 13.5 GB | 48768 | 301 |
Sql Code Gguf | 16K / 4.8 GB | 12 | 0 |
...pseek Coder 6.7B Instruct GGUF | 16K / 2.5 GB | 198 | 6 |
Bailong Orpo 7B | 4K / 14 GB | 32 | 5 |
Latxa 7B Instruct | 4K / 13.5 GB | 5 | 0 |
...p 0.05 Max Grad1.0 Grad Accu32 | 32K / 14.4 GB | 24 | 0 |
...p 0.05 Max Grad1.0 Grad Accu32 | 32K / 14.4 GB | 23 | 0 |
...ruct Solidity Bnb 4bit Smashed | 16K / 4.2 GB | 14 | 0 |
...B Instruct Hf Bnb 4bit Smashed | 16K / 4.2 GB | 21 | 0 |
CodelLama7B Inst DPO 7K Mlx | 16K / 4.2 GB | 8 | 2 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐