LLM Name | Granite 3B GPTQ 4bit GPTQ Code Instruct |
Repository 🤗 | https://huggingface.co/boadisamson/granite-3b-GPTQ-4bit-GPTQ-code-instruct |
Model Size | 3b |
Required VRAM | 2 GB |
Updated | 2024-10-16 |
Maintainer | boadisamson |
Model Type | llama |
Instruction-Based | Yes |
Model Files | |
GPTQ Quantization | Yes |
Quantization Type | gptq, 4bit |
Model Architecture | LlamaForCausalLM |
Context Length | 2048 |
Model Max Length | 2048 |
Transformers Version | 4.40.2 |
Vocabulary Size | 49152 |
Torch Data Type | float16 |
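
The "Required VRAM" figure above can be sanity-checked with a rough weight-storage estimate. The sketch below assumes that "3b" means exactly 3 billion parameters and counts only the bytes needed for the weights themselves; it ignores GPTQ group scales and zero points, activations, and the KV cache, so real usage will be somewhat higher. The function name `weight_memory_gb` is hypothetical and chosen for illustration.

```python
def weight_memory_gb(num_params: float, bits_per_weight: int) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumption: the listed model size "3b" is taken as exactly 3e9 parameters.
NOMINAL_PARAMS = 3e9

gptq_4bit_gb = weight_memory_gb(NOMINAL_PARAMS, 4)   # 4-bit GPTQ weights
fp16_gb = weight_memory_gb(NOMINAL_PARAMS, 16)       # unquantized float16 weights

print(f"4-bit weights:   {gptq_4bit_gb:.2f} GB")
print(f"float16 weights: {fp16_gb:.2f} GB")
```

Under these assumptions the 4-bit weights take about 1.5 GB, which is in line with the listed 2 GB once runtime overhead is added.
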
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
...zard Evol Instuct V2 196K GPTQ | 2K / 2.1 GB | 4 | 2 |
Llama 3.2 3B Instruct Bnb 4bit | 128K / 2.2 GB | 142578 | 11 |
Llama 3.2 3B Instruct 4bit | 128K / 1.8 GB | 5698 | 7 |
Reasoning Llama 3B V0.1 | 128K / 6.5 GB | 85 | 4 |
...nstruct Medical Conversational | 128K / 6.5 GB | 264 | 3 |
...olio Query Llama 3.2 3B V3 Cot | 128K / 6.5 GB | 169 | 1 |
...Gb Safetensor Experiment 16bit | 128K / 6.5 GB | 96 | 0 |
Llama3.2 3B 4bit | 128K / 2.2 GB | 46 | 0 |
FineTome Llama3.2 3B 1002 | 128K / 6.5 GB | 50 | 1 |
...3b V2 Python Instruct 0.1 4bit | 8K / 2.5 GB | 6 | 0 |