LLM Name | Falcon H1 34B Instruct GPTQ Int8 |
Repository | https://huggingface.co/tiiuae/Falcon-H1-34B-Instruct-GPTQ-Int8 |
Base Model(s) | |
Model Size | 34B |
Required VRAM | 44.3 GB |
Updated | 2025-06-01 |
Maintainer | tiiuae |
Model Type | falcon_h1 |
Instruction-Based | Yes |
Model Files | |
GPTQ Quantization | Yes |
Quantization Type | gptq |
Model Architecture | FalconH1ForCausalLM |
License | other |
Context Length | 8192 |
Model Max Length | 8192 |
Transformers Version | 4.52.0.dev0 |
Tokenizer Class | PreTrainedTokenizer |
Padding Token | <|pad|> |
Vocabulary Size | 261120 |
Torch Data Type | bfloat16 |
Best Alternatives | Context / RAM | Downloads | Likes |
---|---|---|---|
Falcon H1 34B Instruct | 256K / 67 GB | 2635 | 29 |
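The settings listed above (repository ID, bfloat16 data type, 8192-token context length) can be turned into a loading sketch. This is a minimal example, not the maintainer's documented procedure: it assumes `torch`, `transformers` (a version that includes `FalconH1ForCausalLM`), `accelerate`, and a GPTQ backend are installed, and that enough GPU memory is available for the listed 44.3 GB requirement. The helper names `fits_context` and `load_model` are hypothetical, introduced only for illustration.

```python
# Values copied from the listing above.
MODEL_ID = "tiiuae/Falcon-H1-34B-Instruct-GPTQ-Int8"
CONTEXT_LENGTH = 8192      # "Context Length" row
REQUIRED_VRAM_GB = 44.3    # "Required VRAM" row


def fits_context(prompt_tokens: int, max_new_tokens: int) -> bool:
    """Return True if prompt plus generated tokens stay within the listed context length."""
    return prompt_tokens + max_new_tokens <= CONTEXT_LENGTH


def load_model():
    """Load the tokenizer and model (hypothetical helper).

    Heavy operation: downloads the full checkpoint. Imports are done lazily
    so the lightweight helper above can be used without torch installed.
    Assumption: a GPTQ backend compatible with this checkpoint is available.
    """
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # matches the "Torch Data Type" row
        device_map="auto",           # requires accelerate; spreads layers across GPUs
    )
    return tokenizer, model
```

Calling `fits_context` before generation is a cheap way to avoid exceeding the listed 8192-token limit; `load_model()` should only be called on a machine with sufficient GPU memory.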