Model Type |
| ||||||||||||
Use Cases |
| ||||||||||||
Supported Languages |
| ||||||||||||
Training Details |
|
LLM Name | Diablo Italian Chatbot 1.3B |
Repository ๐ค | https://huggingface.co/osiria/diablo-italian-chatbot-1.3b |
Model Size | 1.3b |
Required VRAM | 2.6 GB |
Updated | 2025-02-22 |
Maintainer | osiria |
Model Type | xglm |
Model Files | |
Supported Languages | it |
Model Architecture | AutoModel |
License | mit |
Context Length | 2048 |
Model Max Length | 2048 |
Transformers Version | 4.29.0 |
Padding Token | <pad> |
Vocabulary Size | 50335 |
Torch Data Type | float16 |
Activation Function | gelu |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Diablo Italian Base 1.3B | 2K / 2.6 GB | 15 | 0 |
GPT Neo X 1.3B Qlora Test | 0K / 0 GB | 0 | 1 |
Test Discriminator | 0K / 0 GB | 76 | 0 |
Cerebras GPT 1.3B | 0K / 5.4 GB | 2535 | 49 |
...lb 200 Distilled 1.3B Ct2 Int8 | 0K / 1.4 GB | 3007 | 4 |
...pseek Coder 1.3B Instruct GGUF | 0K / 0.6 GB | 35399 | 34 |
Deepseek Coder 1.3B Base GGUF | 0K / 0.6 GB | 4536 | 6 |
...eared LLaMA 1.3B ShareGPT GGUF | 0K / 0.6 GB | 289 | 2 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐