Model Type |
| |||||||||
Use Cases |
| |||||||||
Supported Languages |
| |||||||||
Training Details |
|
LLM Name | Diablo Italian Base 1.3B |
Repository ๐ค | https://huggingface.co/osiria/diablo-italian-base-1.3b |
Model Size | 1.3b |
Required VRAM | 2.6 GB |
Updated | 2025-03-27 |
Maintainer | osiria |
Model Type | xglm |
Model Files | |
Supported Languages | it |
Model Architecture | AutoModel |
License | mit |
Context Length | 2048 |
Model Max Length | 2048 |
Transformers Version | 4.29.0 |
Padding Token | <pad> |
Vocabulary Size | 50335 |
Torch Data Type | float16 |
Activation Function | gelu |
Best Alternatives |
Context / RAM |
Downloads |
Likes |
---|---|---|---|
Diablo Italian Chatbot 1.3B | 2K / 2.6 GB | 35 | 0 |
GPT Neo X 1.3B Qlora Test | 0K / 0 GB | 0 | 1 |
Test Discriminator | 0K / 0 GB | 6 | 0 |
Cerebras GPT 1.3B | 0K / 5.4 GB | 2140 | 49 |
...lb 200 Distilled 1.3B Ct2 Int8 | 0K / 1.4 GB | 5451 | 4 |
...pseek Coder 1.3B Instruct GGUF | 0K / 0.6 GB | 7053 | 34 |
Deepseek Coder 1.3B Base GGUF | 0K / 0.6 GB | 2047 | 6 |
...eared LLaMA 1.3B ShareGPT GGUF | 0K / 0.6 GB | 289 | 2 |
๐ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐