Diablo Italian Base 1.3B by osiria


Tags: arXiv:2005.14165, arXiv:2112.10668, Endpoints compatible, it, PyTorch, region:us, Safetensors, XGLM

Diablo Italian Base 1.3B Benchmarks

Diablo Italian Base 1.3B (osiria/diablo-italian-base-1.3b)

Diablo Italian Base 1.3B Parameters and Internals

Model Type 
causal language model, text generation
Use Cases 
Primary Use Cases:
basic natural language generation
Limitations:
Not suitable for generating content that must be fair or factually accurate
Considerations:
Do not use the model where generated content is required to be fair or truthful
Supported Languages 
Italian (proficient)
Training Details 
Methodology:
Modifying Meta's XGLM architecture
Model Architecture:
GPT-like architecture
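
As a rough illustration of basic text generation with this model, the sketch below uses the Hugging Face `transformers` library with the repository ID and the 2048-token context length given in this listing. It assumes the checkpoint loads through the generic `AutoModelForCausalLM` class, which this listing does not confirm. The names `new_token_budget` and `generate_text` are hypothetical helpers introduced only for this example.

```python
MODEL_ID = "osiria/diablo-italian-base-1.3b"  # repository ID from this listing
CONTEXT_LENGTH = 2048                         # context length from this listing


def new_token_budget(prompt_tokens: int, requested: int,
                     context_length: int = CONTEXT_LENGTH) -> int:
    """Clamp the number of new tokens so prompt + output fits the context."""
    return max(0, min(requested, context_length - prompt_tokens))


def generate_text(prompt: str, requested_new_tokens: int = 50) -> str:
    """Hypothetical helper: load the model and continue an Italian prompt."""
    # Imported lazily so new_token_budget works without torch/transformers.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # Use float16 on a GPU; fall back to float32 on CPU.
    use_gpu = torch.cuda.is_available()
    device = "cuda" if use_gpu else "cpu"
    dtype = torch.float16 if use_gpu else torch.float32

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    # Assumption: the checkpoint is compatible with the causal-LM auto class.
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, torch_dtype=dtype)
    model.to(device)

    inputs = tokenizer(prompt, return_tensors="pt").to(device)
    budget = new_token_budget(inputs["input_ids"].shape[1], requested_new_tokens)
    if budget == 0:
        return prompt  # no room left in the context window

    output_ids = model.generate(**inputs, max_new_tokens=budget)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_text("Il gatto è")` downloads the weights on first use. As the limitations above state, the output should not be relied on for fairness or factual accuracy.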
LLM Name: Diablo Italian Base 1.3B
Repository: https://huggingface.co/osiria/diablo-italian-base-1.3b
Model Size: 1.3b
Required VRAM: 2.6 GB
Updated: 2025-03-27
Maintainer: osiria
Model Type: xglm
Model Files: 2.6 GB, 2.6 GB
Supported Languages: it
Model Architecture: AutoModel
License: mit
Context Length: 2048
Model Max Length: 2048
Transformers Version: 4.29.0
Padding Token: <pad>
Vocabulary Size: 50335
Torch Data Type: float16
Activation Function: gelu
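
The 2.6 GB VRAM figure is consistent with simple arithmetic on the values listed here: roughly 1.3 billion parameters stored as float16, at 2 bytes each. The sketch below works that out. It assumes the nominal 1.3B parameter count rather than an exact one, and `weight_memory_gb` is a hypothetical helper name.

```python
FLOAT16_BYTES = 2  # bytes per parameter in float16
FLOAT32_BYTES = 4  # bytes per parameter in float32, shown for comparison


def weight_memory_gb(num_parameters: float, bytes_per_parameter: int) -> float:
    """Memory needed to hold the weights alone, in decimal gigabytes."""
    return num_parameters * bytes_per_parameter / 1e9


NOMINAL_PARAMETERS = 1.3e9  # assumption: nominal size, not an exact count

fp16_gb = weight_memory_gb(NOMINAL_PARAMETERS, FLOAT16_BYTES)
fp32_gb = weight_memory_gb(NOMINAL_PARAMETERS, FLOAT32_BYTES)

print(f"float16 weights: {fp16_gb:.1f} GB")
print(f"float32 weights: {fp32_gb:.1f} GB")
```

This estimate covers the weights only. Activations and the attention cache used during generation need additional memory on top of it.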

Best Alternatives to Diablo Italian Base 1.3B

Best Alternatives                    Context / RAM    Downloads, Likes
Diablo Italian Chatbot 1.3B          2K / 2.6 GB      350
GPT Neo X 1.3B Qlora Test            0K / 0 GB        01
Test Discriminator                   0K / 0 GB        60
Cerebras GPT 1.3B                    0K / 5.4 GB      214049
...lb 200 Distilled 1.3B Ct2 Int8    0K / 1.4 GB      54514
...pseek Coder 1.3B Instruct GGUF    0K / 0.6 GB      705334
Deepseek Coder 1.3B Base GGUF        0K / 0.6 GB      20476
...eared LLaMA 1.3B ShareGPT GGUF    0K / 0.6 GB      2892

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227