Diablo Italian Chatbot 1.3B by osiria

 ยป  All LLMs  ยป  osiria  ยป  Diablo Italian Chatbot 1.3B   URL Share it on

  Arxiv:2004.13637   Arxiv:2005.14165   Arxiv:2112.10668   Endpoints compatible   It   Pytorch   Region:us   Safetensors   Xglm

Diablo Italian Chatbot 1.3B Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Diablo Italian Chatbot 1.3B (osiria/diablo-italian-chatbot-1.3b)

Diablo Italian Chatbot 1.3B Parameters and Internals

Model Type 
conversational, Italian language, GPT-like
Use Cases 
Areas:
brief and informal conversations, small talk
Limitations:
Model might behave erratically with prompts outside its training set, Might produce biased or offensive content
Considerations:
Model outputs should be used with caution and not for truthful or fair-required scenarios.
Supported Languages 
Italian (proficient)
Training Details 
Data Sources:
Meta's Blenderbot, machine translation
Data Volume:
50K Italian conversational exchanges
Methodology:
Modified Meta's XGLM architecture with checkpoints and learned through conversational data.
Model Architecture:
GPT-like
LLM NameDiablo Italian Chatbot 1.3B
Repository ๐Ÿค—https://huggingface.co/osiria/diablo-italian-chatbot-1.3b 
Model Size1.3b
Required VRAM2.6 GB
Updated2025-02-22
Maintainerosiria
Model Typexglm
Model Files  2.6 GB   2.6 GB
Supported Languagesit
Model ArchitectureAutoModel
Licensemit
Context Length2048
Model Max Length2048
Transformers Version4.29.0
Padding Token<pad>
Vocabulary Size50335
Torch Data Typefloat16
Activation Functiongelu

Best Alternatives to Diablo Italian Chatbot 1.3B

Best Alternatives
Context / RAM
Downloads
Likes
Diablo Italian Base 1.3B2K / 2.6 GB150
GPT Neo X 1.3B Qlora Test0K / 0 GB01
Test Discriminator0K / 0 GB760
Cerebras GPT 1.3B0K / 5.4 GB253549
...lb 200 Distilled 1.3B Ct2 Int80K / 1.4 GB30074
...pseek Coder 1.3B Instruct GGUF0K / 0.6 GB3539934
Deepseek Coder 1.3B Base GGUF0K / 0.6 GB45366
...eared LLaMA 1.3B ShareGPT GGUF0K / 0.6 GB2892
Note: green Score (e.g. "73.2") means that the model is better than osiria/diablo-italian-chatbot-1.3b.

Rank the Diablo Italian Chatbot 1.3B Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 43470 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227