Leo Hessianai 13B Chat Bilingual by LeoLM

 ยป  All LLMs  ยป  LeoLM  ยป  Leo Hessianai 13B Chat Bilingual   URL Share it on

  Autotrain compatible   Custom code Dataset:bjoernp/oasst25-08-23-... Dataset:freedomintelligence/al... Dataset:freedomintelligence/ev... Dataset:garage-baind/open-plat...   Dataset:leolm/german poems   Dataset:leolm/german songs   Dataset:leolm/openschnabeltier   Dataset:openassistant/oasst-de Dataset:wizardlm/wizardlm evol...   De   En   Endpoints compatible   Instruct   Llama   Pytorch   Region:us   Sharded

Leo Hessianai 13B Chat Bilingual Benchmarks

nn.n% — How the model compares to the reference models: Anthropic Sonnet 3.5 ("so35"), GPT-4o ("gpt4o") or GPT-4 ("gpt4").
Leo Hessianai 13B Chat Bilingual (LeoLM/leo-hessianai-13b-chat-bilingual)

Leo Hessianai 13B Chat Bilingual Parameters and Internals

Model Type 
Causal decoder-only transformer language model
Additional Notes 
The model is built on the foundation model `LeoLM/leo-hessianai-13b` and finetuned on a selection of German translated instruction datasets and their English counterparts.
Supported Languages 
en (bilingual proficiency), de (native proficiency)
Training Details 
Data Sources:
LeoLM/OpenSchnabeltier, OpenAssistant/OASST-DE, FreedomIntelligence/alpaca-gpt4-deutsch, FreedomIntelligence/evol-instruct-deutsch, LeoLM/German_Poems, LeoLM/German_Songs, garage-bAInd/Open-Platypus, WizardLM/WizardLM_evol_instruct_70k, bjoernp/oasst25-08-23-filtered
Data Volume:
115862397 tokens total
Methodology:
Finetuned for conversational tasks in English and German
Context Length:
8192
Hardware Used:
Supercomputer 42 at HessianAI
Model Architecture:
Transformer
Responsible Ai Considerations 
Fairness:
The model has been tested in English and German; it may produce inaccurate or biased responses.
Mitigation Strategies:
Safety testing and tuning before deployment.
Input Output 
Input Format:
Prompt dialogue template (ChatML format)
Accepted Modalities:
text
Output Format:
Generated text response
Performance Tips:
Use flash-attention2 for faster inference.
LLM NameLeo Hessianai 13B Chat Bilingual
Repository ๐Ÿค—https://huggingface.co/LeoLM/leo-hessianai-13b-chat-bilingual 
Model Size13b
Required VRAM26 GB
Updated2025-03-13
MaintainerLeoLM
Model Typellama
Instruction-BasedYes
Model Files  9.9 GB: 1-of-3   9.9 GB: 2-of-3   6.2 GB: 3-of-3
Supported Languagesen de
Model ArchitectureLlamaForCausalLM
Context Length8192
Model Max Length8192
Transformers Version4.33.1
Tokenizer ClassLlamaTokenizer
Beginning of Sentence Token<s>
End of Sentence Token</s>
Unk Token<unk>
Vocabulary Size32128
Torch Data Typefloat16

Quantized Models of the Leo Hessianai 13B Chat Bilingual

Model
Likes
Downloads
VRAM
...sianai 13B Chat Bilingual GGUF64885 GB
...sianai 13B Chat Bilingual GPTQ41147 GB
...ssianai 13B Chat Bilingual AWQ1957 GB

Best Alternatives to Leo Hessianai 13B Chat Bilingual

Best Alternatives
Context / RAM
Downloads
Likes
NexusRaven V2 13B16K / 26 GB3908466
CodeLlama 13B Instruct Hf16K / 26 GB18103145
CodeLlama 13B MORepair16K / 26 GB322
CodeLlama 13B Instruct Hf16K / 26 GB104921
TableLLM 13B16K / 26 GB31726
NexusRaven 13B16K / 26 GB181102
Panda Coder 13B16K / 26 GB15813
... Llama 2 13B Instruct Text2sql16K / 26 GB16527
Gen Sim16K / 0.3 GB502
Llama 3 13B Instruct Ft8K / 26.1 GB222
Note: green Score (e.g. "73.2") means that the model is better than LeoLM/leo-hessianai-13b-chat-bilingual.

Rank the Leo Hessianai 13B Chat Bilingual Capabilities

๐Ÿ†˜ Have you tried this model? Rate its performance. This feedback would greatly assist ML community in identifying the most suitable model for their needs. Your contribution really does make a difference! ๐ŸŒŸ

Instruction Following and Task Automation  
Factuality and Completeness of Knowledge  
Censorship and Alignment  
Data Analysis and Insight Generation  
Text Generation  
Text Summarization and Feature Extraction  
Code Generation  
Multi-Language Support and Translation  

What open-source LLMs or SLMs are you in search of? 44950 in total.

Our Social Media →  
Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227