Jais Family 590M Chat by inceptionai


Tags: arXiv:2307.09288, arXiv:2308.16149, arXiv:2402.12840, Ar, Arabic, Base model:finetune:inceptiona..., Base model:inceptionai/jais-fa..., Conversational, Custom code, Decoder, En, English, Jais, Jais-family, Region:us, Safetensors, Sharded, Tensorflow


Jais Family 590M Chat Parameters and Internals

Model Type 
text generation, decoder, causal-lm
Use Cases 
Areas:
Research, Commercial Applications
Applications:
Development of chat assistants, Sentiment analysis, Summarization of bilingual documents
Primary Use Cases:
Arabic and English NLP tasks, Cultural alignment analysis, Mechanistic interpretability
Additional Notes 
Jais models are designed for Arabic and English tasks, not other languages.
Supported Languages 
Arabic (high proficiency), English (strong capabilities)
Training Details 
Data Sources:
Public web pages, Wikipedia, News articles, Social network content, Code in various languages, Books in Arabic and English, ArXiv papers, Synthetic translations of high-quality English resources
Data Volume:
Up to 1.6 trillion tokens
Methodology:
Two-stage training with frozen and unfrozen layers for adapted pre-training; progressive context length expansion
Context Length:
16384 (the model metadata on this page lists a Model Max Length of 2048 for this checkpoint)
Hardware Used:
Condor Galaxy supercomputer, 64 Cerebras CS-2 WSE-2 units
Model Architecture:
Transformer-based, decoder-only architecture with SwiGLU activation and ALiBi/RoPE position encoding
Responsible AI Considerations
Fairness:
Bias mitigation techniques employed.
Input/Output
Accepted Modalities:
text
Output Format:
generates text
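
The Model Architecture entry above names ALiBi as one of the position encodings. As a rough illustration of that technique only (not code from the Jais repository), the sketch below computes ALiBi's per-head slopes and the causal bias added to attention scores. The listing does not say whether this checkpoint uses ALiBi or RoPE, and the head count of 8 in the usage note is a hypothetical value chosen for the example.

```python
def alibi_slopes(num_heads: int) -> list[float]:
    """Geometric per-head slopes, 2^(-8*i/n) for i = 1..n (power-of-two head counts only)."""
    if num_heads <= 0 or num_heads & (num_heads - 1):
        raise ValueError("this sketch only handles power-of-two head counts")
    return [2.0 ** (-8.0 * (i + 1) / num_heads) for i in range(num_heads)]


def alibi_bias(slope: float, seq_len: int) -> list[list[float]]:
    """Bias for one head: -slope * (query - key) for visible keys, -inf for future keys."""
    return [
        [-slope * (q - k) if k <= q else float("-inf") for k in range(seq_len)]
        for q in range(seq_len)
    ]
```

For example, `alibi_slopes(8)` starts at 0.5 and halves for each subsequent head, and `alibi_bias(0.5, 3)` penalises a key two positions back by 1.0. Because the penalty depends only on distance, no learned position embeddings are needed.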
LLM Name: Jais Family 590M Chat
Repository: https://huggingface.co/inceptionai/jais-family-590m-chat
Base Model(s): Jais Family 590M (inceptionai/jais-family-590m)
Model Size: 590M
Required VRAM: 3.1 GB
Updated: 2025-01-22
Maintainer: inceptionai
Model Type: jais
Model Files: 3.1 GB (1-of-1)
Supported Languages: ar, en
Model Architecture: JAISLMHeadModel
License: apache-2.0
Model Max Length: 2048
Transformers Version: 4.40.1
Tokenizer Class: PreTrainedTokenizerFast
Padding Token: <|endoftext|>
Vocabulary Size: 84992
Torch Data Type: float32
Activation Function: swiglu
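
The metadata above (repository id, "Custom code" tag, float32 weights, 3.1 GB of model files) suggests how the checkpoint might be loaded with Hugging Face Transformers. The following is a minimal sketch under those assumptions, not the maintainers' official recipe. `load_model` and `scaled_weight_size_gb` are hypothetical helper names, and the size estimate covers weights only, ignoring activations and cache.

```python
MODEL_ID = "inceptionai/jais-family-590m-chat"  # repository id from the listing

# Bytes per parameter for common dtypes; used only for a rough weight-size estimate.
BYTES_PER_DTYPE = {"float32": 4, "float16": 2, "bfloat16": 2}


def scaled_weight_size_gb(size_gb: float, from_dtype: str, to_dtype: str) -> float:
    """Rescale a reported weight size when the weights are cast to another dtype."""
    return size_gb * BYTES_PER_DTYPE[to_dtype] / BYTES_PER_DTYPE[from_dtype]


def load_model(model_id: str = MODEL_ID):
    """Load tokenizer and model; downloads the weights on first call."""
    # Imported lazily so the estimate helper works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # trust_remote_code=True is assumed to be required because the listing tags the
    # repository as "Custom code" (architecture JAISLMHeadModel).
    model = AutoModelForCausalLM.from_pretrained(model_id, trust_remote_code=True)
    return tokenizer, model
```

As a usage note, `scaled_weight_size_gb(3.1, "float32", "float16")` suggests that casting the listed float32 weights to half precision would need roughly half the listed storage.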

Best Alternatives to Jais Family 590M Chat

Best Alternatives
Context / RAM
Downloads
Likes
Jais Family 590M, 0K / 3.1 GB, 25596

Original data from HuggingFace, OpenCompass and various public git repos.
Release v20241227