Model Type | bilingual, auto-regressive, transformer-based, decoder-only, LLM, causal-lm |
|
Use Cases |
Areas: | |
Applications: | Chat assistants, Sentiment analysis, Summarization |
|
Primary Use Cases: | Arabic NLP research, Mechanistic interpretability, Cultural analysis, Chat applications |
|
Limitations: | Limited language proficiency outside Arabic-English |
|
Considerations: | Avoid use in high-stakes decision making. |
|
|
Supported Languages | Arabic (MSA), English (strong) |
|
Training Details |
Data Sources: | Web pages, Wikipedia articles, News articles, Social network content, Code data, Books, ArXiv papers, Synthetic data (in-house translation) |
|
Data Volume: | |
Context Length: | |
Model Architecture: | transformer-based, decoder-only architecture (GPT-3) |
|
|
Input Output |
Input Format: | |
Output Format: | |
|