Model Type | |
Use Cases |
Areas: | research, commercial applications |
|
Applications: | chatbots, text generation tasks |
|
Primary Use Cases: | |
Limitations: | Performance in languages other than French and English is not guaranteed., Degradation in performance due to change in datatype from float16 to bfloat16. |
|
|
Additional Notes | The tokenizer designed for multilingual contexts improves efficiency. |
|
Supported Languages | French (fluent), English (fluent) |
|
Training Details |
Data Sources: | ehartford/wizard_vicuna_70k_unfiltered, shahules786/orca-chat, timdettmers/openassistant-guanaco, laion/OIG |
|
Data Volume: | |
Methodology: | Fine-tuned on French and English data |
|
Context Length: | |
Hardware Used: | 1 x A100 40GB, 4 x A100 40GB |
|
Model Architecture: | Transposition from float16 to bfloat16 for improved efficiency |
|
|
Input Output |
Input Format: | |
Accepted Modalities: | |
Output Format: | Text response with chatbot capabilities |
|
Performance Tips: | Precede individual prompt by EOS token (</s>) and generated part by BOS token (<s>). |
|
|
Release Notes |
Version: | |
Date: | |
Notes: | Fine-tuned model for chatbot applications in French and English. |
|
|
|