Model Type | |
Use Cases |
Areas: | research, foundation for specialization |
|
Applications: | summarization, text generation, chatbot |
|
Limitations: | generalization issues to unsupported languages, biases from web-corpora |
|
Considerations: | Tailor to specific tasks and assessments. |
|
|
Additional Notes | This model was part of a merge using mergekit with specific layer configurations. |
|
Supported Languages | Portuguese (native), English (proficient), German (proficient), Spanish (proficient), French (proficient), Italian (proficient), Polish (proficient), Dutch (proficient), Romanian (proficient), Czech (proficient), Swedish (proficient) |
|
Training Details |
Data Sources: | wikimedia/wikipedia Portuguese subset |
|
Methodology: | continued pre-training and pruning using layer similarity to maintain performance while reducing model size |
|
|
Safety Evaluation |
Methodologies: | PruneMe layer similarity analysis |
|
Risk Categories: | bias, generalization issues |
|
|
Responsible Ai Considerations |
Mitigation Strategies: | Fine-tuning and guardrails recommended for production use. |
|
|
Input Output |
Input Format: | |
Accepted Modalities: | |
Output Format: | |
Performance Tips: | Use PyTorch 2.0 for optimal inference with Falcon models |
|
|
Release Notes |
Notes: | Merged using passthrough method and pruned with PruneMe for optimized performance. |
|
|
|