Model Type | |
Use Cases |
Areas: | |
Limitations: | Performance may be limited with out-of-domain programming languages. |
|
Considerations: | Providing a few-shot example may help steer the model's output. |
|
|
Supported Languages | Python (supported), JavaScript (supported), Java (supported), Go (supported), C++ (supported), Rust (supported), Other (limited) |
|
Training Details |
Data Sources: | bigcode/commitpackft, TIGER-Lab/MathInstruct, meta-math/MetaMathQA, glaiveai/glaive-code-assistant-v3, glaive-function-calling-v2, bugdaryan/sql-create-context-instruction, garage-bAInd/Open-Platypus, nvidia/HelpSteer |
|
Methodology: | finetuned using instruction-response pairs |
|
Hardware Used: | IBM's super computing clusters Vela and Blue Vela with NVIDIA A100 and H100 GPUs |
|
|
Responsible Ai Considerations |
Mitigation Strategies: | Developers should perform safety testing and target-specific tuning before deployment. |
|
|
Release Notes | |