Model Type | |
Use Cases |
Areas: | research, commercial applications |
|
Applications: | coding, chat, math reasoning |
|
Primary Use Cases: | general question answering, math problem-solving, coding assistance |
|
Limitations: | complex reasoning, mathematical tasks, hallucination, safety concerns |
|
|
Additional Notes | Distributed under Apache License 2.0, enabling wide usage and modification rights. |
|
Supported Languages | English (high), Chinese (limited <0.1% data coverage) |
|
Training Details |
Data Sources: | openchat/openchat_sharegpt4_dataset, kaist-ai/Feedback-Collection, imone/OpenOrca_FLAN, LDJnr/Capybara, tiedong/goat, glaiveai/glaive-code-assistant, meta-math/MetaMathQA, OpenAssistant/oasst_top1_2023-08-25, TIGER-Lab/MathInstruct |
|
Methodology: | RLFT (Reinforcement Learning from Feedback Transcription) |
|
Context Length: | |
|
Input Output |
Input Format: | Text input for tasks like coding, general Q&A, and mathematical reasoning |
|
Accepted Modalities: | |
Output Format: | Text output reflecting task completion and responses |
|
Performance Tips: | Enable tensor parallelism for improved performance |
|
|