Open Source LLMs in the Context of Translation

A week ago, the Intento team (a machine translation and multilingual generative AI platform for global enterprise companies) published its 8th annual State of Machine Translation report. The report analyzes 52 MT engines and LLMs across 11 language pairs and 9 content domains; the systems were accessed between March 25 and May 14, 2024. Of particular interest is how open-source LLMs perform on translation.

Several open-source LLMs, including TowerInstruct, RakutenAI 7B, Neurotõlge, Aya-101, Command R, and Mixtral 8x7B, show promising capabilities. The TowerInstruct models by Unbabel, built on Llama-2, are designed specifically for translation and perform well in multilingual settings. RakutenAI 7B excels in Japanese and English translation, while Neurotõlge supports both high-resource and low-resource languages, particularly Finno-Ugric ones. Aya-101 by Cohere covers 101 languages with a focus on lower-resourced ones, and Command R is strong in reasoning and multilingual generation. Mixtral 8x7B by Mistral AI is noted for high-quality output across a range of languages. However, these models generally fall into the second tier compared to commercial engines because their multilingual coverage is more limited.
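For readers who want to try one of these models locally, here is a minimal sketch of querying a TowerInstruct checkpoint through the Hugging Face transformers pipeline. The model ID, prompt wording, and generation settings are illustrative assumptions on my part, not details taken from the report.

```python
# Minimal sketch: translating one segment with a TowerInstruct checkpoint.
# The model ID "Unbabel/TowerInstruct-7B-v0.2" and the example sentence are
# assumptions; adjust to whichever checkpoint and language pair you need.
import torch
from transformers import pipeline

pipe = pipeline(
    "text-generation",
    model="Unbabel/TowerInstruct-7B-v0.2",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

# TowerInstruct is instruction-tuned, so the request is formatted with the
# tokenizer's chat template rather than sent as raw text.
messages = [
    {
        "role": "user",
        "content": (
            "Translate the following text from English into German.\n"
            "English: The new firmware update improves battery life.\n"
            "German:"
        ),
    }
]
prompt = pipe.tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

outputs = pipe(prompt, max_new_tokens=128, do_sample=False)
print(outputs[0]["generated_text"])
```

Greedy decoding (do_sample=False) is the usual choice here, since translation favors deterministic output over creative variation.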

Several Open Source Models Deliver Impressive Results

Despite this potential, open-source LLMs come with notable trade-offs. They are generally 10-100 times less expensive than traditional machine translation (MT) systems, but also 50-1000 times slower, which limits their suitability for real-time applications. Customization through fine-tuning, prompt engineering, and the use of translation memories can enhance their performance, as the sketch below illustrates.
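To make the translation-memory idea concrete, the following sketch builds a prompt that injects fuzzy TM matches as few-shot examples before the segment to translate. The tiny in-memory TM, the similarity threshold, and the prompt wording are all hypothetical; a production setup would typically draw on a TMX file or a vector index rather than difflib.

```python
from difflib import SequenceMatcher

# Tiny illustrative translation memory: (source, target) pairs.
TM = [
    ("The battery lasts up to 12 hours.", "La batería dura hasta 12 horas."),
    ("Press the power button to restart.", "Pulsa el botón de encendido para reiniciar."),
]

def tm_matches(segment: str, tm, threshold: float = 0.6):
    """Return TM entries whose source text is similar to the new segment."""
    scored = [(SequenceMatcher(None, segment, src).ratio(), src, tgt) for src, tgt in tm]
    return [(src, tgt) for score, src, tgt in scored if score >= threshold]

def build_prompt(segment: str, src_lang: str, tgt_lang: str, tm) -> str:
    """Assemble a translation prompt that shows fuzzy TM matches as examples."""
    examples = "\n".join(
        f"{src_lang}: {src}\n{tgt_lang}: {tgt}" for src, tgt in tm_matches(segment, tm)
    )
    return (
        f"Translate from {src_lang} into {tgt_lang}. "
        "Follow the terminology and style of the examples.\n\n"
        f"{examples}\n\n{src_lang}: {segment}\n{tgt_lang}:"
    )

print(build_prompt("The battery lasts up to 10 hours.", "English", "Spanish", TM))
```

The resulting prompt can be passed to any of the models above; the TM matches steer the LLM toward approved terminology without any fine-tuning.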

Open-source models like TowerInstruct 7B and Command R approach top-tier commercial engine performance but often struggle with complex translations, particularly in languages like Arabic.

Overall, while open-source LLMs are cost-effective and show impressive results in certain contexts, they still lag behind commercial models in multilingual capabilities and real-time translation performance.
