To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages

Read original: arXiv:2311.09404 - Published 7/11/2024 by Benedikt Ebing, Goran Glavaš

Overview

This paper evaluates different approaches for cross-lingual transfer (XLT) using multilingual language models (mLMs) and machine translation (MT).
The researchers investigate whether recent advances in massively multilingual MT can replace XLT with mLMs for low-resource language tasks.
They systematically evaluate existing and propose new translation-based XLT approaches, showing that these dramatically outperform zero-shot XLT with mLMs.
The researchers also explore ways to further improve translation-based XLT, such as adding reliable translations to the training data and proposing an effective strategy for languages not supported by the MT system.

Plain English Explanation

When you want to use a machine learning model to understand or generate text in a language that has limited data available, you can try cross-lingual transfer (XLT). This involves using a multilingual language model (mLM) that has been trained on many languages to perform the task in the low-resource language.

However, recent progress in machine translation (MT) has been so significant that the researchers wondered if it could completely replace the need for XLT with mLMs. They evaluated different approaches that use MT to help with low-resource language tasks, and found that these "translation-based" methods dramatically outperformed the zero-shot XLT approach with mLMs.

The researchers also found ways to further improve the translation-based XLT methods. For example, adding reliable translations of the training data into other high-resource languages can boost performance. They also propose a way to do translation-based XLT for languages that aren't supported by the MT system being used.

Overall, the findings suggest that using MT-based approaches is a more robust and effective way to handle low-resource language tasks, compared to relying solely on multilingual language models.

Technical Explanation

The paper systematically evaluates existing and proposes new translation-based cross-lingual transfer (XLT) approaches for transferring to low-resource languages. The researchers compare these techniques to the standard zero-shot XLT approach using multilingual language models (mLMs).

The key translation-based XLT approaches evaluated include:

Round-trip translation of the source-language training data
Translation of the target-language test instances at inference time
Adding reliable translations of the training data into other high-resource languages to the training set

The experiments show that all the translation-based methods dramatically outperform zero-shot XLT with mLMs. The combination of round-trip translation of the source data and translating the target test instances is generally the most effective approach.
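
The combination described above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the `Translate` callable, the `toy_translate` stub, and the language code `"xx"` are hypothetical stand-ins for a real MT system and a real target language. It also assumes a sequence-level task where labels can be copied unchanged; token-level tasks would additionally need labels mapped onto the translated tokens. The presumable motivation for round-tripping is that training inputs then resemble the machine-translated inputs the model sees at test time.

```python
from typing import Callable, List, Tuple

# Hypothetical MT interface (not from the paper): (text, from_lang, to_lang) -> text.
Translate = Callable[[str, str, str], str]
Example = Tuple[str, int]  # (text, label) for a sequence-level task


def round_trip_train_set(train: List[Example], translate: Translate,
                         src: str, tgt: str) -> List[Example]:
    """Translate each source-language training text to the target language and back.

    Labels are copied unchanged, which only works for sequence-level labels.
    """
    round_tripped = []
    for text, label in train:
        forward = translate(text, src, tgt)
        back = translate(forward, tgt, src)
        round_tripped.append((back, label))
    return round_tripped


def translate_test(texts: List[str], translate: Translate,
                   src: str, tgt: str) -> List[str]:
    """Translate target-language test inputs into the source language at inference."""
    return [translate(text, tgt, src) for text in texts]


def toy_translate(text: str, from_lang: str, to_lang: str) -> str:
    """Stand-in for a real MT system: only records the translation direction."""
    return f"{text} [{from_lang}->{to_lang}]"


# Usage: train on the round-tripped data, then predict on translated test inputs.
train_rt = round_trip_train_set([("good film", 1)], toy_translate, "en", "xx")
test_in_src = translate_test(["some target-language text"], toy_translate, "en", "xx")
```

In a real setup, `toy_translate` would be replaced by calls to a massively multilingual MT model, and a classifier would be fine-tuned on `train_rt` before being applied to `test_in_src`.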

The researchers also propose a strategy for doing translation-based XLT for languages not directly supported by the machine translation (MT) system. Additionally, they find that model selection for XLT can be improved by using target-language validation data obtained via MT, rather than relying solely on the source-language data.
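
The model-selection idea can be illustrated with a toy sketch. Everything here is an assumption for illustration: the checkpoints are simple keyword rules rather than fine-tuned mLMs, the validation sets are hand-written, and German merely stands in for a target language whose validation data would in practice come from machine-translating the source-language validation set. The point is only that the checkpoint that looks best on source-language validation data is not necessarily the one that does best on the target language.

```python
from typing import Callable, Dict, List, Tuple

Example = Tuple[str, int]      # (text, label)
Model = Callable[[str], int]   # hypothetical checkpoint: text -> predicted label


def accuracy(model: Model, data: List[Example]) -> float:
    """Fraction of validation examples the model labels correctly."""
    if not data:
        raise ValueError("validation set must not be empty")
    correct = sum(1 for text, label in data if model(text) == label)
    return correct / len(data)


def select_checkpoint(checkpoints: Dict[str, Model], val_data: List[Example]) -> str:
    """Return the name of the checkpoint with the highest validation accuracy."""
    return max(checkpoints, key=lambda name: accuracy(checkpoints[name], val_data))


# Toy checkpoints: keyword rules standing in for models saved at different epochs.
checkpoints: Dict[str, Model] = {
    "epoch_1": lambda text: int("good" in text),
    "epoch_2": lambda text: int("good" in text or "gut" in text),
}

# Source-language validation data (English).
source_val = [("good film", 1), ("bad film", 0), ("gutter talk", 0)]
# Target-language validation data, imagined here as MT output of a source set.
target_val_mt = [("gut film", 1), ("schlecht film", 0)]

picked_by_source = select_checkpoint(checkpoints, source_val)
picked_by_target = select_checkpoint(checkpoints, target_val_mt)
```

Here `picked_by_source` and `picked_by_target` differ, which mirrors the situation in which selecting on translated target-language validation data pays off.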

Critical Analysis

The paper provides a thorough and systematic evaluation of translation-based approaches for cross-lingual transfer (XLT), which is an important problem in multilingual natural language processing. The key finding that translation-based methods significantly outperform zero-shot XLT with multilingual language models (mLMs) is a valuable insight.

However, the paper does not delve into the potential limitations or caveats of the translation-based approaches. For example, the quality of the machine translation (MT) system used could be a critical factor, and the performance gains may not generalize to low-resource language pairs not covered by the MT system.

Additionally, the paper does not address potential issues with the training data quality and translation reliability when adding translated high-resource language data to the training set. This could be an area for further research.

Overall, the findings warrant further exploration and a broader inclusion of translation-based baselines in XLT research. However, it would be valuable for future work to examine the limitations and robustness of these approaches, particularly for truly low-resource language settings.

Conclusion

This paper presents a comprehensive evaluation of translation-based approaches for cross-lingual transfer (XLT), demonstrating that they dramatically outperform the standard zero-shot XLT method using multilingual language models (mLMs). The researchers also propose several ways to further improve the translation-based XLT strategies, such as augmenting the training data with reliable translations and devising approaches for languages not directly supported by the machine translation (MT) system.

These findings suggest that translation-based techniques should be more widely adopted as robust baselines for XLT research, especially as massively multilingual MT systems continue to advance. However, further work is needed to fully understand the limitations and generalization of these approaches, particularly for truly low-resource language settings.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages

Benedikt Ebing, Goran Glavaš

Perfect machine translation (MT) would render cross-lingual transfer (XLT) by means of multilingual language models (mLMs) superfluous. Given, on the one hand, the large body of work on improving XLT with mLMs and, on the other hand, recent advances in massively multilingual MT, in this work, we systematically evaluate existing and propose new translation-based XLT approaches for transfer to low-resource languages. We show that all translation-based approaches dramatically outperform zero-shot XLT with mLMs, with the combination of round-trip translation of the source-language training data and the translation of the target-language test instances at inference being generally the most effective. We next show that one can obtain further empirical gains by adding reliable translations to other high-resource languages to the training data. Moreover, we propose an effective translation-based XLT strategy even for languages not supported by the MT system. Finally, we show that model selection for XLT based on target-language validation data obtained with MT outperforms model selection based on the source-language data. We believe our findings warrant a broader inclusion of more robust translation-based baselines in XLT research.

7/11/2024

Analysis of Multi-Source Language Training in Cross-Lingual Transfer

Seong Hoon Lim, Taejun Yun, Jinhyeon Kim, Jihun Choi, Taeuk Kim

The successful adaptation of multilingual language models (LMs) to a specific language-task pair critically depends on the availability of data tailored for that condition. While cross-lingual transfer (XLT) methods have contributed to addressing this data scarcity problem, there is still ongoing debate about the mechanisms behind their effectiveness. In this work, we focus on one of the promising assumptions about the inner workings of XLT, that it encourages multilingual LMs to place greater emphasis on language-agnostic or task-specific features. We test this hypothesis by examining how the patterns of XLT change with a varying number of source languages involved in the process. Our experimental findings show that the use of multiple source languages in XLT, a technique we term Multi-Source Language Training (MSLT), leads to increased mingling of embedding spaces for different languages, supporting the claim that XLT benefits from making use of language-independent information. On the other hand, we discover that using an arbitrary combination of source languages does not always guarantee better performance. We suggest simple heuristics for identifying effective language combinations for MSLT and empirically prove their effectiveness.

6/6/2024

On the Evaluation Practices in Multilingual NLP: Can Machine Translation Offer an Alternative to Human Translations?

Rochelle Choenni, Sara Rajaee, Christof Monz, Ekaterina Shutova

While multilingual language models (MLMs) have been trained on 100+ languages, they are typically only evaluated across a handful of them due to a lack of available test data in most languages. This is particularly problematic when assessing MLMs' potential for low-resource and unseen languages. In this paper, we present an analysis of existing evaluation frameworks in multilingual NLP, discuss their limitations, and propose several directions for more robust and reliable evaluation practices. Furthermore, we empirically study to what extent machine translation offers a reliable alternative to human translation for large-scale evaluation of MLMs across a wide set of languages. We use a SOTA translation model to translate test data from 4 tasks to 198 languages and use them to evaluate three MLMs. We show that while the selected subsets of high-resource test languages are generally sufficiently representative of a wider range of high-resource languages, we tend to overestimate MLMs' ability on low-resource languages. Finally, we show that simpler baselines can achieve relatively strong performance without having benefited from large-scale multilingual pretraining.

6/21/2024

Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis

Wenhao Zhu, Hongyi Liu, Qingxiu Dong, Jingjing Xu, Shujian Huang, Lingpeng Kong, Jiajun Chen, Lei Li

Large language models (LLMs) have demonstrated remarkable potential in handling multilingual machine translation (MMT). In this paper, we systematically investigate the advantages and challenges of LLMs for MMT by answering two questions: 1) How well do LLMs perform in translating a massive number of languages? 2) Which factors affect LLMs' performance in translation? We thoroughly evaluate eight popular LLMs, including ChatGPT and GPT-4. Our empirical results show that the translation capabilities of LLMs are continually evolving. GPT-4 has beaten the strong supervised baseline NLLB in 40.91% of translation directions but still faces a large gap towards commercial translation systems like Google Translate, especially on low-resource languages. Through further analysis, we discover that LLMs exhibit new working patterns when used for MMT. First, LLMs can acquire translation ability in a resource-efficient way and generate moderate translations even on zero-resource languages. Second, instruction semantics can surprisingly be ignored when given in-context exemplars. Third, cross-lingual exemplars can provide better task guidance for low-resource translation than exemplars in the same language pairs. Code will be released at: https://github.com/NJUNLP/MMT-LLM.

6/17/2024