Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts

Read original: arXiv:2306.11372 - Published 7/22/2024 by Xuan-Phi Nguyen, Sharifah Mahani Aljunied, Shafiq Joty, Lidong Bing

🤔

Overview

Large language models (LLMs) can effectively perform tasks by observing just a few examples.
In low-resource languages, obtaining these example prompts can be challenging, so unsupervised techniques may be necessary.
LLM performance is often better in high-resource languages due to pre-training data imbalance.
The paper proposes an unsupervised prompting method to leverage LLMs for low-resource language tasks without any supervised data.

Plain English Explanation

The researchers looked at how large language models can be used to perform tasks in languages that don't have a lot of available data, like many indigenous and African languages. These language models are usually very good at tasks when they have lots of example data to learn from. But for less common languages, getting those examples can be difficult.

The researchers developed a new method to create synthetic example prompts from a diverse set of high-resource languages. They then used these prompts to teach the language models how to translate from any language into English. This unsupervised approach allows the models to learn how to work with low-resource languages without needing any labeled training data.

The researchers found that this method performed just as well as supervised few-shot learning approaches, even for smaller language models. It also allowed larger models to perform competitively with even larger, more powerful models. For some non-English translation tasks, the unsupervised method even outperformed supervised approaches.

When tested on multilingual summarization, the researchers' method was able to outperform other baselines that also use English as a pivot language. It was even preferred by the powerful GPT-4 model.

Overall, this unsupervised prompting technique seems to be an effective way to leverage the capabilities of large language models for tasks in low-resource languages, without needing any labeled training data.

Technical Explanation

The paper proposes an unsupervised prompting method to enable large language models (LLMs) to perform tasks in low-resource languages. The researchers assemble synthetic prompts from a diverse set of high-resource languages and use these to teach the LLMs to translate from any language into English.

These translated prompts are then used to create intra-lingual exemplars, which allow the LLMs to perform tasks in the target low-resource languages. The researchers evaluated this approach on translation between English and 13 Indic and 21 African low-resource languages, and found it performed on par with supervised few-shot learning for LLMs of different sizes.

They also showed that fine-tuning a 7B parameter model using the generated data allowed it to perform competitively with a much larger 175B parameter model. For non-English translation tasks, the unsupervised method even outperformed supervised prompting by up to 3 chrF++.

When evaluated on zero-shot multilingual summarization, the researchers' method outperformed other English-pivoting baselines by up to 4 ROUGE-L, and was preferred by the powerful GPT-4 model.

Critical Analysis

The paper presents a novel and effective approach for leveraging large language models in low-resource language settings. By generating synthetic prompts from high-resource languages, the method is able to sidestep the challenge of obtaining labeled training data for these underrepresented languages.

However, one potential limitation is that the quality and diversity of the synthetic prompts may impact the performance of the approach. The researchers do not provide a detailed analysis of how the choice of high-resource languages or the prompt generation process affects the final results.

Additionally, while the method outperforms supervised approaches in some cases, it would be valuable to understand the specific conditions or language pairs where the unsupervised prompting approach excels. A more in-depth analysis of the strengths and weaknesses compared to supervised few-shot learning could help guide practitioners in selecting the appropriate technique for their needs.

Overall, the research presents an interesting and promising direction for expanding the capabilities of large language models to low-resource languages. Further investigation into the factors influencing the method's performance could lead to even more robust and generalizable solutions.

Conclusion

This paper introduces an unsupervised prompting technique that allows large language models to effectively perform tasks in low-resource languages without any supervised data. By leveraging a diverse set of high-resource language prompts, the method is able to teach the models how to translate into English, which can then be used to create intra-lingual exemplars for other tasks.

The researchers demonstrate the effectiveness of this approach through extensive experiments on translation and multilingual summarization tasks, showing performance on par with or exceeding supervised few-shot learning. This work represents an important step forward in making large language models more accessible and capable across a wider range of languages, including many underrepresented and minoritized tongues.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤔

Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts

Xuan-Phi Nguyen, Sharifah Mahani Aljunied, Shafiq Joty, Lidong Bing

Large language models (LLMs) are known to effectively perform tasks by simply observing few exemplars. However, in low-resource languages, obtaining such hand-picked exemplars can still be challenging, where unsupervised techniques may be necessary. Moreover, competent generative capabilities of LLMs are observed only in high-resource languages, while their performances among under-represented languages fall behind due to pre-training data imbalance. To elicit LLMs' ability onto low-resource languages without any supervised data, we propose to assemble synthetic exemplars from a diverse set of high-resource languages to prompt the LLMs to translate from any language into English. These prompts are then used to create intra-lingual exemplars to perform tasks in the target languages. Our unsupervised prompting method performs on par with supervised few-shot learning in LLMs of different sizes for translations between English and 13 Indic and 21 African low-resource languages. We also show that fine-tuning a 7B model on data generated from our method helps it perform competitively with a 175B model. In non-English translation tasks, our method even outperforms supervised prompting by up to 3 chrF++ in many low-resource languages. When evaluated on zero-shot multilingual summarization, our method surpasses other English-pivoting baselines by up to 4 ROUGE-L and is also favored by GPT-4.

7/22/2024

Low-Resource Machine Translation through Retrieval-Augmented LLM Prompting: A Study on the Mambai Language

Raphael Merx, Aso Mahmudi, Katrina Langford, Leo Alberto de Araujo, Ekaterina Vylomova

This study explores the use of large language models (LLMs) for translating English into Mambai, a low-resource Austronesian language spoken in Timor-Leste, with approximately 200,000 native speakers. Leveraging a novel corpus derived from a Mambai language manual and additional sentences translated by a native speaker, we examine the efficacy of few-shot LLM prompting for machine translation (MT) in this low-resource context. Our methodology involves the strategic selection of parallel sentences and dictionary entries for prompting, aiming to enhance translation accuracy, using open-source and proprietary LLMs (LlaMa 2 70b, Mixtral 8x7B, GPT-4). We find that including dictionary entries in prompts and a mix of sentences retrieved through TF-IDF and semantic embeddings significantly improves translation quality. However, our findings reveal stark disparities in translation performance across test sets, with BLEU scores reaching as high as 21.2 on materials from the language manual, in contrast to a maximum of 4.4 on a test set provided by a native speaker. These results underscore the importance of diverse and representative corpora in assessing MT for low-resource languages. Our research provides insights into few-shot LLM prompting for low-resource MT, and makes available an initial corpus for the Mambai language.

4/9/2024

💬

New!Large Language Models are Good Multi-lingual Learners : When LLMs Meet Cross-lingual Prompts

Teng Wang, Zhenqi He, Wing-Yin Yu, Xiaojin Fu, Xiongwei Han

With the advent of Large Language Models (LLMs), generating rule-based data for real-world applications has become more accessible. Due to the inherent ambiguity of natural language and the complexity of rule sets, especially in long contexts, LLMs often struggle to follow all specified rules, frequently omitting at least one. To enhance the reasoning and understanding of LLMs on long and complex contexts, we propose a novel prompting strategy Multi-Lingual Prompt, namely MLPrompt, which automatically translates the error-prone rule that an LLM struggles to follow into another language, thus drawing greater attention to it. Experimental results on public datasets across various tasks have shown MLPrompt can outperform state-of-the-art prompting methods such as Chain of Thought, Tree of Thought, and Self-Consistency. Additionally, we introduce a framework integrating MLPrompt with an auto-checking mechanism for structured data generation, with a specific case study in text-to-MIP instances. Further, we extend the proposed framework for text-to-SQL to demonstrate its generation ability towards structured data synthesis.

9/18/2024

Learning-From-Mistakes Prompting for Indigenous Language Translation

You-Cheng Liao, Chen-Jui Yu, Chi-Yi Lin, He-Feng Yun, Yen-Hsiang Wang, Hsiao-Min Li, Yao-Chung Fan

Using large language models, this paper presents techniques to improve extremely low-resourced indigenous language translations. Our approaches are grounded in the use of (1) the presence of a datastore consisting of a limited number of parallel translation examples, (2) the inherent capabilities of LLMs like GPT-3.5, and (3) a word-level translation dictionary. We harness the potential of LLMs and in-context learning techniques in such a setting for using LLMs as universal translators for extremely low-resourced languages. Our methodology hinges on utilizing LLMs as language compilers for selected language pairs, hypothesizing that they could internalize syntactic structures to facilitate accurate translation. We introduce three techniques: KNNPrompting with Retrieved Prompting Context, Chain-of-Thought Prompting and Learningfrom-Mistakes Prompting, with the last method addressing past errors. The evaluation results suggest that, even with limited corpora, LLMs can effectively translate extremely low-resource languages when paired with proper prompting.

7/19/2024