MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models

Read original: arXiv:2401.07598 - Published 7/23/2024 by Divyanshu Aggarwal, Ashutosh Sathe, Ishaan Watts, Sunayana Sitaram

💬

Overview

Parameter Efficient Finetuning (PEFT) is a technique for improving the performance of Large Language Models (LLMs) without requiring massive resources and compute.
Prior research has shown a significant performance gap between LLMs on English and other languages, as well as between smaller open-source models and larger LLMs.
Finetuning can help bridge this gap and make language models more equitable.

Plain English Explanation

Parameter Efficient Finetuning (PEFT) is a technique that can be used to enhance the performance of Large Language Models (LLMs) without the need for extensive resources and computing power. Previous research has revealed a substantial gap in the performance of these models when used for English compared to other languages. Additionally, there is a significant disparity in the performance of smaller open-source models and the larger, more powerful LLMs.

Finetuning can be an effective way to bridge this gap and make language models more accessible and equitable across different languages. In this study, the researchers finetune the LLaMA-2-7B and Mistral-7B models on synthetic multilingual instruction datasets to assess the impact on model performance across six downstream tasks covering forty languages.

The researchers also experiment with various parameters, such as rank for low-rank adaptation and quantisation values, to determine their effects on the performance of the models on low-resource languages. They find that higher rank and higher quantisation values can benefit low-resource languages, while finetuning can sometimes improve performance on these languages while degrading performance on high-resource languages.

Technical Explanation

In this work, the researchers investigate the use of Parameter Efficient Finetuning (PEFT) to enhance the performance of Large Language Models (LLMs) on multilingual tasks. They finetune the LLaMA-2-7B and Mistral-7B models on two synthetic multilingual instruction tuning datasets to evaluate the impact on model performance across six downstream tasks covering forty languages.

The researchers experiment with various parameters, such as rank for low-rank adaptation and quantisation values, to determine their effects on the downstream performance of the models. They find that higher rank and higher quantisation values can benefit low-resource languages, while finetuning can sometimes improve performance on these languages while degrading performance on high-resource languages.

Critical Analysis

The paper provides a comprehensive evaluation of the use of Parameter Efficient Finetuning (PEFT) to improve the performance of Large Language Models (LLMs) on multilingual tasks. However, the authors acknowledge several limitations and areas for further research.

One limitation is that the evaluation is based on synthetic multilingual instruction datasets, which may not fully capture the complexity and nuances of real-world multilingual data. Additionally, the authors note that the performance trade-offs between high-resource and low-resource languages observed in their experiments may be an area for further investigation.

The paper also raises the question of whether the benefits of PEFT in improving the performance of smaller open-source models can be maintained while avoiding potential degradation in English performance, which is an important consideration for practical applications.

Conclusion

This study demonstrates the potential of Parameter Efficient Finetuning (PEFT) to enhance the performance of Large Language Models (LLMs) on multilingual tasks. By finetuning LLaMA-2-7B and Mistral-7B models on synthetic multilingual instruction datasets, the researchers were able to identify strategies, such as using higher rank and higher quantisation values, that can benefit low-resource languages. However, the potential trade-offs in performance on high-resource languages remain an area for further exploration.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

💬

MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models

Divyanshu Aggarwal, Ashutosh Sathe, Ishaan Watts, Sunayana Sitaram

Parameter Efficient Finetuning (PEFT) has emerged as a viable solution for improving the performance of Large Language Models (LLMs) without requiring massive resources and compute. Prior work on multilingual evaluation has shown that there is a large gap between the performance of LLMs on English and other languages. Further, there is also a large gap between the performance of smaller open-source models and larger LLMs. Finetuning can be an effective way to bridge this gap and make language models more equitable. In this work, we finetune the LLama-2-7B and Mistral-7B models on two synthetic multilingual instruction tuning datasets to determine its effect on model performance on six downstream tasks covering forty languages in all. Additionally, we experiment with various parameters, such as rank for low-rank adaptation and values of quantisation to determine their effects on downstream performance and find that higher rank and higher quantisation values benefit low-resource languages. We find that PEFT of smaller open-source models sometimes bridges the gap between the performance of these models and the larger ones, however, English performance can take a hit. We also find that finetuning sometimes improves performance on low-resource languages, while degrading performance on high-resource languages.

7/23/2024

Unlocking Parameter-Efficient Fine-Tuning for Low-Resource Language Translation

Tong Su, Xin Peng, Sarubi Thillainathan, David Guzm'an, Surangika Ranathunga, En-Shiun Annie Lee

Parameter-efficient fine-tuning (PEFT) methods are increasingly vital in adapting large-scale pre-trained language models for diverse tasks, offering a balance between adaptability and computational efficiency. They are important in Low-Resource Language (LRL) Neural Machine Translation (NMT) to enhance translation accuracy with minimal resources. However, their practical effectiveness varies significantly across different languages. We conducted comprehensive empirical experiments with varying LRL domains and sizes to evaluate the performance of 8 PEFT methods with in total of 15 architectures using the SacreBLEU score. We showed that 6 PEFT architectures outperform the baseline for both in-domain and out-domain tests and the Houlsby+Inversion adapter has the best performance overall, proving the effectiveness of PEFT methods.

4/8/2024

An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models

Xiongtao Zhou, Jie He, Yuhua Ke, Guangyao Zhu, V'ictor Guti'errez-Basulto, Jeff Z. Pan

Multimodal large language models (MLLMs) fine-tuned with multimodal instruction datasets have demonstrated remarkable capabilities in multimodal tasks. However, fine-tuning all parameters of MLLMs has become challenging as they usually contain billions of parameters. To address this issue, we study parameter-efficient fine-tuning (PEFT) methods for MLLMs. We aim to identify effective methods for enhancing the performance of MLLMs in scenarios where only a limited number of parameters are trained. This paper conducts empirical studies using four popular PEFT methods to fine-tune the LLM component of open-source MLLMs. We present a comprehensive analysis that encompasses various aspects, including the impact of PEFT methods on various models, parameters and location of the PEFT module, size of fine-tuning data, model stability based on PEFT methods, MLLM's generalization, and hallucination. We evaluated four PEFT methods on seven datasets from two different categories: unseen and seen datasets. Across all experiments, we show that the adapter is the best-performing PEFT method. At the same time, fine-tuning the connector layers leads to improved performance in most MLLMs. Code and data are available at https://github.com/alenai97/PEFT-MLLM.git.

6/10/2024

Leveraging Parameter Efficient Training Methods for Low Resource Text Classification: A Case Study in Marathi

Pranita Deshmukh, Nikita Kulkarni, Sanhita Kulkarni, Kareena Manghani, Raviraj Joshi

With the surge in digital content in low-resource languages, there is an escalating demand for advanced Natural Language Processing (NLP) techniques tailored to these languages. BERT (Bidirectional Encoder Representations from Transformers), serving as the foundational framework for numerous NLP architectures and language models, is increasingly employed for the development of low-resource NLP models. Parameter Efficient Fine-Tuning (PEFT) is a method for fine-tuning Large Language Models (LLMs) and reducing the training parameters to some extent to decrease the computational costs needed for training the model and achieve results comparable to a fully fine-tuned model. In this work, we present a study of PEFT methods for the Indic low-resource language Marathi. We conduct a comprehensive analysis of PEFT methods applied to various monolingual and multilingual Marathi BERT models. These approaches are evaluated on prominent text classification datasets like MahaSent, MahaHate, and MahaNews. The incorporation of PEFT techniques is demonstrated to significantly expedite the training speed of the models, addressing a critical aspect of model development and deployment. In this study, we explore Low-Rank Adaptation of Large Language Models (LoRA) and adapter methods for low-resource text classification. We show that these methods are competitive with full fine-tuning and can be used without loss in accuracy. This study contributes valuable insights into the effectiveness of Marathi BERT models, offering a foundation for the continued advancement of NLP capabilities in Marathi and similar Indic languages.

8/7/2024