A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers

2405.10936

Published 5/20/2024 by Kaiyu Huang, Fengran Mo, Hongliang Li, You Li, Yuanchi Zhang, Weijian Yi, Yulong Mao, Jinchen Liu, Yuzhuang Xu, Jinan Xu and 2 others

cs.CL cs.AI

A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers

Abstract

The rapid development of Large Language Models (LLMs) demonstrates remarkable multilingual capabilities in natural language processing, attracting global attention in both academia and industry. To mitigate potential discrimination and enhance the overall usability and accessibility for diverse language user groups, it is important for the development of language-fair technology. Despite the breakthroughs of LLMs, the investigation into the multilingual scenario remains insufficient, where a comprehensive survey to summarize recent approaches, developments, limitations, and potential solutions is desirable. To this end, we provide a survey with multiple perspectives on the utilization of LLMs in the multilingual scenario. We first rethink the transitions between previous and current research on pre-trained language models. Then we introduce several perspectives on the multilingualism of LLMs, including training and inference methods, model security, multi-domain with language culture, and usage of datasets. We also discuss the major challenges that arise in these aspects, along with possible solutions. Besides, we highlight future research directions that aim at further enhancing LLMs with multilingualism. The survey aims to help the research community address multilingual problems and provide a comprehensive understanding of the core concepts, key techniques, and latest developments in multilingual natural language processing based on LLMs.

Create account to get full access

Overview

This paper provides a comprehensive survey of recent advancements in large language models (LLMs) with a focus on multilingualism.
It covers topics such as multilingual language model resources and taxonomy, multimodal and efficient LLMs, and the use of LLMs in education.
The survey also explores the foundations and techniques underlying the development of these powerful AI models.

Plain English Explanation

The paper discusses the recent progress in large language models (LLMs) that can understand and generate text in multiple languages. These models have become increasingly sophisticated, with the ability to handle not just text, but also images, audio, and other modalities.

The researchers provide an overview of the various resources and taxonomies available for multilingual LLMs, making it easier for developers and researchers to navigate this rapidly evolving field. They also explore how LLMs can be made more efficient, allowing them to be deployed in a wider range of applications, including educational settings.

The survey delves into the fundamental techniques and approaches used to create these impressive AI models, shedding light on the complex algorithms and vast datasets that power their capabilities. By understanding the foundations of LLMs, the research community can continue to push the boundaries of what these models can achieve.

Technical Explanation

The paper begins by introducing the concept of large language models (LLMs) and their growing importance in the field of natural language processing. It then presents a comprehensive survey of recent advancements in LLMs with a focus on multilingualism.

The researchers provide an in-depth look at the resources and taxonomies available for multilingual LLMs, making it easier for developers and researchers to navigate this rapidly evolving field. They also explore multimodal and efficient LLM architectures, which can handle a wider range of data types and operate more efficiently.

The survey also delves into the use of LLMs in educational settings, highlighting their potential to transform the way we approach learning and knowledge acquisition.

Finally, the paper examines the foundations and techniques underlying the development of these powerful AI models, providing insights into the complex algorithms and vast datasets that enable their impressive capabilities.

Critical Analysis

The paper provides a comprehensive and well-researched overview of the latest advancements in multilingual large language models. However, it does not address some potential limitations or areas for further research.

For instance, the paper does not discuss the ethical implications of deploying these powerful models, such as the risk of perpetuating biases or the potential for misuse. Additionally, the survey does not delve into the computational and energy-related challenges associated with training and deploying LLMs at scale.

Furthermore, the paper could have explored the integration of LLMs with other AI techniques, such as reinforcement learning or knowledge-based systems, to further enhance their capabilities and versatility.

Despite these minor shortcomings, the survey offers a valuable resource for researchers and practitioners working in the field of natural language processing and AI.

Conclusion

This paper provides a thorough and insightful overview of the recent advancements in large language models with a focus on multilingualism. By exploring the various resources, taxonomies, and techniques underlying these powerful AI models, the researchers have laid the groundwork for further innovation and research in this rapidly evolving field.

The survey's emphasis on efficiency, multimodality, and educational applications highlights the potential of LLMs to transform a wide range of industries and domains. As the field continues to progress, it will be crucial to address the ethical and practical challenges associated with the deployment of these models to ensure their responsible and beneficial use.

Overall, this paper serves as a valuable reference for anyone interested in the latest developments and future directions of large language models.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers

Libo Qin, Qiguang Chen, Yuhang Zhou, Zhi Chen, Yinghui Li, Lizi Liao, Min Li, Wanxiang Che, Philip S. Yu

Multilingual Large Language Models are capable of using powerful Large Language Models to handle and respond to queries in multiple languages, which achieves remarkable success in multilingual natural language processing tasks. Despite these breakthroughs, there still remains a lack of a comprehensive survey to summarize existing approaches and recent developments in this field. To this end, in this paper, we present a thorough review and provide a unified perspective to summarize the recent progress as well as emerging trends in multilingual large language models (MLLMs) literature. The contributions of this paper can be summarized: (1) First survey: to our knowledge, we take the first step and present a thorough review in MLLMs research field according to multi-lingual alignment; (2) New taxonomy: we offer a new and unified perspective to summarize the current progress of MLLMs; (3) New frontiers: we highlight several emerging frontiers and discuss the corresponding challenges; (4) Abundant resources: we collect abundant open-source resources, including relevant papers, data corpora, and leaderboards. We hope our work can provide the community with quick access and spur breakthrough research in MLLMs.

4/9/2024

cs.CL

A Survey on Multilingual Large Language Models: Corpora, Alignment, and Bias

Yuemei Xu, Ling Hu, Jiayi Zhao, Zihan Qiu, Yuqi Ye, Hanwen Gu

Based on the foundation of Large Language Models (LLMs), Multilingual Large Language Models (MLLMs) have been developed to address the challenges of multilingual natural language processing tasks, hoping to achieve knowledge transfer from high-resource to low-resource languages. However, significant limitations and challenges still exist, such as language imbalance, multilingual alignment, and inherent bias. In this paper, we aim to provide a comprehensive analysis of MLLMs, delving deeply into discussions surrounding these critical issues. First of all, we start by presenting an overview of MLLMs, covering their evolution, key techniques, and multilingual capacities. Secondly, we explore widely utilized multilingual corpora for MLLMs' training and multilingual datasets oriented for downstream tasks that are crucial for enhancing the cross-lingual capability of MLLMs. Thirdly, we survey the existing studies on multilingual representations and investigate whether the current MLLMs can learn a universal language representation. Fourthly, we discuss bias on MLLMs including its category and evaluation metrics, and summarize the existing debiasing techniques. Finally, we discuss existing challenges and point out promising research directions. By demonstrating these aspects, this paper aims to facilitate a deeper understanding of MLLMs and their potentiality in various domains.

6/7/2024

cs.CL cs.AI

MM-LLMs: Recent Advances in MultiModal Large Language Models

Duzhen Zhang, Yahan Yu, Jiahua Dong, Chenxing Li, Dan Su, Chenhui Chu, Dong Yu

In the past year, MultiModal Large Language Models (MM-LLMs) have undergone substantial advancements, augmenting off-the-shelf LLMs to support MM inputs or outputs via cost-effective training strategies. The resulting models not only preserve the inherent reasoning and decision-making capabilities of LLMs but also empower a diverse range of MM tasks. In this paper, we provide a comprehensive survey aimed at facilitating further research of MM-LLMs. Initially, we outline general design formulations for model architecture and training pipeline. Subsequently, we introduce a taxonomy encompassing 126 MM-LLMs, each characterized by its specific formulations. Furthermore, we review the performance of selected MM-LLMs on mainstream benchmarks and summarize key training recipes to enhance the potency of MM-LLMs. Finally, we explore promising directions for MM-LLMs while concurrently maintaining a real-time tracking website for the latest developments in the field. We hope that this survey contributes to the ongoing advancement of the MM-LLMs domain.

5/29/2024

cs.CL

💬

Efficient Large Language Models: A Survey

Zhongwei Wan, Xin Wang, Che Liu, Samiul Alam, Yu Zheng, Jiachen Liu, Zhongnan Qu, Shen Yan, Yi Zhu, Quanlu Zhang, Mosharaf Chowdhury, Mi Zhang

Large Language Models (LLMs) have demonstrated remarkable capabilities in important tasks such as natural language understanding and language generation, and thus have the potential to make a substantial impact on our society. Such capabilities, however, come with the considerable resources they demand, highlighting the strong need to develop effective techniques for addressing their efficiency challenges. In this survey, we provide a systematic and comprehensive review of efficient LLMs research. We organize the literature in a taxonomy consisting of three main categories, covering distinct yet interconnected efficient LLMs topics from model-centric, data-centric, and framework-centric perspective, respectively. We have also created a GitHub repository where we organize the papers featured in this survey at https://github.com/AIoT-MLSys-Lab/Efficient-LLMs-Survey. We will actively maintain the repository and incorporate new research as it emerges. We hope our survey can serve as a valuable resource to help researchers and practitioners gain a systematic understanding of efficient LLMs research and inspire them to contribute to this important and exciting field.

5/24/2024

cs.CL cs.AI