A Survey on Large Language Models from General Purpose to Medical Applications: Datasets, Methodologies, and Evaluations

2406.10303

Published 6/18/2024 by Jinqiang Wang, Huansheng Ning, Yi Peng, Qikai Wei, Daniel Tesfai, Wenwei Mao, Tao Zhu, Runhe Huang

cs.CL cs.AI

A Survey on Large Language Models from General Purpose to Medical Applications: Datasets, Methodologies, and Evaluations

Abstract

Large Language Models (LLMs) have demonstrated surprising performance across various natural language processing tasks. Recently, medical LLMs enhanced with domain-specific knowledge have exhibited excellent capabilities in medical consultation and diagnosis. These models can smoothly simulate doctor-patient dialogues and provide professional medical advice. Most medical LLMs are developed through continued training of open-source general LLMs, which require significantly fewer computational resources than training LLMs from scratch. Additionally, this approach offers better protection of patient privacy compared to API-based solutions. This survey systematically explores how to train medical LLMs based on general LLMs. It covers: (a) how to acquire training corpus and construct customized medical training sets, (b) how to choose a appropriate training paradigm, (c) how to choose a suitable evaluation benchmark, and (d) existing challenges and promising future research directions are discussed. This survey can provide guidance for the development of LLMs focused on various medical applications, such as medical education, diagnostic planning, and clinical assistants.

Create account to get full access

Overview

This paper provides a comprehensive survey of large language models (LLMs) and their applications in the medical and healthcare domains.
It examines the datasets, methodologies, and evaluation approaches used in developing and assessing LLMs for medical tasks.
The survey covers the progress and advancements in the field, as well as the challenges and limitations of using LLMs in medical and healthcare settings.

Plain English Explanation

Large language models (LLMs) are powerful AI systems that can understand and generate human-like text. These models have become increasingly important in various domains, including medicine and healthcare. This paper provides an overview of how LLMs are being developed and used for medical applications.

The researchers looked at the different datasets, methods, and ways of evaluating the performance of LLMs in medical tasks. They examined the progress that has been made in this area and the challenges that still need to be addressed. For example, using LLMs in sensitive healthcare settings requires careful consideration of privacy, security, and ethical concerns.

Overall, the paper gives a comprehensive survey of the current state of large language models in medicine and healthcare, including both the promising advancements and the remaining issues that need to be worked on.

Technical Explanation

The paper begins by introducing large language models (LLMs) and their growing importance in various domains, including medicine and healthcare. It then provides a detailed survey of the datasets, methodologies, and evaluation approaches used in developing and assessing LLMs for medical tasks.

The researchers examine the progress that has been made in applying LLMs to medical and healthcare-related problems, such as clinical decision support, medical information extraction, and patient communication. They also discuss the challenges and limitations of using LLMs in these sensitive domains, including issues around privacy, security, and ethical concerns.

The paper also covers the multimodal capabilities of LLMs, which can integrate various types of data (e.g., text, images, and audio) to enhance their performance in medical applications. The researchers provide a comprehensive overview of the current state of the art in this area and highlight the potential for further advancements.

Overall, the paper offers a thorough and insightful survey of the use of large language models in medicine and healthcare, covering both the promising developments and the ongoing challenges that researchers and practitioners must address.

Critical Analysis

The paper provides a comprehensive and well-researched survey of the use of large language models in medical and healthcare applications. The authors have done an excellent job of covering the key datasets, methodologies, and evaluation approaches used in this domain, as well as the progress and challenges that have emerged.

One area that could have been explored in more depth is the potential for bias and fairness issues in medical LLMs. While the paper mentions these concerns, a more detailed discussion of the steps being taken to address them would have been valuable. Additionally, the paper could have delved deeper into the ethical implications of using LLMs in sensitive healthcare settings, such as the risks of data privacy breaches or the potential for misuse of the technology.

Overall, the paper is a valuable resource for researchers and practitioners working in the field of medical AI, and it sets the stage for further advancements and discussions around the responsible development and deployment of large language models in healthcare.

Conclusion

This comprehensive survey of large language models in medical and healthcare applications provides a valuable overview of the current state of the art in this important and rapidly evolving field. The paper covers the key datasets, methodologies, and evaluation approaches used in this domain, as well as the progress that has been made and the challenges that still need to be addressed.

The insights and findings presented in this paper have significant implications for the future of medical AI, as large language models continue to play an increasingly important role in clinical decision-making, patient communication, and other critical healthcare tasks. By understanding the current capabilities and limitations of these models, researchers and practitioners can work towards developing more robust, reliable, and ethically-sound LLMs that can truly transform the way we deliver healthcare.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

💬

Evaluating large language models in medical applications: a survey

Xiaolan Chen, Jiayang Xiang, Shanfu Lu, Yexin Liu, Mingguang He, Danli Shi

Large language models (LLMs) have emerged as powerful tools with transformative potential across numerous domains, including healthcare and medicine. In the medical domain, LLMs hold promise for tasks ranging from clinical decision support to patient education. However, evaluating the performance of LLMs in medical contexts presents unique challenges due to the complex and critical nature of medical information. This paper provides a comprehensive overview of the landscape of medical LLM evaluation, synthesizing insights from existing studies and highlighting evaluation data sources, task scenarios, and evaluation methods. Additionally, it identifies key challenges and opportunities in medical LLM evaluation, emphasizing the need for continued research and innovation to ensure the responsible integration of LLMs into clinical practice.

5/14/2024

cs.CL cs.AI

💬

Large Language Models for Medicine: A Survey

Yanxin Zheng, Wensheng Gan, Zefeng Chen, Zhenlian Qi, Qian Liang, Philip S. Yu

To address challenges in the digital economy's landscape of digital intelligence, large language models (LLMs) have been developed. Improvements in computational power and available resources have significantly advanced LLMs, allowing their integration into diverse domains for human life. Medical LLMs are essential application tools with potential across various medical scenarios. In this paper, we review LLM developments, focusing on the requirements and applications of medical LLMs. We provide a concise overview of existing models, aiming to explore advanced research directions and benefit researchers for future medical applications. We emphasize the advantages of medical LLMs in applications, as well as the challenges encountered during their development. Finally, we suggest directions for technical integration to mitigate challenges and potential research directions for the future of medical LLMs, aiming to meet the demands of the medical field better.

5/24/2024

cs.CL cs.AI cs.CY

💬

A Survey of Large Language Models in Medicine: Progress, Application, and Challenge

Hongjian Zhou, Fenglin Liu, Boyang Gu, Xinyu Zou, Jinfa Huang, Jinge Wu, Yiru Li, Sam S. Chen, Peilin Zhou, Junling Liu, Yining Hua, Chengfeng Mao, Chenyu You, Xian Wu, Yefeng Zheng, Lei Clifton, Zheng Li, Jiebo Luo, David A. Clifton

Large language models (LLMs), such as ChatGPT, have received substantial attention due to their capabilities for understanding and generating human language. While there has been a burgeoning trend in research focusing on the employment of LLMs in supporting different medical tasks (e.g., enhancing clinical diagnostics and providing medical education), a review of these efforts, particularly their development, practical applications, and outcomes in medicine, remains scarce. Therefore, this review aims to provide a detailed overview of the development and deployment of LLMs in medicine, including the challenges and opportunities they face. In terms of development, we provide a detailed introduction to the principles of existing medical LLMs, including their basic model structures, number of parameters, and sources and scales of data used for model development. It serves as a guide for practitioners in developing medical LLMs tailored to their specific needs. In terms of deployment, we offer a comparison of the performance of different LLMs across various medical tasks, and further compare them with state-of-the-art lightweight models, aiming to provide an understanding of the advantages and limitations of LLMs in medicine. Overall, in this review, we address the following questions: 1) What are the practices for developing medical LLMs 2) How to measure the medical task performance of LLMs in a medical setting? 3) How have medical LLMs been employed in real-world practice? 4) What challenges arise from the use of medical LLMs? and 5) How to more effectively develop and deploy medical LLMs? By answering these questions, this review aims to provide insights into the opportunities for LLMs in medicine and serve as a practical resource. We also maintain a regularly updated list of practical guides on medical LLMs at: https://github.com/AI-in-Health/MedLLMsPracticalGuide.

5/16/2024

cs.CL cs.AI

💬

A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics

Kai He, Rui Mao, Qika Lin, Yucheng Ruan, Xiang Lan, Mengling Feng, Erik Cambria

The utilization of large language models (LLMs) in the Healthcare domain has generated both excitement and concern due to their ability to effectively respond to freetext queries with certain professional knowledge. This survey outlines the capabilities of the currently developed LLMs for Healthcare and explicates their development process, with the aim of providing an overview of the development roadmap from traditional Pretrained Language Models (PLMs) to LLMs. Specifically, we first explore the potential of LLMs to enhance the efficiency and effectiveness of various Healthcare applications highlighting both the strengths and limitations. Secondly, we conduct a comparison between the previous PLMs and the latest LLMs, as well as comparing various LLMs with each other. Then we summarize related Healthcare training data, training methods, optimization strategies, and usage. Finally, the unique concerns associated with deploying LLMs in Healthcare settings are investigated, particularly regarding fairness, accountability, transparency and ethics. Our survey provide a comprehensive investigation from perspectives of both computer science and Healthcare specialty. Besides the discussion about Healthcare concerns, we supports the computer science community by compiling a collection of open source resources, such as accessible datasets, the latest methodologies, code implementations, and evaluation benchmarks in the Github. Summarily, we contend that a significant paradigm shift is underway, transitioning from PLMs to LLMs. This shift encompasses a move from discriminative AI approaches to generative AI approaches, as well as a shift from model-centered methodologies to data-centered methodologies. Also, we determine that the biggest obstacle of using LLMs in Healthcare are fairness, accountability, transparency and ethics.

6/12/2024

cs.CL