Next Visit Diagnosis Prediction via Medical Code-Centric Multimodal Contrastive EHR Modelling with Hierarchical Regularisation

2401.11648

Published 5/2/2024 by Heejoon Koo

Next Visit Diagnosis Prediction via Medical Code-Centric Multimodal Contrastive EHR Modelling with Hierarchical Regularisation

Abstract

Predicting next visit diagnosis using Electronic Health Records (EHR) is an essential task in healthcare, critical for devising proactive future plans for both healthcare providers and patients. Nonetheless, many preceding studies have not sufficiently addressed the heterogeneous and hierarchical characteristics inherent in EHR data, inevitably leading to sub-optimal performance. To this end, we propose NECHO, a novel medical code-centric multimodal contrastive EHR learning framework with hierarchical regularisation. First, we integrate multifaceted information encompassing medical codes, demographics, and clinical notes using a tailored network design and a pair of bimodal contrastive losses, all of which pivot around a medical codes representation. We also regularise modality-specific encoders using a parental level information in medical ontology to learn hierarchical structure of EHR data. A series of experiments on MIMIC-III data demonstrates effectiveness of our approach.

Create account to get full access

Overview

This research paper proposes a novel approach for predicting a patient's next visit diagnosis using electronic health records (EHRs).
The key innovations include a medical code-centric multimodal contrastive EHR modelling technique and a hierarchical regularization method.
The model aims to leverage the rich information in EHRs to make more accurate predictions and provide insights for healthcare professionals.

Plain English Explanation

Electronic health records (EHRs) contain a wealth of information about a patient's medical history, including their diagnoses, treatments, and other relevant data. Researchers have been working on ways to use this information to predict a patient's future health outcomes, such as their next visit diagnosis.

The authors of this paper developed a new method for making these predictions. Their approach involves using multimodal contrastive learning, which means they combine different types of data (like text, images, and structured medical codes) and then train the model to identify the connections between them. This allows the model to learn a more comprehensive representation of the patient's health status.

Additionally, the researchers used a hierarchical regularization technique to encourage the model to learn the relationships between different medical codes, such as the connections between broad disease categories and more specific diagnoses. This helps the model make more nuanced and accurate predictions.

By combining these innovative techniques, the researchers were able to create a model that can more accurately predict a patient's next visit diagnosis based on their EHR data. This could be very useful for healthcare providers, as it could help them better prepare for a patient's future needs and potentially intervene earlier to prevent or manage certain conditions.

Technical Explanation

The paper proposes a medical code-centric multimodal contrastive EHR modelling approach for next visit diagnosis prediction. The key components of the model include:

Multimodal Contrastive Learning: The model leverages different modalities of EHR data, such as text notes, medical codes, and demographic information, and learns a joint representation through contrastive learning. This allows the model to capture the complex relationships between these diverse data sources.
Hierarchical Regularization: The authors introduce a hierarchical regularization method to encourage the model to learn the relationships between medical codes at different levels of the medical code hierarchy. This helps the model make more accurate and interpretable predictions.
Next Visit Diagnosis Prediction: The learned representations are used to predict the diagnoses that will be recorded in the patient's next visit. This can help healthcare providers anticipate and prepare for a patient's future medical needs.

The researchers evaluated their approach on several real-world EHR datasets and demonstrated its superior performance compared to state-of-the-art methods, such as those presented in [[https://aimodels.fyi/papers/arxiv/global-contrastive-training-multimodal-electronic-health-records|this paper]]. The model was also shown to provide more interpretable insights by highlighting the most relevant medical codes for a given prediction.

Critical Analysis

One potential limitation of the proposed approach is that it relies on the availability of structured medical codes in the EHR data. In some healthcare settings, the medical coding may be incomplete or inconsistent, which could impact the model's performance. The authors acknowledge this issue and suggest exploring ways to incorporate unstructured clinical notes more effectively.

Additionally, the paper does not discuss the ethical implications of using AI-powered prediction models in healthcare. There are concerns around bias, privacy, and the potential misuse of such technologies. The authors could have addressed these important considerations to provide a more well-rounded analysis of their work.

Finally, while the hierarchical regularization technique is a novel contribution, the authors could have explored alternative ways to capture the hierarchical relationships between medical codes, such as incorporating domain knowledge from medical ontologies or using graph neural networks.

Conclusion

This research paper presents a promising approach for next visit diagnosis prediction using electronic health records. By leveraging multimodal contrastive learning and hierarchical regularization, the model is able to make more accurate and interpretable predictions, which could ultimately help healthcare providers better anticipate and manage their patients' medical needs.

The authors have demonstrated the effectiveness of their method on real-world datasets, and their work contributes to the ongoing efforts to develop AI-powered tools that can enhance clinical decision-making and improve patient outcomes. As the field of healthcare AI continues to evolve, it will be important to address the ethical and practical considerations surrounding the use of these technologies in a responsible and thoughtful manner.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Global Contrastive Training for Multimodal Electronic Health Records with Language Supervision

Yingbo Ma, Suraj Kolla, Zhenhong Hu, Dhruv Kaliraman, Victoria Nolan, Ziyuan Guan, Yuanfang Ren, Brooke Armfield, Tezcan Ozrazgat-Baslanti, Jeremy A. Balch, Tyler J. Loftus, Parisa Rashidi, Azra Bihorac, Benjamin Shickel

Modern electronic health records (EHRs) hold immense promise in tracking personalized patient health trajectories through sequential deep learning, owing to their extensive breadth, scale, and temporal granularity. Nonetheless, how to effectively leverage multiple modalities from EHRs poses significant challenges, given its complex characteristics such as high dimensionality, multimodality, sparsity, varied recording frequencies, and temporal irregularities. To this end, this paper introduces a novel multimodal contrastive learning framework, specifically focusing on medical time series and clinical notes. To tackle the challenge of sparsity and irregular time intervals in medical time series, the framework integrates temporal cross-attention transformers with a dynamic embedding and tokenization scheme for learning multimodal feature representations. To harness the interconnected relationships between medical time series and clinical notes, the framework equips a global contrastive loss, aligning a patient's multimodal feature representations with the corresponding discharge summaries. Since discharge summaries uniquely pertain to individual patients and represent a holistic view of the patient's hospital stay, machine learning models are led to learn discriminative multimodal features via global contrasting. Extensive experiments with a real-world EHR dataset demonstrated that our framework outperformed state-of-the-art approaches on the exemplar task of predicting the occurrence of nine postoperative complications for more than 120,000 major inpatient surgeries using multimodal data from UF health system split among three hospitals (UF Health Gainesville, UF Health Jacksonville, and UF Health Jacksonville-North).

4/11/2024

cs.LG cs.CL

Synthesizing Multimodal Electronic Health Records via Predictive Diffusion Models

Yuan Zhong, Xiaochen Wang, Jiaqi Wang, Xiaokun Zhang, Yaqing Wang, Mengdi Huai, Cao Xiao, Fenglong Ma

Synthesizing electronic health records (EHR) data has become a preferred strategy to address data scarcity, improve data quality, and model fairness in healthcare. However, existing approaches for EHR data generation predominantly rely on state-of-the-art generative techniques like generative adversarial networks, variational autoencoders, and language models. These methods typically replicate input visits, resulting in inadequate modeling of temporal dependencies between visits and overlooking the generation of time information, a crucial element in EHR data. Moreover, their ability to learn visit representations is limited due to simple linear mapping functions, thus compromising generation quality. To address these limitations, we propose a novel EHR data generation model called EHRPD. It is a diffusion-based model designed to predict the next visit based on the current one while also incorporating time interval estimation. To enhance generation quality and diversity, we introduce a novel time-aware visit embedding module and a pioneering predictive denoising diffusion probabilistic model (PDDPM). Additionally, we devise a predictive U-Net (PU-Net) to optimize P-DDPM.We conduct experiments on two public datasets and evaluate EHRPD from fidelity, privacy, and utility perspectives. The experimental results demonstrate the efficacy and utility of the proposed EHRPD in addressing the aforementioned limitations and advancing EHR data generation.

6/21/2024

cs.LG

Continuous Predictive Modeling of Clinical Notes and ICD Codes in Patient Health Records

Mireia Hernandez Caralt, Clarence Boon Liang Ng, Marek Rei

Electronic Health Records (EHR) serve as a valuable source of patient information, offering insights into medical histories, treatments, and outcomes. Previous research has developed systems for detecting applicable ICD codes that should be assigned while writing a given EHR document, mainly focusing on discharge summaries written at the end of a hospital stay. In this work, we investigate the potential of predicting these codes for the whole patient stay at different time points during their stay, even before they are officially assigned by clinicians. The development of methods to predict diagnoses and treatments earlier in advance could open opportunities for predictive medicine, such as identifying disease risks sooner, suggesting treatments, and optimizing resource allocation. Our experiments show that predictions regarding final ICD codes can be made already two days after admission and we propose a custom model that improves performance on this early prediction task.

5/21/2024

cs.CL cs.LG

EMERGE: Integrating RAG for Improved Multimodal EHR Predictive Modeling

Yinghao Zhu, Changyu Ren, Zixiang Wang, Xiaochen Zheng, Shiyun Xie, Junlan Feng, Xi Zhu, Zhoujun Li, Liantao Ma, Chengwei Pan

The integration of multimodal Electronic Health Records (EHR) data has notably advanced clinical predictive capabilities. However, current models that utilize clinical notes and multivariate time-series EHR data often lack the necessary medical context for precise clinical tasks. Previous methods using knowledge graphs (KGs) primarily focus on structured knowledge extraction. To address this, we propose EMERGE, a Retrieval-Augmented Generation (RAG) driven framework aimed at enhancing multimodal EHR predictive modeling. Our approach extracts entities from both time-series data and clinical notes by prompting Large Language Models (LLMs) and aligns them with professional PrimeKG to ensure consistency. Beyond triplet relationships, we include entities' definitions and descriptions to provide richer semantics. The extracted knowledge is then used to generate task-relevant summaries of patients' health statuses. These summaries are fused with other modalities utilizing an adaptive multimodal fusion network with cross-attention. Extensive experiments on the MIMIC-III and MIMIC-IV datasets for in-hospital mortality and 30-day readmission tasks demonstrate the superior performance of the EMERGE framework compared to baseline models. Comprehensive ablation studies and analyses underscore the efficacy of each designed module and the framework's robustness to data sparsity. EMERGE significantly enhances the use of multimodal EHR data in healthcare, bridging the gap with nuanced medical contexts crucial for informed clinical predictions.

6/4/2024

cs.CL cs.AI cs.LG