EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records

Read original: arXiv:2405.14567 - Published 5/27/2024 by Adibvafa Fallahpour, Mahshid Alinoori, Arash Afkanpour, Amrit Krishnan

Overview

Transformers have significantly advanced the modeling of Electronic Health Records (EHR), but their deployment in real-world healthcare is limited by several key challenges.
The quadratic computational cost and insufficient context length of these models pose obstacles for hospitals processing extensive medical histories in EHR data.
Existing models employ separate finetuning for each clinical task, complicating maintenance in healthcare environments.
These models focus exclusively on either clinical prediction or EHR forecasting, lacking the flexibility to perform well across both.

Plain English Explanation

Transformers, a type of machine learning model, have greatly improved how we can analyze and make predictions from electronic health records (EHR) data. However, there are still some significant challenges that prevent these models from being widely used in real-world healthcare settings.

One key issue is the computational cost: the time and resources required to process the data. The cost of a transformer's self-attention grows quadratically with sequence length, so doubling the length of a patient's record roughly quadruples the compute needed. This is a problem for hospitals, which often have very extensive medical histories for each patient in their EHR data.
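To make the scaling concrete, here is a minimal sketch that counts work units as a function of sequence length. The function names are illustrative, and the counts are a simplification: they ignore constant factors, hidden dimensions, and hardware effects, and only show how each cost grows.

```python
def attention_pair_count(n: int) -> int:
    """Token-pair scores computed by full self-attention over n tokens (n * n)."""
    return n * n


def linear_step_count(n: int) -> int:
    """State updates performed by a linear-time sequence model over n tokens (n)."""
    return n


if __name__ == "__main__":
    # Each row doubles the sequence length of the previous one.
    for n in (512, 1024, 2048):
        print(n, attention_pair_count(n), linear_step_count(n))
```

Going from 512 to 2048 tokens multiplies the linear count by 4 and the pairwise count by 16, which is why long medical histories are disproportionately expensive for attention-based models.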

Another challenge is the limited "context length" of these models. They can only consider a certain amount of information at a time when making predictions or forecasts. But healthcare data often includes long, detailed medical histories that these models struggle to fully account for.

Additionally, existing transformer models for EHR data require separate "finetuning" (additional training) for each specific clinical task, such as predicting a diagnosis or forecasting future health events. This makes it difficult to maintain and update the models in real-world healthcare settings.

Finally, these models tend to be specialized - focusing either on clinical prediction or EHR forecasting, but not both. Hospitals would benefit from more flexible models that can excel at a variety of healthcare-related tasks.

Technical Explanation

To address these limitations, the researchers introduce EHRMamba, a robust foundation model built on the Mamba architecture. EHRMamba can process sequences up to four times longer than previous models due to its linear computational cost.
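The linear cost comes from processing the sequence as a recurrence that carries a fixed-size state from one token to the next. The sketch below is a deliberately simplified, scalar state-space recurrence with fixed parameters; it is not Mamba's implementation (Mamba uses input-dependent parameters, a vector-valued state, and a hardware-aware scan), and the names `a`, `b`, and `c` are assumptions chosen for illustration.

```python
from typing import List


def linear_recurrence_scan(
    xs: List[float], a: float = 0.9, b: float = 1.0, c: float = 1.0
) -> List[float]:
    """Run h_t = a * h_{t-1} + b * x_t and emit y_t = c * h_t for each input.

    One state update per token, so total work is linear in len(xs).
    """
    h = 0.0  # fixed-size state carried across the whole sequence
    ys: List[float] = []
    for x in xs:
        h = a * h + b * x  # fold the new input into the running state
        ys.append(c * h)   # read the output from the current state
    return ys


if __name__ == "__main__":
    # An impulse at the first position decays geometrically in later outputs.
    print(linear_recurrence_scan([1.0, 0.0, 0.0], a=0.5))
```

Because the state has a fixed size, memory per step does not grow with the length of the history, in contrast to attention, which compares every token against every other token.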

The researchers also introduce a novel approach called Multitask Prompted Finetuning (MTF) for EHR data. This enables EHRMamba to simultaneously learn multiple clinical tasks in a single finetuning phase, significantly enhancing deployment and cross-task generalization.
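One way to picture prompted multitask finetuning is that each training example carries a marker telling the model which task it is being asked to solve, so a single model and a single finetuning run can cover every task. The sketch below assumes the prompt is a task token prepended to the patient's event sequence; the summary does not specify the exact mechanism, and the token strings, task names, and helper functions here are hypothetical.

```python
from typing import Dict, List

# Hypothetical task tokens; the names are illustrative, not taken from the paper.
TASK_TOKENS: Dict[str, str] = {
    "mortality": "[TASK_MORTALITY]",
    "readmission": "[TASK_READMISSION]",
    "length_of_stay": "[TASK_LOS]",
}


def build_prompted_example(events: List[str], task: str, label: int) -> Dict[str, object]:
    """Prepend the task token to one patient's event sequence."""
    if task not in TASK_TOKENS:
        raise ValueError(f"unknown task: {task}")
    return {"tokens": [TASK_TOKENS[task]] + list(events), "task": task, "label": label}


def build_multitask_dataset(events: List[str], labels: Dict[str, int]) -> List[Dict[str, object]]:
    """Create one prompted example per task for which the patient has a label."""
    return [build_prompted_example(events, task, y) for task, y in labels.items()]


if __name__ == "__main__":
    patient_events = ["DX_I10", "RX_METFORMIN", "LAB_HBA1C_HIGH"]
    for example in build_multitask_dataset(patient_events, {"mortality": 0, "readmission": 1}):
        print(example)
```

Mixing examples from all tasks into one training set is what removes the need to keep a separately finetuned model per task.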

Furthermore, EHRMamba leverages the HL7 FHIR data standard to simplify integration into existing hospital systems. Alongside EHRMamba, the researchers open-source Odyssey, a toolkit designed to support the development and deployment of EHR foundation models, with an emphasis on data standardization and interpretability.
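To illustrate why a shared data standard eases integration, the sketch below flattens a FHIR-style Bundle into a time-ordered list of coded events that a sequence model could consume. It is an assumption-laden simplification and not Odyssey's pipeline: it handles only three resource types, takes the first coding of each concept, and relies on FHIR R4 field names (`recordedDate`, `authoredOn`, `performedDateTime`, `medicationCodeableConcept`). The `FIELD_MAP` table and `bundle_to_events` function are hypothetical names.

```python
from typing import Any, Dict, List, Tuple

# For each supported resource type: (timestamp field, coded-concept field).
FIELD_MAP: Dict[str, Tuple[str, str]] = {
    "Condition": ("recordedDate", "code"),
    "MedicationRequest": ("authoredOn", "medicationCodeableConcept"),
    "Procedure": ("performedDateTime", "code"),
}


def bundle_to_events(bundle: Dict[str, Any]) -> List[Tuple[str, str]]:
    """Return (timestamp, code) pairs sorted by timestamp string.

    Sorting strings is only chronological when all timestamps share the
    same ISO 8601 format and timezone, which this sketch assumes.
    """
    events: List[Tuple[str, str]] = []
    for entry in bundle.get("entry", []):
        resource = entry.get("resource", {})
        fields = FIELD_MAP.get(resource.get("resourceType"))
        if fields is None:
            continue  # skip resource types this sketch does not handle
        time_field, concept_field = fields
        timestamp = resource.get(time_field)
        codings = resource.get(concept_field, {}).get("coding", [])
        code = codings[0].get("code") if codings else None
        if timestamp and code:
            events.append((timestamp, code))
    return sorted(events)


if __name__ == "__main__":
    bundle = {
        "resourceType": "Bundle",
        "entry": [
            {"resource": {"resourceType": "MedicationRequest", "authoredOn": "2024-02-01",
                          "medicationCodeableConcept": {"coding": [{"code": "RX1"}]}}},
            {"resource": {"resourceType": "Condition", "recordedDate": "2024-01-15",
                          "code": {"coding": [{"code": "I10"}]}}},
        ],
    }
    print(bundle_to_events(bundle))
```

Since any FHIR-compliant system exposes the same resource shapes, a converter along these lines would not need to be rewritten per hospital, which is the integration benefit the paragraph above describes.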

The researchers evaluate EHRMamba on the MIMIC-IV dataset, demonstrating that it advances state-of-the-art performance across 6 major clinical tasks and excels in EHR forecasting, marking a significant leap forward in the field.

Critical Analysis

The researchers acknowledge that while EHRMamba addresses several key challenges, there are still some limitations and areas for further research. For example, they note that the performance of the model may vary depending on the specific healthcare system and patient population, and more work is needed to ensure the model's robustness and generalizability across diverse settings.

Additionally, the researchers highlight the importance of interpretability and explainability in healthcare applications, as clinicians and patients require transparency in the decision-making process. While Odyssey, the open-source toolkit, aims to improve interpretability, further research may be needed to fully address this concern.

Another potential area for improvement is the integration of multimodal data, such as medical images or sensor data, which could enhance the model's predictive capabilities. The researchers suggest that future work could explore ways to seamlessly incorporate these additional data sources into the EHRMamba framework.

Conclusion

Overall, the introduction of EHRMamba and the Odyssey toolkit represents a significant advancement in the field of EHR modeling. By addressing key challenges such as computational cost, context length, and model maintenance, the researchers have paved the way for more widespread adoption of EHR foundation models in real-world healthcare settings.

The Multitask Prompted Finetuning approach and the integration with the HL7 FHIR standard further enhance the practicality and versatility of the EHRMamba model, making it a promising tool for improving clinical decision-making, patient outcomes, and the overall efficiency of healthcare systems.

As the researchers continue to refine and build upon this work, the potential impact of EHRMamba and similar foundation models in the healthcare domain is substantial, with the promise of transforming how we leverage data to deliver more personalized and effective patient care.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records

Adibvafa Fallahpour, Mahshid Alinoori, Arash Afkanpour, Amrit Krishnan

Transformers have significantly advanced the modeling of Electronic Health Records (EHR), yet their deployment in real-world healthcare is limited by several key challenges. Firstly, the quadratic computational cost and insufficient context length of these models pose significant obstacles for hospitals in processing the extensive medical histories typical in EHR data. Additionally, existing models employ separate finetuning for each clinical task, complicating maintenance in healthcare environments. Moreover, these models focus exclusively on either clinical prediction or EHR forecasting, lacking the flexibility to perform well across both. To overcome these limitations, we introduce EHRMamba, a robust foundation model built on the Mamba architecture. EHRMamba can process sequences up to four times longer than previous models due to its linear computational cost. We also introduce a novel approach to Multitask Prompted Finetuning (MTF) for EHR data, which enables EHRMamba to simultaneously learn multiple clinical tasks in a single finetuning phase, significantly enhancing deployment and cross-task generalization. Furthermore, our model leverages the HL7 FHIR data standard to simplify integration into existing hospital systems. Alongside EHRMamba, we open-source Odyssey, a toolkit designed to support the development and deployment of EHR foundation models, with an emphasis on data standardization and interpretability. Our evaluations on the MIMIC-IV dataset demonstrate that EHRMamba advances state-of-the-art performance across 6 major clinical tasks and excels in EHR forecasting, marking a significant leap forward in the field.

5/27/2024

A Multi-Center Study on the Adaptability of a Shared Foundation Model for Electronic Health Records

Lin Lawrence Guo, Jason Fries, Ethan Steinberg, Scott Lanyon Fleming, Keith Morse, Catherine Aftandilian, Jose Posada, Nigam Shah, Lillian Sung

Foundation models hold promise for transforming AI in healthcare by providing modular components that are easily adaptable to downstream healthcare tasks, making AI development more scalable and cost-effective. Structured EHR foundation models, trained on coded medical records from millions of patients, demonstrated benefits including increased performance with fewer training labels, and improved robustness to distribution shifts. However, questions remain on the feasibility of sharing these models across different hospitals and their performance for local task adaptation. This multi-center study examined the adaptability of a recently released structured EHR foundation model ($FM_{SM}$), trained on longitudinal medical record data from 2.57M Stanford Medicine patients. Experiments were conducted using EHR data at The Hospital for Sick Children and MIMIC-IV. We assessed both adaptability via continued pretraining on local data, and task adaptability compared to baselines of training models from scratch at each site, including a local foundation model. We evaluated the performance of these models on 8 clinical prediction tasks. In both datasets, adapting the off-the-shelf $FM_{SM}$ matched the performance of GBM models locally trained on all data while providing a 13% improvement in settings with few task-specific training labels. With continued pretraining on local data, label efficiency substantially improved, such that $FM_{SM}$ required fewer than 1% of training examples to match the fully trained GBM's performance. Continued pretraining was also 60 to 90% more sample-efficient than training local foundation models from scratch. Our findings show that adapting shared EHR foundation models across hospitals provides improved prediction performance at less cost, underscoring the utility of base foundation models as modular components to streamline the development of healthcare AI.

4/24/2024

Multi-task Heterogeneous Graph Learning on Electronic Health Records

Tsai Hor Chan, Guosheng Yin, Kyongtae Bae, Lequan Yu

Learning electronic health records (EHRs) has received emerging attention because of its capability to facilitate accurate medical diagnosis. Since the EHRs contain enriched information specifying complex interactions between entities, modeling EHRs with graphs is shown to be effective in practice. The EHRs, however, present a great degree of heterogeneity, sparsity, and complexity, which hamper the performance of most of the models applied to them. Moreover, existing approaches modeling EHRs often focus on learning the representations for a single task, overlooking the multi-task nature of EHR analysis problems and resulting in limited generalizability across different tasks. In view of these limitations, we propose a novel framework for EHR modeling, namely MulT-EHR (Multi-Task EHR), which leverages a heterogeneous graph to mine the complex relations and model the heterogeneity in the EHRs. To mitigate the large degree of noise, we introduce a denoising module based on the causal inference framework to adjust for severe confounding effects and reduce noise in the EHR data. Additionally, since our model adopts a single graph neural network for simultaneous multi-task prediction, we design a multi-task learning module to leverage the inter-task knowledge to regularize the training process. Extensive empirical studies on MIMIC-III and MIMIC-IV datasets validate that the proposed method consistently outperforms the state-of-the-art designs in four popular EHR analysis tasks -- drug recommendation, and predictions of the length of stay, mortality, and readmission. Thorough ablation studies demonstrate the robustness of our method upon variations to key components and hyperparameters.

8/15/2024

Recent Advances in Predictive Modeling with Electronic Health Records

Jiaqi Wang, Junyu Luo, Muchao Ye, Xiaochen Wang, Yuan Zhong, Aofei Chang, Guanjie Huang, Ziyi Yin, Cao Xiao, Jimeng Sun, Fenglong Ma

The development of electronic health records (EHR) systems has enabled the collection of a vast amount of digitized patient data. However, utilizing EHR data for predictive modeling presents several challenges due to its unique characteristics. With the advancements in machine learning techniques, deep learning has demonstrated its superiority in various applications, including healthcare. This survey systematically reviews recent advances in deep learning-based predictive models using EHR data. Specifically, we begin by introducing the background of EHR data and providing a mathematical definition of the predictive modeling task. We then categorize and summarize predictive deep models from multiple perspectives. Furthermore, we present benchmarks and toolkits relevant to predictive modeling in healthcare. Finally, we conclude this survey by discussing open challenges and suggesting promising directions for future research.

8/14/2024