Multi-Channel Masked Autoencoder and Comprehensive Evaluations for Reconstructing 12-Lead ECG from Arbitrary Single-Lead ECG

Read original: arXiv:2407.11481 - Published 7/17/2024 by Jiarong Chen, Wanqing Wu, Tong Liu, Shenda Hong

Multi-Channel Masked Autoencoder and Comprehensive Evaluations for Reconstructing 12-Lead ECG from Arbitrary Single-Lead ECG

Overview

This paper presents a novel multi-channel masked autoencoder model for reconstructing 12-lead electrocardiogram (ECG) signals from arbitrary single-lead ECG inputs.
The model is trained and evaluated on a large dataset of 12-lead ECG recordings, demonstrating its ability to accurately reconstruct the full 12-lead ECG from a single lead.
The paper also includes a comprehensive evaluation of the model's performance across various metrics and use cases, highlighting its potential for practical applications in healthcare.

Plain English Explanation

The paper describes a new deep learning model that can take a single ECG reading, typically recorded from one electrode placed on the body, and use that information to reconstruct the complete 12-lead ECG. An ECG measures the electrical activity of the heart and is an important tool for diagnosing heart conditions.

Traditionally, recording a full 12-lead ECG requires placing multiple electrodes in specific locations on the body. This can be time-consuming and inconvenient for patients. The approach presented in this paper could allow for a simpler ECG test that only requires a single electrode, while still providing the detailed information normally obtained from the full 12-lead ECG. This could make ECG testing more accessible and convenient for patients.

The key innovation of the model is its "multi-channel masked autoencoder" architecture, which allows it to learn how to reconstruct the missing ECG leads from the available single lead. The model was trained and tested on a large dataset of real 12-lead ECG recordings, demonstrating its ability to accurately recover the full 12-lead ECG signal.

Technical Explanation

The paper introduces a multi-channel masked autoencoder (MCMA) model for reconstructing 12-lead electrocardiogram (ECG) signals from arbitrary single-lead ECG inputs. [The model builds on prior work in ECG signal completion and representation learning for multi-lead ECG](https://aimodels.fyi/papers/arxiv/ecgrecover-deep-learning-approach-electrocardiogram-signal-completion, https://aimodels.fyi/papers/arxiv/modally-reduced-representation-learning-multi-lead-ecg).

The MCMA architecture consists of an encoder that learns a latent representation from the single-lead input, and a decoder that reconstructs the full 12-lead ECG signal. A key aspect is the use of a masking mechanism that randomly masks out individual ECG leads during training, forcing the model to learn how to recover the missing information.

The model was trained and evaluated on a large dataset of 12-lead ECG recordings from over 30,000 patients. Experimental results demonstrate the MCMA model's ability to accurately reconstruct the full 12-lead ECG from arbitrary single-lead inputs, outperforming previous approaches. The authors also show the model's effectiveness for downstream tasks like ECG classification, as in prior related work.

Critical Analysis

The paper provides a comprehensive evaluation of the MCMA model, assessing its performance across a variety of metrics and use cases. However, the authors acknowledge several limitations and areas for future research.

One key limitation is the use of a relatively homogeneous dataset, primarily composed of data from a single healthcare system. Evaluating the model's generalization to more diverse patient populations and data sources would be an important next step.

Additionally, the paper does not explore the model's robustness to noise or corrupted input signals, which would be crucial for real-world applications. Further research is needed to understand the model's performance under more challenging conditions.

While the paper demonstrates the MCMA model's effectiveness for ECG reconstruction, more work is needed to fully understand the clinical implications and potential impact on patient care. Validation through prospective clinical studies would be an important next stage of this research.

Conclusion

This paper presents a novel multi-channel masked autoencoder model for reconstructing 12-lead ECG signals from single-lead inputs. The comprehensive evaluation demonstrates the model's strong performance, suggesting its potential to simplify ECG testing and improve accessibility to this important diagnostic tool.

Future research should focus on expanding the model's generalization, robustness, and clinical validation to further establish its real-world viability. Overall, this work represents an important step forward in leveraging deep learning to advance electrocardiography and cardiovascular healthcare.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Multi-Channel Masked Autoencoder and Comprehensive Evaluations for Reconstructing 12-Lead ECG from Arbitrary Single-Lead ECG

Jiarong Chen, Wanqing Wu, Tong Liu, Shenda Hong

In the context of cardiovascular diseases (CVD) that exhibit an elevated prevalence and mortality, the electrocardiogram (ECG) is a popular and standard diagnostic tool for doctors, commonly utilizing a 12-lead configuration in clinical practice. However, the 10 electrodes placed on the surface would cause a lot of inconvenience and discomfort, while the rapidly advancing wearable devices adopt the reduced-lead or single-lead ECG to reduce discomfort as a solution in long-term monitoring. Since the single-lead ECG is a subset of 12-lead ECG, it provides insufficient cardiac health information and plays a substandard role in real-world healthcare applications. Hence, it is necessary to utilize signal generation technologies to reduce their clinical importance gap by reconstructing 12-lead ECG from the real single-lead ECG. Specifically, this study proposes a multi-channel masked autoencoder (MCMA) for this goal. In the experimental results, the visualized results between the generated and real signals can demonstrate the effectiveness of the proposed framework. At the same time, this study introduces a comprehensive evaluation benchmark named ECGGenEval, encompassing the signal-level, feature-level, and diagnostic-level evaluations, providing a holistic assessment of 12-lead ECG signals and generative model. Further, the quantitative experimental results are as follows, the mean square errors of 0.0178 and 0.0658, correlation coefficients of 0.7698 and 0.7237 in the signal-level evaluation, the average F1-score with two generated 12-lead ECG is 0.8319 and 0.7824 in the diagnostic-level evaluation, achieving the state-of-the-art performance. The open-source code is publicly available at url{https://github.com/CHENJIAR3/MCMA}.

7/17/2024

ECGrecover: a Deep Learning Approach for Electrocardiogram Signal Completion

Alex Lence, Ahmad Fall, Federica Granese, Blaise Hanczar, Joe-Elie Salem, Jean-Daniel Zucker, Edi Prifti

In this work, we address the challenge of reconstructing the complete 12-lead ECG signal from incomplete parts of it. We focus on two main scenarii: (i) reconstructing missing signal segments within an ECG lead and (ii) recovering missing leads from a single-lead. We propose a model with a U-Net architecture trained on a novel objective function to address the reconstruction problem. This function incorporates both spatial and temporal aspects of the ECG by combining the distance in amplitude between the reconstructed and real signals with the signal trend. Through comprehensive assessments using both a real-life dataset and a publicly accessible one, we demonstrate that the proposed approach consistently outperforms state-of-the-art methods based on generative adversarial networks and a CopyPaste strategy. Our proposed model demonstrates superior performance in standard distortion metrics and preserves critical ECG characteristics, particularly the P, Q, R, S, and T wave coordinates. Two emerging clinical applications emphasize the relevance of our work. The first is the increasing need to digitize paper-stored ECGs for utilization in AI-based applications (automatic annotation and risk-quantification), often limited to digital ECG complete 10s recordings. The second is the widespread use of wearable devices that record ECGs but typically capture only a small subset of the 12 standard leads. In both cases, a non-negligible amount of information is lost or not recorded, which our approach aims to recover to overcome these limitations.

6/27/2024

Modally Reduced Representation Learning of Multi-Lead ECG Signals through Simultaneous Alignment and Reconstruction

Nabil Ibtehaz, Masood Mortazavi

Electrocardiogram (ECG) signals, profiling the electrical activities of the heart, are used for a plethora of diagnostic applications. However, ECG systems require multiple leads or channels of signals to capture the complete view of the cardiac system, which limits their application in smartwatches and wearables. In this work, we propose a modally reduced representation learning method for ECG signals that is capable of generating channel-agnostic, unified representations for ECG signals. Through joint optimization of reconstruction and alignment, we ensure that the embeddings of the different channels contain an amalgamation of the overall information across channels while also retaining their specific information. On an independent test dataset, we generated highly correlated channel embeddings from different ECG channels, leading to a moderate approximation of the 12-lead signals from a single-channel embedding. Our generated embeddings can work as competent features for ECG signals for downstream tasks.

5/31/2024

🏷️

Masked Transformer for Electrocardiogram Classification

Ya Zhou, Xiaolin Diao, Yanni Huo, Yang Liu, Xiaohan Fan, Wei Zhao

Electrocardiogram (ECG) is one of the most important diagnostic tools in clinical applications. With the advent of advanced algorithms, various deep learning models have been adopted for ECG tasks. However, the potential of Transformer for ECG data has not been fully realized, despite their widespread success in computer vision and natural language processing. In this work, we present Masked Transformer for ECG classification (MTECG), a simple yet effective method which significantly outperforms recent state-of-the-art algorithms in ECG classification. Our approach adapts the image-based masked autoencoders to self-supervised representation learning from ECG time series. We utilize a lightweight Transformer for the encoder and a 1-layer Transformer for the decoder. The ECG signal is split into a sequence of non-overlapping segments along the time dimension, and learnable positional embeddings are added to preserve the sequential information. We construct the Fuwai dataset comprising 220,251 ECG recordings with a broad range of diagnoses, annotated by medical experts, to explore the potential of Transformer. A strong pre-training and fine-tuning recipe is proposed from the empirical study. The experiments demonstrate that the proposed method increases the macro F1 scores by 3.4%-27.5% on the Fuwai dataset, 9.9%-32.0% on the PTB-XL dataset, and 9.4%-39.1% on a multicenter dataset, compared to the alternative methods. We hope that this study could direct future research on the application of Transformer to more ECG tasks.

4/24/2024