RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos

2404.06483

YC

0

Reddit

0

Published 4/10/2024 by Bochao Zou, Zizheng Guo, Xiaocheng Hu, Huimin Ma
RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos

Abstract

Remote photoplethysmography (rPPG) is a non-contact method for detecting physiological signals from facial videos, holding great potential in various applications such as healthcare, affective computing, and anti-spoofing. Existing deep learning methods struggle to address two core issues of rPPG simultaneously: extracting weak rPPG signals from video segments with large spatiotemporal redundancy and understanding the periodic patterns of rPPG among long contexts. This represents a trade-off between computational complexity and the ability to capture long-range dependencies, posing a challenge for rPPG that is suitable for deployment on mobile devices. Based on the in-depth exploration of Mamba's comprehension of spatial and temporal information, this paper introduces RhythmMamba, an end-to-end Mamba-based method that employs multi-temporal Mamba to constrain both periodic patterns and short-term trends, coupled with frequency domain feed-forward to enable Mamba to robustly understand the quasi-periodic patterns of rPPG. Extensive experiments show that RhythmMamba achieves state-of-the-art performance with reduced parameters and lower computational complexity. The proposed RhythmMamba can be applied to video segments of any length without performance degradation. The codes are available at https://github.com/zizheng-guo/RhythmMamba.

Create account to get full access

or

If you already have an account, we'll log you in

Overview

  • This paper presents RhythmMamba, a new method for fast and accurate remote physiological measurement using arbitrary length videos.
  • The method can measure heart rate, breathing rate, and other vital signs without specialized equipment, just from regular video footage.
  • RhythmMamba works on videos of varying lengths, from a few seconds to several minutes, making it more practical for real-world applications compared to previous techniques.

Plain English Explanation

RhythmMamba is a new technology that can measure a person's physical signals, like their heart rate and breathing rate, just by looking at a video of them. This is done without needing any special medical devices - the system can analyze regular video footage to extract these vital signs.

One key advantage of RhythmMamba is that it works on videos of different lengths. Previous methods for remote physiological measurement were limited to very short videos, but RhythmMamba can handle videos ranging from a few seconds to several minutes. This makes it much more practical for real-world use cases, where you might want to continuously monitor someone's health over an extended period.

The paper demonstrates that RhythmMamba can accurately measure heart rate and breathing rate, even on challenging videos with subjects moving around or in different lighting conditions. This shows the potential for this technology to be used in a wide variety of applications, from remote patient monitoring to analysis of online meetings and video calls.

Technical Explanation

The paper introduces the RhythmMamba method for remote physiological measurement from arbitrary length videos. RhythmMamba builds on prior work in remote photoplethysmography (rPPG) and video understanding, but extends these techniques to handle videos of varying durations.

The key innovation is a novel neural network architecture that can efficiently process long video sequences and extract physiological signals. Unlike previous methods that were limited to short clips, RhythmMamba can analyze videos from a few seconds up to several minutes in length.

The authors evaluate RhythmMamba on a large dataset of videos, showing it can accurately measure heart rate and breathing rate compared to ground truth sensors. The method is robust to subject movement and varying lighting conditions, demonstrating its potential for real-world applications such as remote patient monitoring and video analytics.

Critical Analysis

The RhythmMamba paper makes a compelling case for its ability to perform remote physiological measurement on arbitrary length videos. The experimental results show the method is highly accurate and robust to challenging video conditions.

However, the paper does not address some potential limitations or areas for further research. For example, it is unclear how RhythmMamba would perform on videos with multiple people, occlusions, or very poor video quality. The authors also do not discuss privacy implications or the ethical use of this technology for remote monitoring.

Additionally, the paper focuses primarily on heart rate and breathing rate measurement. It would be valuable to see if the RhythmMamba approach can be extended to monitor other vital signs, such as blood pressure or oxygen saturation.

Overall, the RhythmMamba method represents an important advance in remote physiological sensing, but further research is needed to fully understand its capabilities and limitations in real-world applications.

Conclusion

The RhythmMamba paper introduces a novel technique for fast and accurate remote physiological measurement from arbitrary length videos. By overcoming the limitations of previous methods, RhythmMamba opens up new possibilities for applications like remote health monitoring, video analytics, and human-computer interaction.

The experimental results demonstrate the impressive performance of RhythmMamba, suggesting it could have a significant impact on how we measure and understand human physiology in the future. As the technology continues to develop, it will be important to carefully consider the ethical implications and ensure RhythmMamba is used responsibly to benefit society.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Camera-Based Remote Physiology Sensing for Hundreds of Subjects Across Skin Tones

Camera-Based Remote Physiology Sensing for Hundreds of Subjects Across Skin Tones

Jiankai Tang, Xinyi Li, Jiacheng Liu, Xiyuxing Zhang, Zeyu Wang, Yuntao Wang

YC

0

Reddit

0

Remote photoplethysmography (rPPG) emerges as a promising method for non-invasive, convenient measurement of vital signs, utilizing the widespread presence of cameras. Despite advancements, existing datasets fall short in terms of size and diversity, limiting comprehensive evaluation under diverse conditions. This paper presents an in-depth analysis of the VitalVideo dataset, the largest real-world rPPG dataset to date, encompassing 893 subjects and 6 Fitzpatrick skin tones. Our experimentation with six unsupervised methods and three supervised models demonstrates that datasets comprising a few hundred subjects(i.e., 300 for UBFC-rPPG, 500 for PURE, and 700 for MMPD-Simple) are sufficient for effective rPPG model training. Our findings highlight the importance of diversity and consistency in skin tones for precise performance evaluation across different datasets.

Read more

4/9/2024

SiNC+: Adaptive Camera-Based Vitals with Unsupervised Learning of Periodic Signals

SiNC+: Adaptive Camera-Based Vitals with Unsupervised Learning of Periodic Signals

Jeremy Speth, Nathan Vance, Patrick Flynn, Adam Czajka

YC

0

Reddit

0

Subtle periodic signals, such as blood volume pulse and respiration, can be extracted from RGB video, enabling noncontact health monitoring at low cost. Advancements in remote pulse estimation -- or remote photoplethysmography (rPPG) -- are currently driven by deep learning solutions. However, modern approaches are trained and evaluated on benchmark datasets with ground truth from contact-PPG sensors. We present the first non-contrastive unsupervised learning framework for signal regression to mitigate the need for labelled video data. With minimal assumptions of periodicity and finite bandwidth, our approach discovers the blood volume pulse directly from unlabelled videos. We find that encouraging sparse power spectra within normal physiological bandlimits and variance over batches of power spectra is sufficient for learning visual features of periodic signals. We perform the first experiments utilizing unlabelled video data not specifically created for rPPG to train robust pulse rate estimators. Given the limited inductive biases, we successfully applied the same approach to camera-based respiration by changing the bandlimits of the target signal. This shows that the approach is general enough for unsupervised learning of bandlimited quasi-periodic signals from different domains. Furthermore, we show that the framework is effective for finetuning models on unlabelled video from a single subject, allowing for personalized and adaptive signal regressors.

Read more

4/23/2024

🌐

Evaluation of Video-Based rPPG in Challenging Environments: Artifact Mitigation and Network Resilience

Nhi Nguyen, Le Nguyen, Honghan Li, Miguel Bordallo L'opez, Constantino 'Alvarez Casado

YC

0

Reddit

0

Video-based remote photoplethysmography (rPPG) has emerged as a promising technology for non-contact vital sign monitoring, especially under controlled conditions. However, the accurate measurement of vital signs in real-world scenarios faces several challenges, including artifacts induced by videocodecs, low-light noise, degradation, low dynamic range, occlusions, and hardware and network constraints. In this article, we systematically investigate comprehensive investigate these issues, measuring their detrimental effects on the quality of rPPG measurements. Additionally, we propose practical strategies for mitigating these challenges to improve the dependability and resilience of video-based rPPG systems. We detail methods for effective biosignal recovery in the presence of network limitations and present denoising and inpainting techniques aimed at preserving video frame integrity. Through extensive evaluations and direct comparisons, we demonstrate the effectiveness of the approaches in enhancing rPPG measurements under challenging environments, contributing to the development of more reliable and effective remote vital sign monitoring technologies.

Read more

5/3/2024

PhysMLE: Generalizable and Priors-Inclusive Multi-task Remote Physiological Measurement

PhysMLE: Generalizable and Priors-Inclusive Multi-task Remote Physiological Measurement

Jiyao Wang, Hao Lu, Ange Wang, Xiao Yang, Yingcong Chen, Dengbo He, Kaishun Wu

YC

0

Reddit

0

Remote photoplethysmography (rPPG) has been widely applied to measure heart rate from face videos. To increase the generalizability of the algorithms, domain generalization (DG) attracted increasing attention in rPPG. However, when rPPG is extended to simultaneously measure more vital signs (e.g., respiration and blood oxygen saturation), achieving generalizability brings new challenges. Although partial features shared among different physiological signals can benefit multi-task learning, the sparse and imbalanced target label space brings the seesaw effect over task-specific feature learning. To resolve this problem, we designed an end-to-end Mixture of Low-rank Experts for multi-task remote Physiological measurement (PhysMLE), which is based on multiple low-rank experts with a novel router mechanism, thereby enabling the model to adeptly handle both specifications and correlations within tasks. Additionally, we introduced prior knowledge from physiology among tasks to overcome the imbalance of label space under real-world multi-task physiological measurement. For fair and comprehensive evaluations, this paper proposed a large-scale multi-task generalization benchmark, named Multi-Source Synsemantic Domain Generalization (MSSDG) protocol. Extensive experiments with MSSDG and intra-dataset have shown the effectiveness and efficiency of PhysMLE. In addition, a new dataset was collected and made publicly available to meet the needs of the MSSDG.

Read more

5/13/2024