Resolve Domain Conflicts for Generalizable Remote Physiological Measurement

2404.07855

Published 4/12/2024 by Weiyu Sun, Xinyu Zhang, Hao Lu, Ying Chen, Yun Ge, Xiaolin Huang, Jie Yuan, Yingcong Chen

Abstract

Remote photoplethysmography (rPPG) technology has become increasingly popular due to its non-invasive monitoring of various physiological indicators, making it widely applicable in multimedia interaction, healthcare, and emotion analysis. Existing rPPG methods utilize multiple datasets for training to enhance the generalizability of models. However, they often overlook the underlying conflict issues across different datasets, such as (1) label conflict resulting from different phase delays between physiological signal labels and face videos at the instance level, and (2) attribute conflict stemming from distribution shifts caused by head movements, illumination changes, skin types, etc. To address this, we introduce the DOmain-HArmonious framework (DOHA). Specifically, we first propose a harmonious phase strategy to eliminate uncertain phase delays and preserve the temporal variation of physiological signals. Next, we design a harmonious hyperplane optimization that reduces irrelevant attribute shifts and encourages the model's optimization towards a global solution that fits more valid scenarios. Our experiments demonstrate that DOHA significantly improves the performance of existing methods under multiple protocols. Our code is available at https://github.com/SWY666/rPPG-DOHA.

Related Work

Remote Photoplethysmography

Remote Photoplethysmography (rPPG) is a technique that uses camera-based measurement to estimate physiological signals, such as heart rate, from subtle changes in skin color. It has been explored in a variety of applications, including sleep staging, meeting engagement analysis, and remote physiological measurement. However, rPPG models often struggle with domain-specific challenges, such as varying lighting conditions, camera types, and subject characteristics, limiting their generalization to new environments.
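
To make the idea concrete, here is a minimal sketch of how a heart rate could be read from a skin-color trace. It is not taken from the paper: the synthetic trace, the 0.7 to 3.0 Hz search band, and the function name `estimate_bpm` are assumptions made for illustration. Real rPPG pipelines add face tracking, filtering, and usually a learned model on top.

```python
import math

def estimate_bpm(trace, fps, lo_hz=0.7, hi_hz=3.0):
    """Return the dominant frequency of `trace` in beats per minute.

    Only DFT bins inside [lo_hz, hi_hz] are scanned. The band limits
    (42 to 180 BPM) are an assumption of this sketch.
    """
    n = len(trace)
    mean = sum(trace) / n
    centered = [x - mean for x in trace]  # remove the constant (DC) offset
    best_freq, best_power = 0.0, -1.0
    for k in range(1, n // 2):
        freq = k * fps / n  # frequency of DFT bin k in Hz
        if not (lo_hz <= freq <= hi_hz):
            continue
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(centered))
        im = sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(centered))
        power = re * re + im * im
        if power > best_power:
            best_freq, best_power = freq, power
    return best_freq * 60.0

# Synthetic stand-in for the mean green-channel value of a face region.
fps, seconds, true_hz = 30, 10, 1.2  # 1.2 Hz corresponds to 72 BPM
trace = [
    120.0                                               # baseline brightness
    + 0.5 * math.sin(2 * math.pi * true_hz * i / fps)   # pulse component
    + 0.2 * math.sin(2 * math.pi * 0.25 * i / fps)      # slow drift outside the band
    for i in range(fps * seconds)
]

bpm = estimate_bpm(trace, fps)
print(f"estimated heart rate: {bpm:.1f} BPM")
```

Restricting the search to a plausible heart-rate band is what keeps the slow drift from being mistaken for the pulse. Conditions such as lighting or motion that push noise into that band are one reason generalization is hard.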

Plain English Explanation

The paper focuses on resolving domain conflicts to enable more generalizable remote physiological measurement using rPPG techniques. Traditional rPPG models can have difficulty working well across different environments and subjects, as factors like lighting, camera type, and individual characteristics can impact the measurement. The researchers aim to develop methods that can more effectively adapt to these varying conditions and produce reliable physiological estimates regardless of the specific setup or population.

Technical Explanation

The paper proposes the DOmain-HArmonious framework (DOHA) to address the conflicts that arise when rPPG models are trained on multiple datasets. Key elements include:

Harmonious Phase Strategy: Eliminates the uncertain phase delays between physiological signal labels and face videos (the label conflict) while preserving the temporal variation of the physiological signals.
Harmonious Hyperplane Optimization: Reduces irrelevant attribute shifts, such as those caused by head movements, illumination changes, and skin types (the attribute conflict), and encourages optimization towards a global solution that fits more valid scenarios.
Evaluation: The authors report that DOHA significantly improves the performance of existing methods under multiple protocols.
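
The label conflict described in the abstract can be illustrated with a small sketch. It does not implement DOHA's harmonious phase strategy, whose details are not given here. It only shows, on synthetic sine waves, why an unknown phase delay between a label and the video signal causes trouble for time-domain supervision. The two "datasets", the 0.4 s delay, and all function names are hypothetical.

```python
import math

def pearson(a, b):
    """Pearson correlation between two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb)

def pulse(freq_hz, fps, n, delay_s=0.0):
    """Idealized pulse wave sampled at `fps`, delayed by `delay_s` seconds."""
    return [math.sin(2 * math.pi * freq_hz * (i / fps - delay_s)) for i in range(n)]

def max_corr_over_lags(a, b, max_lag):
    """Best correlation when b may be circularly shifted by 0..max_lag samples."""
    return max(pearson(a, b[s:] + b[:s]) for s in range(max_lag + 1))

fps, n, hr_hz = 30, 300, 1.2           # 10 s of video, 72 BPM
video_pulse = pulse(hr_hz, fps, n)     # pulse as it appears in the face video

label_a = pulse(hr_hz, fps, n, delay_s=0.0)  # hypothetical dataset A: label aligned
label_b = pulse(hr_hz, fps, n, delay_s=0.4)  # hypothetical dataset B: label lags 0.4 s

r_a = pearson(video_pulse, label_a)
r_b = pearson(video_pulse, label_b)
r_b_aligned = max_corr_over_lags(video_pulse, label_b, max_lag=25)  # 25 samples = one period

print(f"dataset A correlation:        {r_a:+.3f}")
print(f"dataset B correlation:        {r_b:+.3f}")
print(f"dataset B after lag search:   {r_b_aligned:+.3f}")
```

Both labels encode the same heart rate, yet a sample-by-sample comparison scores the delayed label as almost perfectly anti-correlated with the video signal. A model trained on both datasets with a time-domain loss would receive contradictory targets, which is the kind of instance-level conflict the abstract attributes to differing phase delays.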

Critical Analysis

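The paper tackles a problem that has historically limited the generalization of rPPG models: datasets that are combined for training can conflict with one another. By treating label conflict (phase delays between labels and videos) and attribute conflict (distribution shifts from head movements, illumination changes, and skin types) as separate problems with a dedicated component for each, the framework targets issues that, according to the authors, existing multi-dataset methods often overlook.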

However, the method likely still has limitations, such as the need for a sufficiently large and diverse set of training datasets to achieve good performance. Additionally, improved generalization may not completely eliminate the need for subject-specific adjustments in certain applications.

Future research could explore ways to improve the model's robustness, potentially by incorporating additional techniques such as meta-learning or few-shot adaptation. Evaluating the model in real-world deployment scenarios would also help assess its practical utility.

Conclusion

This paper presents DOHA, a framework for resolving domain conflicts in remote physiological measurement using rPPG. By combining a harmonious phase strategy with harmonious hyperplane optimization, it improves the performance of existing rPPG methods under multiple protocols, according to the authors' experiments. The research highlights the importance of addressing conflicts between datasets to enable more reliable and widely applicable remote physiological sensing, with potential implications for a variety of healthcare and multimedia applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

PhysMLE: Generalizable and Priors-Inclusive Multi-task Remote Physiological Measurement

Jiyao Wang, Hao Lu, Ange Wang, Xiao Yang, Yingcong Chen, Dengbo He, Kaishun Wu

Remote photoplethysmography (rPPG) has been widely applied to measure heart rate from face videos. To increase the generalizability of the algorithms, domain generalization (DG) has attracted increasing attention in rPPG. However, when rPPG is extended to simultaneously measure more vital signs (e.g., respiration and blood oxygen saturation), achieving generalizability brings new challenges. Although partial features shared among different physiological signals can benefit multi-task learning, the sparse and imbalanced target label space brings the seesaw effect over task-specific feature learning. To resolve this problem, we designed an end-to-end Mixture of Low-rank Experts for multi-task remote Physiological measurement (PhysMLE), which is based on multiple low-rank experts with a novel router mechanism, thereby enabling the model to adeptly handle both specifications and correlations within tasks. Additionally, we introduced prior knowledge from physiology among tasks to overcome the imbalance of label space under real-world multi-task physiological measurement. For fair and comprehensive evaluations, this paper proposed a large-scale multi-task generalization benchmark, named Multi-Source Synsemantic Domain Generalization (MSSDG) protocol. Extensive experiments with MSSDG and intra-dataset have shown the effectiveness and efficiency of PhysMLE. In addition, a new dataset was collected and made publicly available to meet the needs of the MSSDG.

5/13/2024

cs.CV

Measuring Domain Shifts using Deep Learning Remote Photoplethysmography Model Similarity

Nathan Vance, Patrick Flynn

Domain shift differences between training data for deep learning models and the deployment context can result in severe performance issues for models which fail to generalize. We study the domain shift problem under the context of remote photoplethysmography (rPPG), a technique for video-based heart rate inference. We propose metrics based on model similarity which may be used as a measure of domain shift, and we demonstrate high correlation between these metrics and empirical performance. One of the proposed metrics with viable correlations, DS-diff, does not assume access to the ground truth of the target domain, i.e. it may be applied to in-the-wild data. To that end, we investigate a model selection problem in which ground truth results for the evaluation domain are not known, demonstrating a 13.9% performance improvement over the average case baseline.

4/15/2024

cs.CV

RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos

Bochao Zou, Zizheng Guo, Xiaocheng Hu, Huimin Ma

Remote photoplethysmography (rPPG) is a non-contact method for detecting physiological signals from facial videos, holding great potential in various applications such as healthcare, affective computing, and anti-spoofing. Existing deep learning methods struggle to address two core issues of rPPG simultaneously: extracting weak rPPG signals from video segments with large spatiotemporal redundancy and understanding the periodic patterns of rPPG among long contexts. This represents a trade-off between computational complexity and the ability to capture long-range dependencies, posing a challenge for rPPG that is suitable for deployment on mobile devices. Based on the in-depth exploration of Mamba's comprehension of spatial and temporal information, this paper introduces RhythmMamba, an end-to-end Mamba-based method that employs multi-temporal Mamba to constrain both periodic patterns and short-term trends, coupled with frequency domain feed-forward to enable Mamba to robustly understand the quasi-periodic patterns of rPPG. Extensive experiments show that RhythmMamba achieves state-of-the-art performance with reduced parameters and lower computational complexity. The proposed RhythmMamba can be applied to video segments of any length without performance degradation. The codes are available at https://github.com/zizheng-guo/RhythmMamba.

4/10/2024

cs.CV

Camera-Based Remote Physiology Sensing for Hundreds of Subjects Across Skin Tones

Jiankai Tang, Xinyi Li, Jiacheng Liu, Xiyuxing Zhang, Zeyu Wang, Yuntao Wang

Remote photoplethysmography (rPPG) emerges as a promising method for non-invasive, convenient measurement of vital signs, utilizing the widespread presence of cameras. Despite advancements, existing datasets fall short in terms of size and diversity, limiting comprehensive evaluation under diverse conditions. This paper presents an in-depth analysis of the VitalVideo dataset, the largest real-world rPPG dataset to date, encompassing 893 subjects and 6 Fitzpatrick skin tones. Our experimentation with six unsupervised methods and three supervised models demonstrates that datasets comprising a few hundred subjects (i.e., 300 for UBFC-rPPG, 500 for PURE, and 700 for MMPD-Simple) are sufficient for effective rPPG model training. Our findings highlight the importance of diversity and consistency in skin tones for precise performance evaluation across different datasets.

4/9/2024

cs.CV cs.AI