PhysMLE: Generalizable and Priors-Inclusive Multi-task Remote Physiological Measurement

2405.06201

YC

0

Reddit

0

Published 5/13/2024 by Jiyao Wang, Hao Lu, Ange Wang, Xiao Yang, Yingcong Chen, Dengbo He, Kaishun Wu
PhysMLE: Generalizable and Priors-Inclusive Multi-task Remote Physiological Measurement

Abstract

Remote photoplethysmography (rPPG) has been widely applied to measure heart rate from face videos. To increase the generalizability of the algorithms, domain generalization (DG) attracted increasing attention in rPPG. However, when rPPG is extended to simultaneously measure more vital signs (e.g., respiration and blood oxygen saturation), achieving generalizability brings new challenges. Although partial features shared among different physiological signals can benefit multi-task learning, the sparse and imbalanced target label space brings the seesaw effect over task-specific feature learning. To resolve this problem, we designed an end-to-end Mixture of Low-rank Experts for multi-task remote Physiological measurement (PhysMLE), which is based on multiple low-rank experts with a novel router mechanism, thereby enabling the model to adeptly handle both specifications and correlations within tasks. Additionally, we introduced prior knowledge from physiology among tasks to overcome the imbalance of label space under real-world multi-task physiological measurement. For fair and comprehensive evaluations, this paper proposed a large-scale multi-task generalization benchmark, named Multi-Source Synsemantic Domain Generalization (MSSDG) protocol. Extensive experiments with MSSDG and intra-dataset have shown the effectiveness and efficiency of PhysMLE. In addition, a new dataset was collected and made publicly available to meet the needs of the MSSDG.

Create account to get full access

or

If you already have an account, we'll log you in

Overview

  • Presents a novel multi-task learning approach called PhysMLE to address the challenge of generalizable and priors-inclusive remote physiological measurement
  • Leverages a mixture of experts architecture and low-rank adaptation to improve domain generalization
  • Demonstrates state-of-the-art performance on several remote physiological measurement tasks across diverse datasets

Plain English Explanation

PhysMLE: Generalizable and Priors-Inclusive Multi-task Remote Physiological Measurement tackles the problem of reliably measuring people's physiological signals, like heart rate or breathing, using cameras instead of physical sensors. This is known as remote physiological measurement or rPPG.

The key challenge is that the performance of rPPG models can vary a lot across different people, environments, and camera setups. The researchers propose a new approach called PhysMLE that uses a "mixture of experts" architecture and techniques like "low-rank adaptation" to make the models more generalizable and robust to these changes.

This builds on previous work in camera-based remote physiology sensing and measuring domain shifts in remote physiological measurement. The goal is to create rPPG models that work well in a wide variety of real-world conditions, without requiring extensive retraining or adaptation.

Technical Explanation

PhysMLE: Generalizable and Priors-Inclusive Multi-task Remote Physiological Measurement introduces a novel multi-task learning approach to improve the generalization of remote physiological measurement (rPPG) models.

The key elements of the PhysMLE approach include:

  • Mixture of Experts Architecture: The model consists of multiple "expert" sub-networks, each specialized for a different rPPG task or domain. A "gating" network dynamically selects the most appropriate expert(s) for each input.
  • Low-Rank Adaptation: The expert sub-networks can efficiently adapt to new domains by learning low-dimensional task-specific transformations, rather than updating all model parameters.
  • Priors-Inclusive Training: The model incorporates physiological priors, such as signal periodicity, into the training process to improve robustness and generalization.

The researchers evaluate PhysMLE on several rPPG datasets, including RhythmMamba and SleePPG-Net2, and demonstrate state-of-the-art performance across a range of physiological measurement tasks.

Critical Analysis

The PhysMLE paper presents a well-designed and thoroughly evaluated approach to improving the generalization of remote physiological measurement models. The mixture of experts architecture, low-rank adaptation, and incorporation of physiological priors appear to be effective strategies for addressing the challenge of domain shift in rPPG.

One potential limitation is the reliance on task-specific expert sub-networks, which may not scale well to an ever-increasing number of target tasks or domains. The researchers acknowledge this and suggest further research into more efficient adaptation mechanisms.

Additionally, the paper does not deeply explore the theoretical underpinnings of the low-rank adaptation technique or provide a comprehensive analysis of the learned task-specific transformations. A more detailed understanding of these mechanisms could lead to further insights and improvements.

Overall, the PhysMLE work represents a significant contribution to the field of remote physiological measurement and demonstrates the value of leveraging domain-specific knowledge and priors to develop more robust and generalizable AI models.

Conclusion

PhysMLE: Generalizable and Priors-Inclusive Multi-task Remote Physiological Measurement introduces a novel multi-task learning approach that addresses the challenge of developing remote physiological measurement models that can perform well across diverse real-world conditions.

By combining a mixture of experts architecture, low-rank adaptation, and the incorporation of physiological priors, the researchers have demonstrated state-of-the-art performance on several rPPG datasets. This work advances the field of camera-based remote physiology sensing and brings us closer to reliable, generalizable, and priors-inclusive AI systems for remote health monitoring and analysis.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Resolve Domain Conflicts for Generalizable Remote Physiological Measurement

Resolve Domain Conflicts for Generalizable Remote Physiological Measurement

Weiyu Sun, Xinyu Zhang, Hao Lu, Ying Chen, Yun Ge, Xiaolin Huang, Jie Yuan, Yingcong Chen

YC

0

Reddit

0

Remote photoplethysmography (rPPG) technology has become increasingly popular due to its non-invasive monitoring of various physiological indicators, making it widely applicable in multimedia interaction, healthcare, and emotion analysis. Existing rPPG methods utilize multiple datasets for training to enhance the generalizability of models. However, they often overlook the underlying conflict issues across different datasets, such as (1) label conflict resulting from different phase delays between physiological signal labels and face videos at the instance level, and (2) attribute conflict stemming from distribution shifts caused by head movements, illumination changes, skin types, etc. To address this, we introduce the DOmain-HArmonious framework (DOHA). Specifically, we first propose a harmonious phase strategy to eliminate uncertain phase delays and preserve the temporal variation of physiological signals. Next, we design a harmonious hyperplane optimization that reduces irrelevant attribute shifts and encourages the model's optimization towards a global solution that fits more valid scenarios. Our experiments demonstrate that DOHA significantly improves the performance of existing methods under multiple protocols. Our code is available at https://github.com/SWY666/rPPG-DOHA.

Read more

4/12/2024

Camera-Based Remote Physiology Sensing for Hundreds of Subjects Across Skin Tones

Camera-Based Remote Physiology Sensing for Hundreds of Subjects Across Skin Tones

Jiankai Tang, Xinyi Li, Jiacheng Liu, Xiyuxing Zhang, Zeyu Wang, Yuntao Wang

YC

0

Reddit

0

Remote photoplethysmography (rPPG) emerges as a promising method for non-invasive, convenient measurement of vital signs, utilizing the widespread presence of cameras. Despite advancements, existing datasets fall short in terms of size and diversity, limiting comprehensive evaluation under diverse conditions. This paper presents an in-depth analysis of the VitalVideo dataset, the largest real-world rPPG dataset to date, encompassing 893 subjects and 6 Fitzpatrick skin tones. Our experimentation with six unsupervised methods and three supervised models demonstrates that datasets comprising a few hundred subjects(i.e., 300 for UBFC-rPPG, 500 for PURE, and 700 for MMPD-Simple) are sufficient for effective rPPG model training. Our findings highlight the importance of diversity and consistency in skin tones for precise performance evaluation across different datasets.

Read more

4/9/2024

Measuring Domain Shifts using Deep Learning Remote Photoplethysmography Model Similarity

Measuring Domain Shifts using Deep Learning Remote Photoplethysmography Model Similarity

Nathan Vance, Patrick Flynn

YC

0

Reddit

0

Domain shift differences between training data for deep learning models and the deployment context can result in severe performance issues for models which fail to generalize. We study the domain shift problem under the context of remote photoplethysmography (rPPG), a technique for video-based heart rate inference. We propose metrics based on model similarity which may be used as a measure of domain shift, and we demonstrate high correlation between these metrics and empirical performance. One of the proposed metrics with viable correlations, DS-diff, does not assume access to the ground truth of the target domain, i.e. it may be applied to in-the-wild data. To that end, we investigate a model selection problem in which ground truth results for the evaluation domain is not known, demonstrating a 13.9% performance improvement over the average case baseline.

Read more

4/15/2024

RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos

RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos

Bochao Zou, Zizheng Guo, Xiaocheng Hu, Huimin Ma

YC

0

Reddit

0

Remote photoplethysmography (rPPG) is a non-contact method for detecting physiological signals from facial videos, holding great potential in various applications such as healthcare, affective computing, and anti-spoofing. Existing deep learning methods struggle to address two core issues of rPPG simultaneously: extracting weak rPPG signals from video segments with large spatiotemporal redundancy and understanding the periodic patterns of rPPG among long contexts. This represents a trade-off between computational complexity and the ability to capture long-range dependencies, posing a challenge for rPPG that is suitable for deployment on mobile devices. Based on the in-depth exploration of Mamba's comprehension of spatial and temporal information, this paper introduces RhythmMamba, an end-to-end Mamba-based method that employs multi-temporal Mamba to constrain both periodic patterns and short-term trends, coupled with frequency domain feed-forward to enable Mamba to robustly understand the quasi-periodic patterns of rPPG. Extensive experiments show that RhythmMamba achieves state-of-the-art performance with reduced parameters and lower computational complexity. The proposed RhythmMamba can be applied to video segments of any length without performance degradation. The codes are available at https://github.com/zizheng-guo/RhythmMamba.

Read more

4/10/2024