Analyzing Participants' Engagement during Online Meetings Using Unsupervised Remote Photoplethysmography with Behavioral Features

2404.04394

Published 5/15/2024 by Alexander Vedernikov, Zhaodong Sun, Virpi-Liisa Kykyri, Mikko Pohjola, Miriam Nokia, Xiaobai Li
Abstract

Engagement measurement finds application in healthcare, education, services. The use of physiological and behavioral features is viable, but the impracticality of traditional physiological measurement arises due to the need for contact sensors. We demonstrate the feasibility of unsupervised remote photoplethysmography (rPPG) as an alternative for contact sensors in deriving heart rate variability (HRV) features, then fusing these with behavioral features to measure engagement in online group meetings. Firstly, a unique Engagement Dataset of online interactions among social workers is collected with granular engagement labels, offering insight into virtual meeting dynamics. Secondly, a pre-trained rPPG model is customized to reconstruct rPPG signals from video meetings in an unsupervised manner, enabling the calculation of HRV features. Thirdly, the feasibility of estimating engagement from HRV features using short observation windows, with a notable enhancement when using longer observation windows of two to four minutes, is demonstrated. Fourthly, the effectiveness of behavioral cues is evaluated when fused with physiological data, which further enhances engagement estimation performance. An accuracy of 94% is achieved when only HRV features are used, eliminating the need for contact sensors or ground truth signals; use of behavioral cues raises the accuracy to 96%. Facial analysis offers precise engagement measurement, beneficial for future applications.

Overview

  • This paper describes a study that used unsupervised remote photoplethysmography (rPPG) and behavioral features to analyze participant engagement during online meetings.
  • The researchers developed a system to unobtrusively measure physiological signals and behaviors of meeting participants to assess their level of engagement.
  • The goal was to provide a way to objectively measure engagement in remote work and learning environments.

Plain English Explanation

The researchers in this study wanted to find a better way to understand how engaged people are during online meetings. Rather than relying on self-reported data or subjective observations, they used a technique called remote photoplethysmography (rPPG) to measure physiological signals like heart rate. They combined this with data on participants' behaviors, like their facial expressions and movements, to get a more complete picture of engagement.

The key insight is that people's physiology and behaviors can reveal how interested or focused they are, even if they're not outwardly showing it. By sensing physiology remotely through an ordinary camera, the researchers were able to unobtrusively monitor participants during online meetings without the need for any wearable devices.

This could be really useful for remote work and education, where it's hard to gauge how engaged people are when you can't see them in person. By getting a more objective measure of engagement, it may help meeting organizers, teachers, and others better understand how to keep people focused and involved, even from a distance.

Technical Explanation

The researchers developed a system that used remote photoplethysmography (rPPG) to recover a pulse signal from video of each participant during online meetings, and then derived heart rate variability (HRV) features from that signal. They combined this physiological data with behavioral features extracted from the same video, such as facial expressions, head movements, and body posture.
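To make the step from a pulse waveform to HRV features concrete, here is a minimal sketch in pure Python. It is not the paper's pipeline: the peak detector, the 0.4 s minimum beat spacing, and the function names `find_peaks` and `hrv_features` are illustrative assumptions, and the input is a synthetic sine wave rather than a real rPPG signal. SDNN and RMSSD are standard time-domain HRV measures, and the summary does not say which HRV features the authors actually used.

```python
import math


def find_peaks(signal, fs, min_interval_s=0.4):
    """Return indices of local maxima above the mean, at least min_interval_s apart.

    A deliberately simple detector (assumption); real rPPG signals usually
    need band-pass filtering and a more robust peak finder.
    """
    min_gap = int(min_interval_s * fs)
    mean = sum(signal) / len(signal)
    peaks = []
    for i in range(1, len(signal) - 1):
        is_local_max = signal[i - 1] < signal[i] >= signal[i + 1]
        if is_local_max and signal[i] > mean:
            if not peaks or i - peaks[-1] >= min_gap:
                peaks.append(i)
    return peaks


def hrv_features(signal, fs):
    """Compute mean heart rate, SDNN, and RMSSD from a pulse waveform."""
    peaks = find_peaks(signal, fs)
    # Inter-beat intervals in milliseconds.
    ibis = [(b - a) / fs * 1000.0 for a, b in zip(peaks, peaks[1:])]
    if len(ibis) < 2:
        raise ValueError("need at least three detected beats")
    mean_ibi = sum(ibis) / len(ibis)
    # SDNN: sample standard deviation of the inter-beat intervals.
    sdnn = math.sqrt(sum((x - mean_ibi) ** 2 for x in ibis) / (len(ibis) - 1))
    # RMSSD: root mean square of successive interval differences.
    diffs = [b - a for a, b in zip(ibis, ibis[1:])]
    rmssd = math.sqrt(sum(d * d for d in diffs) / len(diffs))
    return {
        "mean_hr_bpm": 60000.0 / mean_ibi,
        "sdnn_ms": sdnn,
        "rmssd_ms": rmssd,
    }


# Synthetic stand-in for an rPPG signal: a 1.2 Hz (72 bpm) sine at 30 fps, 60 s long.
fs = 30
pulse = [math.sin(2 * math.pi * 1.2 * n / fs) for n in range(60 * fs)]
features = hrv_features(pulse, fs)
print(features)
```

Because the synthetic pulse is perfectly regular, its variability measures come out near zero. With a real signal, the same function would be applied to each observation window separately.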

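This multimodal approach allowed them to build a more comprehensive model of engagement than using either physiological or behavioral data alone. The word "unsupervised" in the title describes how the rPPG signal is obtained: a pre-trained rPPG model is adapted to the meeting videos without ground-truth signals from contact sensors. Engagement itself is estimated against the granular engagement labels collected for the dataset. According to the abstract, HRV features alone reach 94% accuracy, and adding behavioral cues raises this to 96%.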
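As a rough illustration of feature-level fusion, the sketch below concatenates an HRV feature vector with a behavioral feature vector for each observation window and classifies the result with a nearest-centroid rule. Everything specific here is an assumption: the summary does not state which classifier or fusion scheme the authors used, the feature names in the comments are hypothetical, and the numbers are made up for the example and carry no empirical meaning.

```python
import math


def fuse(hrv, behavioral):
    """Feature-level fusion: concatenate the two feature vectors."""
    return list(hrv) + list(behavioral)


def standardize(rows):
    """Z-score each column; return the scaled rows and the (mean, std) per column."""
    stats = []
    for col in zip(*rows):
        m = sum(col) / len(col)
        s = math.sqrt(sum((x - m) ** 2 for x in col) / len(col)) or 1.0
        stats.append((m, s))
    scaled = [[(x - m) / s for x, (m, s) in zip(r, stats)] for r in rows]
    return scaled, stats


def fit_centroids(rows, labels):
    """Average the feature vectors of each class."""
    sums, counts = {}, {}
    for r, y in zip(rows, labels):
        acc = sums.setdefault(y, [0.0] * len(r))
        for j, x in enumerate(r):
            acc[j] += x
        counts[y] = counts.get(y, 0) + 1
    return {y: [v / counts[y] for v in acc] for y, acc in sums.items()}


def predict(centroids, row):
    """Return the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda y: math.dist(centroids[y], row))


# Made-up windows: ([sdnn_ms, rmssd_ms], [gaze_on_screen, head_motion], label).
windows = [
    ([52.0, 41.0], [0.92, 0.10], "engaged"),
    ([48.0, 38.0], [0.88, 0.15], "engaged"),
    ([30.0, 22.0], [0.40, 0.55], "disengaged"),
    ([27.0, 19.0], [0.35, 0.60], "disengaged"),
]
fused = [fuse(h, b) for h, b, _ in windows]
labels = [y for _, _, y in windows]
scaled, stats = standardize(fused)
centroids = fit_centroids(scaled, labels)

# Classify a new (also made-up) window using the training statistics.
new_window = fuse([50.0, 40.0], [0.90, 0.12])
new_scaled = [(x - m) / s for x, (m, s) in zip(new_window, stats)]
prediction = predict(centroids, new_scaled)
print(prediction)
```

Standardizing before fusion matters because HRV values in milliseconds would otherwise dominate behavioral features that lie between 0 and 1.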

The abstract reports two main findings. First, engagement can be estimated from HRV features even with short observation windows, and performance improves notably with longer windows of two to four minutes. Second, fusing behavioral cues with the physiological data improves engagement estimation further.

Overall, this work demonstrates the potential of camera-based sensing to provide objective insights about human behaviors and experiences in online environments. The techniques developed could be applied to other domains, such as sleep staging from video or measuring emotional reactions to online content.

Critical Analysis

The researchers acknowledge several limitations of their approach. First, the system relies on participants having access to cameras, which may not always be the case in remote work or learning settings. There are also privacy concerns around continuously monitoring people's physiology and behaviors that would need to be carefully addressed.

Additionally, the results come from a single dataset of online meetings among social workers, so the performance of the system for other groups and meeting formats remains to be seen. Further work is needed to develop more robust and generalizable models of engagement.

One area that could be explored further is the combination of physiological and behavioral signals with other modalities, such as audio-based emotion recognition or text analysis of chat messages. Integrating multiple sources of data could lead to a more holistic understanding of participant engagement.

Conclusion

This study demonstrates the potential of using unobtrusive physiological and behavioral sensing to objectively measure engagement during online meetings. By combining HRV features derived from remote photoplethysmography with behavioral cues from the same video, the researchers estimated participants' engagement with a reported accuracy of up to 96%, without contact sensors.

While there are still some challenges to address, this work represents an important step towards developing tools that can provide valuable insights for remote work, education, and other applications where understanding human engagement is crucial. As we continue to rely on virtual interactions, having better ways to assess and support engagement will become increasingly important for fostering productive and meaningful online experiences.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Evaluation of Video-Based rPPG in Challenging Environments: Artifact Mitigation and Network Resilience

Nhi Nguyen, Le Nguyen, Honghan Li, Miguel Bordallo López, Constantino Álvarez Casado

Video-based remote photoplethysmography (rPPG) has emerged as a promising technology for non-contact vital sign monitoring, especially under controlled conditions. However, the accurate measurement of vital signs in real-world scenarios faces several challenges, including artifacts induced by video codecs, low-light noise, degradation, low dynamic range, occlusions, and hardware and network constraints. In this article, we systematically and comprehensively investigate these issues, measuring their detrimental effects on the quality of rPPG measurements. Additionally, we propose practical strategies for mitigating these challenges to improve the dependability and resilience of video-based rPPG systems. We detail methods for effective biosignal recovery in the presence of network limitations and present denoising and inpainting techniques aimed at preserving video frame integrity. Through extensive evaluations and direct comparisons, we demonstrate the effectiveness of the approaches in enhancing rPPG measurements under challenging environments, contributing to the development of more reliable and effective remote vital sign monitoring technologies.

5/3/2024

Camera-Based Remote Physiology Sensing for Hundreds of Subjects Across Skin Tones

Jiankai Tang, Xinyi Li, Jiacheng Liu, Xiyuxing Zhang, Zeyu Wang, Yuntao Wang

Remote photoplethysmography (rPPG) emerges as a promising method for non-invasive, convenient measurement of vital signs, utilizing the widespread presence of cameras. Despite advancements, existing datasets fall short in terms of size and diversity, limiting comprehensive evaluation under diverse conditions. This paper presents an in-depth analysis of the VitalVideo dataset, the largest real-world rPPG dataset to date, encompassing 893 subjects and 6 Fitzpatrick skin tones. Our experimentation with six unsupervised methods and three supervised models demonstrates that datasets comprising a few hundred subjects (i.e., 300 for UBFC-rPPG, 500 for PURE, and 700 for MMPD-Simple) are sufficient for effective rPPG model training. Our findings highlight the importance of diversity and consistency in skin tones for precise performance evaluation across different datasets.

4/9/2024

SiNC+: Adaptive Camera-Based Vitals with Unsupervised Learning of Periodic Signals

Jeremy Speth, Nathan Vance, Patrick Flynn, Adam Czajka

Subtle periodic signals, such as blood volume pulse and respiration, can be extracted from RGB video, enabling noncontact health monitoring at low cost. Advancements in remote pulse estimation -- or remote photoplethysmography (rPPG) -- are currently driven by deep learning solutions. However, modern approaches are trained and evaluated on benchmark datasets with ground truth from contact-PPG sensors. We present the first non-contrastive unsupervised learning framework for signal regression to mitigate the need for labelled video data. With minimal assumptions of periodicity and finite bandwidth, our approach discovers the blood volume pulse directly from unlabelled videos. We find that encouraging sparse power spectra within normal physiological bandlimits and variance over batches of power spectra is sufficient for learning visual features of periodic signals. We perform the first experiments utilizing unlabelled video data not specifically created for rPPG to train robust pulse rate estimators. Given the limited inductive biases, we successfully applied the same approach to camera-based respiration by changing the bandlimits of the target signal. This shows that the approach is general enough for unsupervised learning of bandlimited quasi-periodic signals from different domains. Furthermore, we show that the framework is effective for finetuning models on unlabelled video from a single subject, allowing for personalized and adaptive signal regressors.

4/23/2024

Orientation-conditioned Facial Texture Mapping for Video-based Facial Remote Photoplethysmography Estimation

Sam Cantrill, David Ahmedt-Aristizabal, Lars Petersson, Hanna Suominen, Mohammad Ali Armin

Camera-based remote photoplethysmography (rPPG) enables contactless measurement of important physiological signals such as pulse rate (PR). However, dynamic and unconstrained subject motion introduces significant variability into the facial appearance in video, confounding the ability of video-based methods to accurately extract the rPPG signal. In this study, we leverage the 3D facial surface to construct a novel orientation-conditioned facial texture video representation which improves the motion robustness of existing video-based facial rPPG estimation methods. Our proposed method achieves a significant 18.2% performance improvement in cross-dataset testing on MMPD over our baseline using the PhysNet model trained on PURE, highlighting the efficacy and generalization benefits of our designed video representation. We demonstrate significant performance improvements of up to 29.6% in all tested motion scenarios in cross-dataset testing on MMPD, even in the presence of dynamic and unconstrained subject motion, emphasizing the benefits of disentangling motion through modeling the 3D facial surface for motion robust facial rPPG estimation. We validate the efficacy of our design decisions and the impact of different video processing steps through an ablation study. Our findings illustrate the potential strengths of exploiting the 3D facial surface as a general strategy for addressing dynamic and unconstrained subject motion in videos. The code is available at https://samcantrill.github.io/orientation-uv-rppg/.

5/2/2024