A Comprehensive Methodological Survey of Human Activity Recognition Across Divers Data Modalities

Read original: arXiv:2409.09678 - Published 9/17/2024 by Jungpil Shin, Najmul Hassan, Abu Saleh Musa Miah, Satoshi Nishimura

Overview

  • Provides a comprehensive survey of human activity recognition research across diverse data modalities
  • Examines the methodological approaches, datasets, and performance metrics used in this field
  • Offers insights into the current state of the art and highlights areas for future research

Plain English Explanation

This research paper is a thorough review of the various methods and techniques used to recognize human activities from different types of data, such as video, audio, and sensor readings. The authors analyze the key components of these activity recognition systems, including the experimental designs, the machine learning models employed, and the performance metrics used to evaluate their effectiveness.

By synthesizing the findings from a large number of research studies, the authors provide a comprehensive overview of the current state of human activity recognition technology. They identify the strengths and limitations of various approaches, as well as the emerging trends and promising directions for future research in this field.

Technical Explanation

The paper presents a thorough survey of the methodological approaches used in human activity recognition across a diverse range of data modalities, including video, audio, wearable sensors, and environmental sensors. The authors conducted a systematic literature review, analyzing over 200 relevant studies published in leading conferences and journals.

For each study, the authors extracted key information such as the experimental design, the data modalities used, the machine learning techniques employed, and the performance metrics reported. They then synthesized this information to identify common trends, best practices, and emerging research directions in the field of human activity recognition.

The survey covers a wide range of topics, including data preprocessing, feature engineering, model architectures, and evaluation methodologies. The authors also discuss the trade-offs between different approaches, such as the accuracy-complexity trade-off in activity recognition systems.
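As a rough illustration of what the preprocessing and feature-engineering stages can look like for wearable-sensor data, the sketch below segments a tri-axial accelerometer stream into overlapping windows and computes a few statistical features per window. It is a minimal example under stated assumptions, not a method taken from the survey: the function names (`sliding_windows`, `window_features`), the window size, the step, and the synthetic signal are all hypothetical choices made for illustration.

```python
import math
from statistics import mean, pstdev


def sliding_windows(samples, window_size, step):
    """Yield fixed-length, possibly overlapping windows from a list of samples."""
    if window_size <= 0 or step <= 0:
        raise ValueError("window_size and step must be positive")
    for start in range(0, len(samples) - window_size + 1, step):
        yield samples[start:start + window_size]


def window_features(window):
    """Compute simple hand-crafted features from one window of (x, y, z) readings."""
    # Acceleration magnitude discards sensor orientation (an illustrative choice).
    magnitudes = [math.sqrt(x * x + y * y + z * z) for x, y, z in window]
    return {
        "mean": mean(magnitudes),
        "std": pstdev(magnitudes),
        "min": min(magnitudes),
        "max": max(magnitudes),
    }


# Synthetic signal (hypothetical): four low-motion samples, then four high-motion samples.
signal = [(0.0, 0.0, 1.0)] * 4 + [(3.0, 4.0, 0.0)] * 4

# Window of 4 samples with 50% overlap (step of 2); both values are assumptions.
features = [window_features(w) for w in sliding_windows(signal, window_size=4, step=2)]

for index, feats in enumerate(features):
    print(index, feats)
```

Feature vectors like these would typically be passed to a classical classifier, whereas deep learning approaches usually consume the raw windows directly. That difference is one concrete form of the accuracy-complexity trade-off mentioned above.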

Critical Analysis

The authors provide a thorough and well-structured survey of the human activity recognition literature, which is a valuable resource for researchers and practitioners in this field. However, the paper does not delve deeply into the limitations and potential biases of the existing research, such as the reliance on constrained datasets or the lack of diversity in the study populations.

Additionally, the authors do not discuss the ethical considerations surrounding the deployment of activity recognition systems, such as privacy concerns and the potential for misuse. These are important aspects that should be addressed in future research and surveys on this topic.

Conclusion

This comprehensive survey of human activity recognition research synthesizes current methodological approaches across diverse data modalities. The authors' analysis provides a solid foundation for understanding the state of the art in this field and identifying promising directions for future research and development. By highlighting key trends, trade-offs, and emerging directions, this survey can inform the design of more robust and effective activity recognition systems with broader real-world applications.



This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

A Comprehensive Methodological Survey of Human Activity Recognition Across Divers Data Modalities

Jungpil Shin, Najmul Hassan, Abu Saleh Musa Miah, Satoshi Nishimura

Human Activity Recognition (HAR) systems aim to understand human behaviour and assign a label to each action, attracting significant attention in computer vision due to their wide range of applications. HAR can leverage various data modalities, such as RGB images and video, skeleton, depth, infrared, point cloud, event stream, audio, acceleration, and radar signals. Each modality provides unique and complementary information suited to different application scenarios. Consequently, numerous studies have investigated diverse approaches for HAR using these modalities. This paper presents a comprehensive survey of the latest advancements in HAR from 2014 to 2024, focusing on machine learning (ML) and deep learning (DL) approaches categorized by input data modalities. We review both single-modality and multi-modality techniques, highlighting fusion-based and co-learning frameworks. Additionally, we cover advancements in hand-crafted action features, methods for recognizing human-object interactions, and activity detection. Our survey includes a detailed dataset description for each modality and a summary of the latest HAR systems, offering comparative results on benchmark datasets. Finally, we provide insightful observations and propose effective future research directions in HAR.

9/17/2024

SoK: Behind the Accuracy of Complex Human Activity Recognition Using Deep Learning

Duc-Anh Nguyen, Nhien-An Le-Khac

Human Activity Recognition (HAR) is a well-studied field with research dating back to the 1980s. Over time, HAR technologies have evolved significantly from manual feature extraction, rule-based algorithms, and simple machine learning models to powerful deep learning models, from one sensor type to a diverse array of sensing modalities. The scope has also expanded from recognising a limited set of activities to encompassing a larger variety of both simple and complex activities. However, there still exist many challenges that hinder advancement in complex activity recognition using modern deep learning methods. In this paper, we comprehensively systematise factors leading to inaccuracy in complex HAR, such as data variety and model capacity. Among many sensor types, we give more attention to wearable and camera due to their prevalence. Through this Systematisation of Knowledge (SoK) paper, readers can gain a solid understanding of the development history and existing challenges of HAR, different categorisations of activities, obstacles in deep learning-based complex HAR that impact accuracy, and potential research directions.

5/7/2024

A Critical Analysis on Machine Learning Techniques for Video-based Human Activity Recognition of Surveillance Systems: A Review

Shahriar Jahan, Roknuzzaman, Md Robiul Islam

Upsurging abnormal activities in crowded locations such as airports, train stations, bus stops, shopping malls, etc., urges the necessity for an intelligent surveillance system. An intelligent surveillance system can differentiate between normal and suspicious activities from real-time video analysis that will enable to take appropriate measures regarding the level of an anomaly instantaneously and efficiently. Video-based human activity recognition has intrigued many researchers with its pressing issues and a variety of applications ranging from simple hand gesture recognition to crucial behavior recognition in a surveillance system. This paper provides a critical survey of video-based Human Activity Recognition (HAR) techniques beginning with an examination of basic approaches for detecting and recognizing suspicious behavior followed by a critical analysis of machine learning and deep learning techniques such as Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), Hidden Markov Model (HMM), K-means Clustering etc. A detailed investigation and comparison are done on these learning techniques on the basis of feature extraction techniques, parameter initialization, and optimization algorithms, accuracy, etc. The purpose of this review is to prioritize positive schemes and to assist researchers with emerging advancements in this field's future endeavors. This paper also pragmatically discusses existing challenges in the field of HAR and examines the prospects in the field.

9/4/2024

Towards LLM-Powered Ambient Sensor Based Multi-Person Human Activity Recognition

Xi Chen (M-PSI), Julien Cumin (M-PSI), Fano Ramparany (M-PSI), Dominique Vaufreydaz (M-PSI)

Human Activity Recognition (HAR) is one of the central problems in fields such as healthcare, elderly care, and security at home. However, traditional HAR approaches face challenges including data scarcity, difficulties in model generalization, and the complexity of recognizing activities in multi-person scenarios. This paper proposes a system framework called LAHAR, based on large language models. Utilizing prompt engineering techniques, LAHAR addresses HAR in multi-person scenarios by enabling subject separation and action-level descriptions of events occurring in the environment. We validated our approach on the ARAS dataset, and the results demonstrate that LAHAR achieves comparable accuracy to the state-of-the-art method at higher resolutions and maintains robustness in multi-person scenarios.

7/16/2024