SpGesture: Source-Free Domain-adaptive sEMG-based Gesture Recognition with Jaccard Attentive Spiking Neural Network

Read original: arXiv:2405.14398 - Published 5/24/2024 by Weiyu Guo, Ying Sun, Yijie Xu, Ziyue Qiao, Yongkui Yang, Hui Xiong

Overview

This paper presents a novel approach called "SpGesture" for domain-adaptive sEMG-based gesture recognition.
SpGesture uses a Jaccard Attentive Spiking Neural Network (JASNet) together with source-free domain adaptation to recognize gestures accurately without requiring labeled data from the target domain.
The key innovation is the use of Jaccard attention, which enables the model to focus on the most relevant features for each input sample during the recognition process.

Plain English Explanation

SpGesture is a new technique for recognizing hand gestures using electrical signals from the muscles in your arms, known as sEMG (surface electromyography) data. The unique aspect of SpGesture is that it can work well even when the data you use to train the model is different from the data you want to use the model on.

Typically, machine learning models need a lot of labeled data from the same source as the data they'll be used on. But that can be impractical or expensive to get. SpGesture solves this by using a special type of neural network called a "Jaccard Attentive Spiking Neural Network" (JASNet). JASNet has a clever attention mechanism that allows it to focus on the most relevant features in the sEMG data, even if the data is from a different source than what the model was trained on.

This means you can train SpGesture on sEMG data from one group of people, and then use it to accurately recognize gestures from a completely different group, without needing to collect new labeled data from that group. The Jaccard attention mechanism is the key innovation that makes this possible.

Technical Explanation

The core of the SpGesture approach is the Jaccard Attentive Spiking Neural Network (JASNet) architecture. JASNet is a type of spiking neural network that uses a novel attention mechanism based on the Jaccard similarity coefficient to adaptively focus on the most relevant features in the input sEMG data.
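For reference, the Jaccard similarity coefficient of two sets is the size of their intersection divided by the size of their union. The sketch below applies it to binary spike vectors, treating each vector as the set of positions where a spike occurred. The function name `jaccard_similarity` and the convention of returning 0.0 for two all-zero vectors are assumptions made for illustration; the paper may define its spiking variant differently.

```python
def jaccard_similarity(a, b):
    """Jaccard coefficient |A ∩ B| / |A ∪ B| for two equal-length binary vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    # Intersection: positions where both vectors spike.
    intersection = sum(1 for x, y in zip(a, b) if x and y)
    # Union: positions where at least one vector spikes.
    union = sum(1 for x, y in zip(a, b) if x or y)
    # Assumed convention: two silent vectors have similarity 0.0.
    return intersection / union if union else 0.0


spikes_a = [1, 0, 1, 1, 0]
spikes_b = [1, 1, 0, 1, 0]
print(jaccard_similarity(spikes_a, spikes_b))  # 2 shared spikes / 4 active positions = 0.5
```

Because the inputs are binary, the computation needs only counting and one division, which is part of why a set-overlap measure is a natural fit for spike data.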

Spiking neural networks are a biologically inspired paradigm that aims to mimic more closely the way neurons fire in the human brain. In the JASNet architecture, the input sEMG data is first converted into spike trains, which are then processed by the spiking neural network layers.
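To make the spike-train idea concrete, here is a minimal sketch of a leaky integrate-and-fire (LIF) neuron, a common spiking neuron model, driven by a sampled signal. The summary does not say which encoding or neuron model SpGesture uses, so the function `lif_encode`, the parameters `threshold` and `decay`, and the sample values are illustrative assumptions, not the paper's implementation.

```python
def lif_encode(signal, threshold=1.0, decay=0.5):
    """Convert a sequence of input currents into a binary spike train
    using a leaky integrate-and-fire neuron (illustrative parameters)."""
    membrane = 0.0
    spikes = []
    for current in signal:
        # Leak part of the stored potential, then integrate the new input.
        membrane = decay * membrane + current
        if membrane >= threshold:
            spikes.append(1)
            membrane = 0.0  # hard reset after a spike
        else:
            spikes.append(0)
    return spikes


# Rectified sEMG-like amplitudes (made-up values for illustration).
samples = [0.25, 0.5, 1.0, 0.0, 0.75, 0.75]
print(lif_encode(samples))  # [0, 0, 1, 0, 0, 1]
```

The membrane potential acts as short-term state: weak inputs accumulate until the threshold is crossed, so the output is a sparse binary sequence rather than a dense real-valued one.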

The key innovation is the Jaccard attention mechanism, which scores features by Jaccard similarity so that the model attends to the most discriminative features for each input. According to the paper's abstract, adaptation to a new domain comes from a separate component: source-free domain adaptation, which uses membrane potential as a memory list and requires no labeled target domain data.
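One way to picture attention driven by Jaccard similarity is to score a query spike vector against several reference spike vectors and normalize the scores into weights. In the sketch below, the reference vectors (`keys`), the normalization by the sum of scores, the uniform fallback, and all function names are assumptions chosen for illustration; this is not the paper's Spiking Jaccard Attention.

```python
def jaccard(a, b):
    """Jaccard coefficient of two equal-length binary spike vectors."""
    inter = sum(1 for x, y in zip(a, b) if x and y)
    union = sum(1 for x, y in zip(a, b) if x or y)
    return inter / union if union else 0.0


def jaccard_attention(query, keys, values):
    """Weight each value vector by the Jaccard similarity between the
    query and the matching key, then return the weighted sum.
    Expects at least one key/value pair."""
    scores = [jaccard(query, k) for k in keys]
    total = sum(scores)
    # Assumed normalization: divide by the score sum; uniform weights if all scores are 0.
    weights = [s / total for s in scores] if total else [1.0 / len(keys)] * len(keys)
    dim = len(values[0])
    output = [sum(w * v[i] for w, v in zip(weights, values)) for i in range(dim)]
    return weights, output


query = [1, 0, 1, 0]
keys = [[1, 0, 1, 0], [0, 1, 0, 1], [1, 0, 0, 0]]
values = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
weights, output = jaccard_attention(query, keys, values)
print(weights)  # the identical key gets the largest weight, the disjoint key gets 0
print(output)
```

Here the first key matches the query exactly, the second shares no spikes with it, and the third overlaps partially, so the output is dominated by the first value vector.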

According to the abstract, SpGesture was evaluated on a newly collected sEMG gesture dataset that covers different forearm postures, where it achieved the highest accuracy among the baselines (89.26%). A CPU deployment showed system latency below 100 ms, which is within real-time requirements. Related sEMG work includes Efficient sEMG-based Cross-Subject Joint Angle Estimation, MeMEXtra: Meet Mixture Experts using Extra Tree-based sEMG, and others.

Critical Analysis

The SpGesture paper presents a compelling approach for domain-adaptive sEMG-based gesture recognition. The use of Jaccard attention in a spiking neural network architecture is a novel and promising technique that addresses an important practical challenge in real-world deployment of these systems.

However, the paper does not extensively explore the limitations of the approach. For example, it is unclear how well SpGesture would scale to a larger number of gesture classes or how robust it is to noisy or corrupted sEMG input data. Additionally, the paper does not discuss potential privacy or ethical concerns around the use of sEMG data for gesture recognition, which could be an important consideration for real-world applications.

Further research is needed to better understand the capabilities and limitations of the SpGesture approach, as well as to explore ways to make it more robust and scalable. Comparative studies against other domain adaptation techniques could also provide valuable insights. Related gesture research, such as Semantic Gesticulator: Semantics-Aware Co-Speech Gesture Generation and SigGesture: Generalized Co-Speech Gesture Synthesis via Style Tokens, targets co-speech gesture generation rather than sEMG-based recognition.

Conclusion

The SpGesture framework presents a novel and promising approach for domain-adaptive sEMG-based gesture recognition using a Jaccard Attentive Spiking Neural Network. By dynamically focusing on the most relevant features in the input data, SpGesture can achieve accurate gesture recognition without requiring labeled data from the target domain, making it a practical solution for real-world applications.

While further research is needed to explore the limitations and potential ethical considerations of the approach, the core ideas behind SpGesture, such as the use of Jaccard attention in spiking neural networks, could have broader implications for other domains, such as the micro-expression analysis studied in Identity-Free Artificial Emotional Intelligence via Micro-Expressions.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

SpGesture: Source-Free Domain-adaptive sEMG-based Gesture Recognition with Jaccard Attentive Spiking Neural Network

Weiyu Guo, Ying Sun, Yijie Xu, Ziyue Qiao, Yongkui Yang, Hui Xiong

Surface electromyography (sEMG) based gesture recognition offers a natural and intuitive interaction modality for wearable devices. Despite significant advancements in sEMG-based gesture-recognition models, existing methods often suffer from high computational latency and increased energy consumption. Additionally, the inherent instability of sEMG signals, combined with their sensitivity to distribution shifts in real-world settings, compromises model robustness. To tackle these challenges, we propose a novel SpGesture framework based on Spiking Neural Networks, which possesses several unique merits compared with existing methods: (1) Robustness: By utilizing membrane potential as a memory list, we pioneer the introduction of Source-Free Domain Adaptation into SNN for the first time. This enables SpGesture to mitigate the accuracy degradation caused by distribution shifts. (2) High Accuracy: With a novel Spiking Jaccard Attention, SpGesture enhances the SNNs' ability to represent sEMG features, leading to a notable rise in system accuracy. To validate SpGesture's performance, we collected a new sEMG gesture dataset which has different forearm postures, where SpGesture achieved the highest accuracy among the baselines (89.26%). Moreover, the actual deployment on the CPU demonstrated a system latency below 100 ms, well within real-time requirements. This impressive performance showcases SpGesture's potential to enhance the applicability of sEMG in real-world scenarios. The code is available at https://anonymous.4open.science/r/SpGesture.

5/24/2024

Efficient sEMG-based Cross-Subject Joint Angle Estimation via Hierarchical Spiking Attentional Feature Decomposition Network

Xin Zhou, Chuang Lin, Can Wang, Xiaojiang Peng

As human-machine interaction systems are developing towards lightweight and pervasive direction, the role of simultaneous and proportional control (SPC) in human-machine interaction becomes increasingly prominent. However, existing continuous joint angle prediction algorithms based on surface electromyography (sEMG) typically incur high inference costs or are only applicable to specific subjects rather than cross-subject scenarios. Therefore, we proposed a hierarchical Spiking Attentional FEature decomposition Network (SAFE-Net) in order to reduce inference costs and improve recognition accuracy in cross-subject scenarios. This network first encodes the sEMG signals into neural spiking forms through a Spiking Sparse Attention Encoder (SSAE). The compressed features are then decomposed into kinematic features and biological features by a Spiking Attentional Feature Decomposition (SAFD) module. Finally, the kinematic features and biological features are decoded into joint angle values and subject identity, respectively. We validated the effectiveness of SAFE-Net on two datasets (SIAT-DB1 and SIAT-DB2) and compared it with two state-of-the-art methods, Informer and Spikformer. Experimental results demonstrate that, on the one hand, SSAE saves 39.1% and 37.5% power consumption respectively over them in terms of inference costs. On the other hand, SAFE-Net outperforms Informer and Spikformer in recognition accuracy on both datasets. This study showcased that the proposed SAFE-Net can provide accurate predictions in cross-subject scenarios, offering a promising vision for precise continuous control of lower limb rehabilitation exoskeleton robots.

7/9/2024

Energy-Efficient Spiking Recurrent Neural Network for Gesture Recognition on Embedded GPUs

Marzieh Hassanshahi Varposhti, Mahyar Shahsavari, Marcel van Gerven

Implementing AI algorithms on event-based embedded devices enables real-time processing of data, minimizes latency, and enhances power efficiency in edge computing. This research explores the deployment of a spiking recurrent neural network (SRNN) with liquid time constant neurons for gesture recognition. We focus on the energy efficiency and computational efficacy of NVIDIA Jetson Nano embedded GPU platforms. The embedded GPU showcases a 14-fold increase in power efficiency relative to a conventional GPU, making a compelling argument for its use in energy-constrained applications. The study's empirical findings also highlight that batch processing significantly boosts frame rates across various batch sizes while maintaining accuracy levels well above the baseline. These insights validate the SRNN with liquid time constant neurons as a robust model for interpreting temporal-spatial data in gesture recognition, striking a critical balance between processing speed and power frugality.

8/26/2024

Resonate-and-Fire Spiking Neurons for Target Detection and Hand Gesture Recognition: A Hybrid Approach

Ahmed Shaaban, Zeineb Chaabouni, Maximilian Strobel, Wolfgang Furtner, Robert Weigel, Fabian Lurz

Hand gesture recognition using radar often relies on computationally expensive fast Fourier transforms. This paper proposes an alternative approach that bypasses fast Fourier transforms using resonate-and-fire neurons. These neurons directly detect the hand in the time-domain signal, eliminating the need for fast Fourier transforms to retrieve range information. Following detection, a simple Goertzel algorithm is employed to extract five key features, eliminating the need for a second fast Fourier transform. These features are then fed into a recurrent neural network, achieving an accuracy of 98.21% for classifying five gestures. The proposed approach demonstrates competitive performance with reduced complexity compared to traditional methods.

5/31/2024