Dual-TSST: A Dual-Branch Temporal-Spectral-Spatial Transformer Model for EEG Decoding

Read original: arXiv:2409.03251 - Published 9/6/2024 by Hongqi Li, Haodong Zhang, Yitong Chen

Dual-TSST: A Dual-Branch Temporal-Spectral-Spatial Transformer Model for EEG Decoding

Overview

This paper introduces Dual-TSST, a novel deep learning model for decoding electroencephalography (EEG) signals.
Dual-TSST is a dual-branch transformer-based architecture that captures both temporal and spectral-spatial features from EEG data.
The model demonstrates state-of-the-art performance on several EEG decoding tasks, including motor imagery classification and visual attention decoding.

Plain English Explanation

Electroencephalography (EEG) is a technique used to measure the electrical activity of the brain. Dual-TSST: A Dual-Branch Temporal-Spectral-Spatial Transformer Model for EEG Decoding is a new deep learning model that can analyze EEG data and decode what a person is thinking or doing based on their brain activity.

The key innovation of Dual-TSST is that it uses a special type of neural network called a "transformer" to capture both the temporal (time-based) and spectral-spatial (frequency and location-based) features of the EEG signals. By combining these two types of information, the model can more accurately interpret the complex patterns in the brain's electrical activity.

For example, when a person imagines moving their hand, specific patterns of brain activity occur over time and in different regions of the brain. Dual-TSST is able to detect these patterns and use them to predict that the person is imagining a hand movement, even if they are not physically moving their hand.

This type of EEG decoding has many potential applications, such as brain-computer interfaces that allow people to control computers or devices with their thoughts, or neural rehabilitation systems that help people recover from brain injuries.

Technical Explanation

Dual-TSST is a dual-branch neural network architecture that combines transformer-based models for temporal and spectral-spatial feature extraction from EEG data.

The temporal branch uses a transformer-based model to capture the temporal dynamics of the EEG signals over time. This allows the model to learn patterns in the way the brain's electrical activity changes over the course of a task or experience.

The spectral-spatial branch uses a convolutional neural network (CNN) to extract features related to the frequency and spatial distribution of the EEG signals. This allows the model to identify patterns in the way different regions of the brain are activated at different frequencies.

The outputs of these two branches are then fused together to create a comprehensive representation of the EEG data, which is then used for the final task-specific prediction (e.g., motor imagery classification, visual attention decoding).

The authors evaluate Dual-TSST on several public EEG datasets and demonstrate that it outperforms state-of-the-art methods for a variety of EEG decoding tasks. The model's ability to capture both temporal and spectral-spatial features appears to be a key factor in its strong performance.

Critical Analysis

The authors provide a thorough evaluation of Dual-TSST, testing it on multiple EEG datasets and comparing its performance to several baseline models. They also discuss potential limitations and areas for future research, such as the need for further investigation into the interpretability and generalizability of the model.

One potential concern is the computational complexity of the transformer-based components, which could limit the practical deployment of Dual-TSST in real-time applications. The authors mention that future work could explore ways to optimize the model's efficiency without sacrificing performance.

Additionally, the paper does not provide a detailed analysis of the specific temporal and spectral-spatial features learned by the model, which could offer valuable insights into the underlying neurophysiology of the EEG signals. Investigating these interpretable representations could be an interesting direction for future research.

Conclusion

Dual-TSST is a innovative deep learning model that demonstrates state-of-the-art performance on a variety of EEG decoding tasks. By effectively capturing both the temporal and spectral-spatial characteristics of EEG signals, the model is able to better interpret the complex patterns of brain activity.

This advance in EEG decoding could have significant implications for the development of brain-computer interfaces, neural rehabilitation systems, and other applications that rely on the ability to infer human intentions and cognitive states from brain activity. As the field of EEG-based neural engineering continues to evolve, models like Dual-TSST will play an increasingly important role in pushing the boundaries of what is possible.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Dual-TSST: A Dual-Branch Temporal-Spectral-Spatial Transformer Model for EEG Decoding

Hongqi Li, Haodong Zhang, Yitong Chen

The decoding of electroencephalography (EEG) signals allows access to user intentions conveniently, which plays an important role in the fields of human-machine interaction. To effectively extract sufficient characteristics of the multichannel EEG, a novel decoding architecture network with a dual-branch temporal-spectral-spatial transformer (Dual-TSST) is proposed in this study. Specifically, by utilizing convolutional neural networks (CNNs) on different branches, the proposed processing network first extracts the temporal-spatial features of the original EEG and the temporal-spectral-spatial features of time-frequency domain data converted by wavelet transformation, respectively. These perceived features are then integrated by a feature fusion block, serving as the input of the transformer to capture the global long-range dependencies entailed in the non-stationary EEG, and being classified via the global average pooling and multi-layer perceptron blocks. To evaluate the efficacy of the proposed approach, the competitive experiments are conducted on three publicly available datasets of BCI IV 2a, BCI IV 2b, and SEED, with the head-to-head comparison of more than ten other state-of-the-art methods. As a result, our proposed Dual-TSST performs superiorly in various tasks, which achieves the promising EEG classification performance of average accuracy of 80.67% in BCI IV 2a, 88.64% in BCI IV 2b, and 96.65% in SEED, respectively. Extensive ablation experiments conducted between the Dual-TSST and comparative baseline model also reveal the enhanced decoding performance with each module of our proposed method. This study provides a new approach to high-performance EEG decoding, and has great potential for future CNN-Transformer based applications.

9/6/2024

A Temporal-Spectral Fusion Transformer with Subject-Specific Adapter for Enhancing RSVP-BCI Decoding

Xujin Li, Wei Wei, Shuang Qiu, Huiguang He

The Rapid Serial Visual Presentation (RSVP)-based Brain-Computer Interface (BCI) is an efficient technology for target retrieval using electroencephalography (EEG) signals. The performance improvement of traditional decoding methods relies on a substantial amount of training data from new test subjects, which increases preparation time for BCI systems. Several studies introduce data from existing subjects to reduce the dependence of performance improvement on data from new subjects, but their optimization strategy based on adversarial learning with extensive data increases training time during the preparation procedure. Moreover, most previous methods only focus on the single-view information of EEG signals, but ignore the information from other views which may further improve performance. To enhance decoding performance while reducing preparation time, we propose a Temporal-Spectral fusion transformer with Subject-specific Adapter (TSformer-SA). Specifically, a cross-view interaction module is proposed to facilitate information transfer and extract common representations across two-view features extracted from EEG temporal signals and spectrogram images. Then, an attention-based fusion module fuses the features of two views to obtain comprehensive discriminative features for classification. Furthermore, a multi-view consistency loss is proposed to maximize the feature similarity between two views of the same EEG signal. Finally, we propose a subject-specific adapter to rapidly transfer the knowledge of the model trained on data from existing subjects to decode data from new subjects. Experimental results show that TSformer-SA significantly outperforms comparison methods and achieves outstanding performance with limited training data from new subjects. This facilitates efficient decoding and rapid deployment of BCI systems in practical use.

7/12/2024

EEG-DBNet: A Dual-Branch Network for Motor-Imagery Brain-Computer Interfaces

Xicheng Lou, Xinwei Li, Hongying Meng, Jun Hu, Meili Xu, Yue Zhao, Jiazhang Yang, Zhangyong Li

Motor imagery electroencephalogram (EEG)-based brain-computer interfaces (BCIs) offer significant advantages for individuals with restricted limb mobility. However, challenges such as low signal-to-noise ratio and limited spatial resolution impede accurate feature extraction from EEG signals, thereby affecting the classification accuracy of different actions. To address these challenges, this study proposes an end-to-end dual-branch network (EEG-DBNet) that decodes the temporal and spectral sequences of EEG signals in parallel through two distinct network branches. Each branch comprises a local convolutional block and a global convolutional block. The local convolutional block transforms the source signal from the temporal-spatial domain to the temporal-spectral domain. By varying the number of filters and convolution kernel sizes, the local convolutional blocks in different branches adjust the length of their respective dimension sequences. Different types of pooling layers are then employed to emphasize the features of various dimension sequences, setting the stage for subsequent global feature extraction. The global convolution block splits and reconstructs the feature of the signal sequence processed by the local convolution block in the same branch and further extracts features through the dilated causal convolutional neural networks. Finally, the outputs from the two branches are concatenated, and signal classification is completed via a fully connected layer. Our proposed method achieves classification accuracies of 85.84% and 91.60% on the BCI Competition 4-2a and BCI Competition 4-2b datasets, respectively, surpassing existing state-of-the-art models. The source code is available at https://github.com/xicheng105/EEG-DBNet.

6/21/2024

EEG classification for visual brain decoding with spatio-temporal and transformer based paradigms

Akanksha Sharma, Jyoti Nigam, Abhishek Rathore, Arnav Bhavsar

In this work, we delve into the EEG classification task in the domain of visual brain decoding via two frameworks, involving two different learning paradigms. Considering the spatio-temporal nature of EEG data, one of our frameworks is based on a CNN-BiLSTM model. The other involves a CNN-Transformer architecture which inherently involves the more versatile attention based learning paradigm. In both cases, a special 1D-CNN feature extraction module is used to generate the initial embeddings with 1D convolutions in the time and the EEG channel domains. Considering the EEG signals are noisy, non stationary and the discriminative features are even less clear (than in semantically structured data such as text or image), we also follow a window-based classification followed by majority voting during inference, to yield labels at a signal level. To illustrate how brain patterns correlate with different image classes, we visualize t-SNE plots of the BiLSTM embeddings alongside brain activation maps for the top 10 classes. These visualizations provide insightful revelations into the distinct neural signatures associated with each visual category, showcasing the BiLSTM's capability to capture and represent the discriminative brain activity linked to visual stimuli. We demonstrate the performance of our approach on the updated EEG-Imagenet dataset with positive comparisons with state-of-the-art methods.

8/12/2024