Detach-ROCKET: Sequential feature selection for time series classification with random convolutional kernels

Read original: arXiv:2309.14518 - Published 6/26/2024 by Gonzalo Uribarri, Federico Barone, Alessio Ansuini, Erik Frans'en

📶

Overview

Time Series Classification (TSC) is crucial in various fields like medicine, environmental science, and finance.
Machine learning models like Recurrent Neural Networks and InceptionTime can face scalability issues due to computational requirements.
ROCKET has emerged as an efficient alternative, achieving state-of-the-art performance and simplifying training.
However, many ROCKET features are redundant or non-informative, increasing computational load and compromising generalization.

Plain English Explanation

Time series data is information that changes over time, like a patient's heart rate or a company's stock price. Classifying this data is essential for important tasks like diagnosing diseases, detecting unusual events, and analyzing financial trends. While advanced machine learning models have been successful, they can be computationally demanding, making them difficult to use at scale.

ROCKET is a newer, more efficient approach that can achieve top-tier performance without the same level of complexity. It works by generating a large number of random features from the time series data and using those to train a simple classifier. However, many of these features aren't actually useful for making accurate predictions, adding unnecessary computational load and potentially reducing the model's ability to generalize to new data.

Technical Explanation

This paper introduces Sequential Feature Detachment (SFD), a method for identifying and removing non-essential features in ROCKET-based models, such as ROCKET, MiniRocket, and MultiRocket. SFD estimates feature importance using model coefficients and can handle large feature sets without complex hyperparameter tuning.

Testing on the UCR archive, a widely used benchmark for time series classification, shows that SFD can produce models with better test accuracy using only 10% of the original features. These pruned models are called Detach-ROCKET. The researchers also present an end-to-end procedure for determining an optimal balance between the number of features and model accuracy.

On the largest binary UCR dataset, Detach-ROCKET improved test accuracy by 0.6% while reducing the number of features by 98.9%. By enabling a significant reduction in model size without sacrificing accuracy, the Detach-ROCKET methodology improves computational efficiency and contributes to model interpretability.

Critical Analysis

The paper provides a thorough evaluation of the Detach-ROCKET approach on a wide range of time series classification benchmarks. However, the authors do not address how the method might perform on more complex, real-world time series datasets, which may have different characteristics than the UCR archive.

Additionally, the paper does not discuss the potential impact of feature selection on model robustness, an important consideration for applications like cyber attack detection or customer service volume management. Further research could explore the reliability and adversarial robustness of the pruned Detach-ROCKET models.

Conclusion

The Detach-ROCKET approach represents a significant advancement in time series classification by enabling a substantial reduction in model complexity without sacrificing accuracy. This improved computational efficiency and interpretability could make ROCKET-based models more accessible and useful for researchers and practitioners working with time series data across a variety of domains, from medical diagnosis to financial forecasting.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

📶

Detach-ROCKET: Sequential feature selection for time series classification with random convolutional kernels

Gonzalo Uribarri, Federico Barone, Alessio Ansuini, Erik Frans'en

Time Series Classification (TSC) is essential in fields like medicine, environmental science, and finance, enabling tasks such as disease diagnosis, anomaly detection, and stock price analysis. While machine learning models like Recurrent Neural Networks and InceptionTime are successful in numerous applications, they can face scalability issues due to computational requirements. Recently, ROCKET has emerged as an efficient alternative, achieving state-of-the-art performance and simplifying training by utilizing a large number of randomly generated features from the time series data. However, many of these features are redundant or non-informative, increasing computational load and compromising generalization. Here we introduce Sequential Feature Detachment (SFD) to identify and prune non-essential features in ROCKET-based models, such as ROCKET, MiniRocket, and MultiRocket. SFD estimates feature importance using model coefficients and can handle large feature sets without complex hyperparameter tuning. Testing on the UCR archive shows that SFD can produce models with better test accuracy using only 10% of the original features. We named these pruned models Detach-ROCKET. We also present an end-to-end procedure for determining an optimal balance between the number of features and model accuracy. On the largest binary UCR dataset, Detach-ROCKET improves test accuracy by 0.6% while reducing features by 98.9%. By enabling a significant reduction in model size without sacrificing accuracy, our methodology improves computational efficiency and contributes to model interpretability. We believe that Detach-ROCKET will be a valuable tool for researchers and practitioners working with time series data, who can find a user-friendly implementation of the model at url{https://github.com/gon-uri/detach_rocket}.

6/26/2024

Classification of Raw MEG/EEG Data with Detach-Rocket Ensemble: An Improved ROCKET Algorithm for Multivariate Time Series Analysis

Adri`a Solana, Erik Frans'en, Gonzalo Uribarri

Multivariate Time Series Classification (MTSC) is a ubiquitous problem in science and engineering, particularly in neuroscience, where most data acquisition modalities involve the simultaneous time-dependent recording of brain activity in multiple brain regions. In recent years, Random Convolutional Kernel models such as ROCKET and MiniRocket have emerged as highly effective time series classification algorithms, capable of achieving state-of-the-art accuracy results with low computational load. Despite their success, these types of models face two major challenges when employed in neuroscience: 1) they struggle to deal with high-dimensional data such as EEG and MEG, and 2) they are difficult to interpret. In this work, we present a novel ROCKET-based algorithm, named Detach-Rocket Ensemble, that is specifically designed to address these two problems in MTSC. Our algorithm leverages pruning to provide an integrated estimation of channel importance, and ensembles to achieve better accuracy and provide a label probability. Using a synthetic multivariate time series classification dataset in which we control the amount of information carried by each of the channels, we first show that our algorithm is able to correctly recover the channel importance for classification. Then, using two real-world datasets, a MEG dataset and an EEG dataset, we show that Detach-Rocket Ensemble is able to provide both interpretable channel relevance and competitive classification accuracy, even when applied directly to the raw brain data, without the need for feature engineering.

8/7/2024

🏷️

POCKET: Pruning Random Convolution Kernels for Time Series Classification from a Feature Selection Perspective

Shaowu Chen, Weize Sun, Lei Huang, Xiaopeng Li, Qingyuan Wang, Deepu John

In recent years, two competitive time series classification models, namely, ROCKET and MINIROCKET, have garnered considerable attention due to their low training cost and high accuracy. However, they rely on a large number of random 1-D convolutional kernels to comprehensively capture features, which is incompatible with resource-constrained devices. Despite the development of heuristic algorithms designed to recognize and prune redundant kernels, the inherent time-consuming nature of evolutionary algorithms hinders efficient evaluation. To efficiently prune models, this paper eliminates feature groups contributing minimally to the classifier, thereby discarding the associated random kernels without direct evaluation. To this end, we incorporate both group-level ($l_{2,1}$-norm) and element-level ($l_2$-norm) regularizations to the classifier, formulating the pruning challenge as a group elastic net classification problem. An ADMM-based algorithm is initially introduced to solve the problem, but it is computationally intensive. Building on the ADMM-based algorithm, we then propose our core algorithm, POCKET, which significantly speeds up the process by dividing the task into two sequential stages. In Stage 1, POCKET utilizes dynamically varying penalties to efficiently achieve group sparsity within the classifier, removing features associated with zero weights and their corresponding kernels. In Stage 2, the remaining kernels and features are used to refit a $l_2$-regularized classifier for enhanced performance. Experimental results on diverse time series datasets show that POCKET prunes up to 60% of kernels without a significant reduction in accuracy and performs 11$times$ faster than its counterparts. Our code is publicly available at https://github.com/ShaowuChen/POCKET.

7/26/2024

Time series classification with random convolution kernels based transforms: pooling operators and input representations matter

Mouhamadou Mansour Lo, Gildas Morvan, Mathieu Rossi, Fabrice Morganti, David Mercier

This article presents a new approach based on MiniRocket, called SelF-Rocket, for fast time series classification (TSC). Unlike existing approaches based on random convolution kernels, it dynamically selects the best couple of input representations and pooling operator during the training process. SelF-Rocket achieves state-of-the-art accuracy on the University of California Riverside (UCR) TSC benchmark datasets.

9/4/2024