Feature-Based Echo-State Networks: A Step Towards Interpretability and Minimalism in Reservoir Computer

Read original: arXiv:2403.19806 - Published 4/1/2024 by Debdipta Goswami

Feature-Based Echo-State Networks: A Step Towards Interpretability and Minimalism in Reservoir Computer

INTRODUCTION

This paper proposes a novel architecture for echo-state networks (ESNs), a type of recurrent neural network used for model-free prediction of dynamical systems. The proposed "feature-based echo-state network" (Feat-ESN) uses smaller, parallel reservoirs, each processing a different set of input features. This allows the model to be more interpretable, as the relative contribution of each feature to the output prediction can be determined. The Feat-ESN approach also significantly reduces the number of reservoir nodes required for effective prediction, which is particularly useful for high-dimensional systems like traffic volume forecasting.

The paper evaluates the Feat-ESN approach on two synthetic chaotic datasets and a real-world traffic volume dataset. It also includes an ablation study examining the impact of reservoir size. Overall, the Feat-ESN architecture provides improved interpretability and efficiency compared to standard ESNs, making it a useful technique for modeling complex dynamical systems.

Echo-State Networks for Dynamical Systems Prediction

Echo-state networks (ESNs) are a type of recurrent neural network with a large, randomly connected reservoir of neurons. The network takes input signals and processes them nonlinearly through the reservoir, with the output then linearly combined from the reservoir states. The performance of an ESN depends heavily on the richness of the reservoir dynamics.

The ESN consists of an input layer, a recurrent reservoir layer, and an output layer. The reservoir evolves nonlinearly over time, with the output linearly read out from the reservoir states. The input weights and reservoir weights are randomly initialized and held fixed, while the output weights are trained using linear regression with regularization.

During training, the ESN is driven by an input sequence, which generates a sequence of reservoir states. These reservoir states are stored in a matrix, and the corresponding output sequence is also stored. The output weights are then computed using a linear regression procedure that incorporates Tikhonov regularization to ensure numerical stability.

(a)

The provided text discusses the echo state property (ESP) of echo state networks (ESNs). It states that ESNs are universal approximators, meaning they can accurately represent any nonlinear operator with bounded memory if they satisfy the ESP. The ESP requires that the reservoir, represented by the weight matrix W, asymptotically washes out any information from the initial conditions. For the hyperbolic tangent activation function, it is empirically observed that the ESP holds when the spectral radius of W is less than one. To ensure this condition, W is normalized by its spectral radius.

Feature-Based Echo-State Network: A Bite Sized Approach

The paper presents a new architecture called Feat-ESN for echo state networks (ESNs) that aims to reduce the overall size of the reservoir while maintaining the predictive performance. The key components of Feat-ESN are:

Input map: Features are selected from the input vector, where a feature is any combination of the input components. A feature matrix is constructed to feed each feature separately into smaller individual reservoirs.

Reservoirs: Each feature is fed into a separate smaller reservoir. The total number of reservoir nodes is the product of the number of features and the block size of the individual reservoirs. The reservoirs evolve linearly, with each smaller reservoir depending only on its corresponding feature.

Readout: A nonlinear readout network is used to maintain the expressivity of the ESN. The readout is a combination of the reservoir, its nonlinear map, and a bias. The output is then linearly read out from this expanded reservoir representation.

The paper shows that Feat-ESN satisfies the universal approximation property under certain conditions on the feature matrix and the readout functions. Additionally, the relative magnitudes of the output weights can be used to prune the reservoir by removing less important features.

V Numerical Examples

This section presents the performance and ablation study of the Feat-ESN algorithm on three time series datasets: two generated by chaotic dynamical systems (Lorenz and Rössler systems) and one from real-time traffic flow data.

For the Lorenz and Rössler systems, Feat-ESN achieves better prediction accuracy compared to a regular ESN, using a much smaller number of reservoir nodes. The normalized root mean square error (NRMSE) is lower for Feat-ESN across different block sizes. The contribution of the nonlinear features to the output is found to be higher than the linear features.

For the traffic flow data, Feat-ESN is applied to predict the time series of traffic volume recorded by a sensor. To handle the partial observation from a single sensor, a delay embedding is used in the input layer. Feat-ESN outperforms the regular ESN, showing lower NRMSE and higher Pearson correlation between the predicted and true traffic volumes. The contribution of the features (delay combinations) decreases as the delay increases, as expected.

Overall, the results demonstrate the improved performance and reduced complexity of the Feat-ESN algorithm compared to a regular ESN for time series prediction tasks.

Conclusion

The paper proposes a feature-based approach to generate the reservoir for an echo-state network (ESN). The algorithm, called Feat-ESN, uses parallel smaller linear neuronal reservoirs driven by different input combinations, called features. This significantly reduces the computational complexity of the ESN while maintaining the same predictive performance as a much larger reservoir. The proposed approach demonstrates improved prediction performance with fewer reservoir nodes compared to a regular ESN for chaotic time-series. The method is then applied to real traffic data from the University of Maryland campus to predict traffic volume at various intersections.

Acknowledgement

The section acknowledges support from Dr. Derek A. Paley and the University of Maryland Department of Transportation for providing Numina sensor data, as well as Dr. Artur Wolek for preprocessing the data.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Feature-Based Echo-State Networks: A Step Towards Interpretability and Minimalism in Reservoir Computer

Debdipta Goswami

This paper proposes a novel and interpretable recurrent neural-network structure using the echo-state network (ESN) paradigm for time-series prediction. While the traditional ESNs perform well for dynamical systems prediction, it needs a large dynamic reservoir with increased computational complexity. It also lacks interpretability to discern contributions from different input combinations to the output. Here, a systematic reservoir architecture is developed using smaller parallel reservoirs driven by different input combinations, known as features, and then they are nonlinearly combined to produce the output. The resultant feature-based ESN (Feat-ESN) outperforms the traditional single-reservoir ESN with less reservoir nodes. The predictive capability of the proposed architecture is demonstrated on three systems: two synthetic datasets from chaotic dynamical systems and a set of real-time traffic data.

4/1/2024

Brain-Inspired Spike Echo State Network Dynamics for Aero-Engine Intelligent Fault Prediction

Mo-Ran Liu, Tao Sun, Xi-Ming Sun

Aero-engine fault prediction aims to accurately predict the development trend of the future state of aero-engines, so as to diagnose faults in advance. Traditional aero-engine parameter prediction methods mainly use the nonlinear mapping relationship of time series data but generally ignore the adequate spatiotemporal features contained in aero-engine data. To this end, we propose a brain-inspired spike echo state network (Spike-ESN) model for aero-engine intelligent fault prediction, which is used to effectively capture the evolution process of aero-engine time series data in the framework of spatiotemporal dynamics. In the proposed approach, we design a spike input layer based on Poisson distribution inspired by the spike neural encoding mechanism of biological neurons, which can extract the useful temporal characteristics in aero-engine sequence data. Then, the temporal characteristics are input into a spike reservoir through the current calculation method of spike accumulation in neurons, which projects the data into a high-dimensional sparse space. In addition, we use the ridge regression method to read out the internal state of the spike reservoir. Finally, the experimental results of aero-engine states prediction demonstrate the superiority and potential of the proposed method.

6/21/2024

Temporal Convolution Derived Multi-Layered Reservoir Computing

Johannes Viehweg, Dominik Walther, Prof. Dr. -Ing. Patrick Mader

The prediction of time series is a challenging task relevant in such diverse applications as analyzing financial data, forecasting flow dynamics or understanding biological processes. Especially chaotic time series that depend on a long history pose an exceptionally difficult problem. While machine learning has shown to be a promising approach for predicting such time series, it either demands long training time and much training data when using deep recurrent neural networks. Alternative, when using a reservoir computing approach it comes with high uncertainty and typically a high number of random initializations and extensive hyper-parameter tuning when using a reservoir computing approach. In this paper, we focus on the reservoir computing approach and propose a new mapping of input data into the reservoir's state space. Furthermore, we incorporate this method in two novel network architectures increasing parallelizability, depth and predictive capabilities of the neural network while reducing the dependence on randomness. For the evaluation, we approximate a set of time series from the Mackey-Glass equation, inhabiting non-chaotic as well as chaotic behavior and compare our approaches in regard to their predictive capabilities to echo state networks and gated recurrent units. For the chaotic time series, we observe an error reduction of up to $85.45%$ and up to $87.90%$ in contrast to echo state networks and gated recurrent units respectively. Furthermore, we also observe tremendous improvements for non-chaotic time series of up to $99.99%$ in contrast to existing approaches.

7/10/2024

🔮

Time-Series Forecasting and Sequence Learning Using Memristor-based Reservoir System

Abdullah M. Zyarah, Dhireesha Kudithipudi

Pushing the frontiers of time-series information processing in the ever-growing domain of edge devices with stringent resources has been impeded by the systems' ability to process information and learn locally on the device. Local processing and learning of time-series information typically demand intensive computations and massive storage as the process involves retrieving information and tuning hundreds of parameters back in time. In this work, we developed a memristor-based echo state network accelerator that features efficient temporal data processing and in-situ online learning. The proposed design is benchmarked using various datasets involving real-world tasks, such as forecasting the load energy consumption and weather conditions. The experimental results illustrate that the hardware model experiences a marginal degradation in performance as compared to the software counterpart. This is mainly attributed to the limited precision and dynamic range of network parameters when emulated using memristor devices. The proposed system is evaluated for lifespan, robustness, and energy-delay product. It is observed that the system demonstrates reasonable robustness for device failure below 10%, which may occur due to stuck-at faults. Furthermore, 247X reduction in energy consumption is achieved when compared to a custom CMOS digital design implemented at the same technology node.

9/17/2024