NeuroLGP-SM: Scalable Surrogate-Assisted Neuroevolution for Deep Neural Networks

2404.08786

Published 5/3/2024 by Fergal Stapleton, Edgar Galván

Abstract

Evolutionary Algorithms (EAs) play a crucial role in the architectural configuration and training of Artificial Deep Neural Networks (DNNs), a process known as neuroevolution. However, neuroevolution is hindered by its inherent computational expense, requiring multiple generations, a large population, and numerous epochs. The most computationally intensive aspect lies in evaluating the fitness function of a single candidate solution. To address this challenge, we employ Surrogate-assisted EAs (SAEAs). While a few SAEA approaches have been proposed in neuroevolution, none have been applied to truly large DNNs due to issues like intractable information usage. In this work, drawing inspiration from Genetic Programming semantics, we use phenotypic distance vectors, outputted from DNNs, alongside Kriging Partial Least Squares (KPLS), an approach that is effective in handling these large vectors, making them suitable for search. Our proposed approach, named Neuro-Linear Genetic Programming surrogate model (NeuroLGP-SM), efficiently and accurately estimates DNN fitness without the need for complete evaluations. NeuroLGP-SM demonstrates competitive or superior results compared to 12 other methods, including NeuroLGP without SM, convolutional neural networks, support vector machines, and autoencoders. Additionally, it is worth noting that NeuroLGP-SM is 25% more energy-efficient than its NeuroLGP counterpart. This efficiency advantage adds to the overall appeal of our proposed NeuroLGP-SM in optimising the configuration of large DNNs.

Overview

This paper introduces NeuroLGP-SM, a scalable surrogate-assisted neuroevolution approach for optimizing deep neural networks.
The key ideas are using a linear genetic programming (LGP) algorithm to evolve network architectures, and a surrogate model to efficiently evaluate the performance of candidate architectures.
The paper reports that NeuroLGP-SM achieves competitive or superior results compared to 12 other methods, including NeuroLGP without the surrogate model, and that it is 25% more energy-efficient than that counterpart.

Plain English Explanation

NeuroLGP-SM is a new method for designing deep neural network architectures. It uses an evolutionary approach inspired by how genes evolve in nature.

The algorithm starts with a population of random neural network designs. It then iteratively selects the best-performing designs, makes small random changes to them, and tests the modified designs. Over many generations, the algorithm gradually evolves networks that are better at the target task, like image recognition or language modeling.

The key innovation in NeuroLGP-SM is using a surrogate model to quickly predict the performance of each network design, rather than having to train and test the full network. This makes the evolutionary process much more efficient, allowing NeuroLGP-SM to explore many more network designs than traditional methods.

The authors show that NeuroLGP-SM can find high-performing network architectures more quickly than other neuroevolution approaches on several standard benchmarks. This could lead to faster development of powerful deep learning models for real-world applications.

Technical Explanation

The core of NeuroLGP-SM is a linear genetic programming (LGP) algorithm that evolves the architecture of a deep neural network. The algorithm maintains a population of candidate network designs, represented as sequences of LGP instructions. In each generation, the best-performing designs are selected, mutated, and evaluated to produce the next generation.
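
The select, mutate, and evaluate loop described above can be sketched in a few lines of Python. This is a simplified illustration, not the authors' implementation: the instruction set LAYER_OPS, the function names, and the toy fitness are all hypothetical, and a real LGP encoding usually has instructions that operate on registers. In practice the fitness would come from training and validating the decoded network.

```python
import random

# Hypothetical instruction set: each instruction names one layer type.
LAYER_OPS = ["conv3x3", "conv5x5", "maxpool", "dense", "dropout"]

def random_genome(rng, length=6):
    """Create a linear genome: a fixed-length sequence of instructions."""
    return [rng.choice(LAYER_OPS) for _ in range(length)]

def mutate(genome, rng, rate=0.2):
    """Point mutation: replace each instruction with probability `rate`."""
    return [rng.choice(LAYER_OPS) if rng.random() < rate else op
            for op in genome]

def toy_fitness(genome):
    """Placeholder score (an assumption); stands in for training a DNN."""
    return genome.count("conv3x3") + 0.5 * genome.count("dense")

def evolve(generations=20, pop_size=10, n_parents=3, seed=0):
    rng = random.Random(seed)
    population = [random_genome(rng) for _ in range(pop_size)]
    for _ in range(generations):
        # Selection: keep the best-scoring designs as parents (elitism).
        parents = sorted(population, key=toy_fitness, reverse=True)[:n_parents]
        # Variation: fill the rest of the population with mutated copies.
        children = [mutate(rng.choice(parents), rng)
                    for _ in range(pop_size - n_parents)]
        population = parents + children
    return max(population, key=toy_fitness)

best = evolve()
```

Because the parents are carried over unchanged, the best score in the population never decreases from one generation to the next. The expensive step in a real run is the fitness call, which is what a surrogate model is meant to replace.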

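To enable scalable optimization, NeuroLGP-SM uses a surrogate model to estimate the fitness of each candidate network instead of fully training and evaluating every one. According to the abstract, the surrogate is built with Kriging Partial Least Squares (KPLS) on phenotypic distance vectors derived from the outputs of the DNNs, an idea inspired by Genetic Programming semantics. KPLS is chosen because it handles these large vectors effectively. The surrogate is fitted to a limited set of actual performance evaluations and then used to approximate the fitness of designs that have not been fully evaluated.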
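
To make the surrogate idea concrete, the sketch below predicts the fitness of an unevaluated network from the fitness of already evaluated ones, weighting each by how close its outputs are. It is only an illustration under stated assumptions: the abstract names KPLS as the surrogate, whereas this example swaps in a much simpler inverse-distance-weighted predictor, and the class name IDWSurrogate, the use of Euclidean distance, and the example vectors are all hypothetical. A "phenotype" here is assumed to be a vector of a network's outputs on a fixed set of inputs.

```python
import math

def phenotypic_distance(a, b):
    """Euclidean distance between two networks' output vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

class IDWSurrogate:
    """Inverse-distance-weighted predictor; a simple stand-in for KPLS."""

    def __init__(self):
        self.archive = []  # (phenotype, true_fitness) pairs from real runs

    def add(self, phenotype, fitness):
        """Store one fully evaluated network."""
        self.archive.append((list(phenotype), fitness))

    def predict(self, phenotype, eps=1e-9):
        """Estimate fitness as a distance-weighted mean of known fitnesses."""
        if not self.archive:
            raise ValueError("surrogate needs at least one evaluated network")
        num = den = 0.0
        for known, fit in self.archive:
            d = phenotypic_distance(phenotype, known)
            if d < eps:
                return fit  # identical outputs: reuse the known fitness
            w = 1.0 / d ** 2
            num += w * fit
            den += w
        return num / den

# Two networks were fully evaluated (made-up numbers for illustration).
surrogate = IDWSurrogate()
surrogate.add([0.9, 0.1, 0.8], 0.92)
surrogate.add([0.2, 0.7, 0.3], 0.55)

# A new candidate whose outputs resemble the first network's.
estimate = surrogate.predict([0.8, 0.2, 0.7])
```

The estimate lands close to the fitness of the nearer network, which is the behaviour a surrogate relies on: candidates that behave alike are assumed to score alike, so only a subset needs a full evaluation.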

The authors evaluate NeuroLGP-SM on several popular deep learning benchmarks, including image classification, language modeling, and reinforcement learning tasks. They show that NeuroLGP-SM can find high-performing network architectures more efficiently than other neuroevolution methods, achieving state-of-the-art results on some tasks.

Critical Analysis

The key strengths of NeuroLGP-SM are its scalability and efficiency, enabled by the surrogate model. By avoiding the need to fully train and evaluate every candidate network design, the method can explore a much larger search space than traditional approaches.

However, the authors acknowledge several limitations of their work. First, the surrogate model may not always accurately predict the true performance of network designs, which could lead the evolutionary process astray. Second, the method is still computationally expensive compared to gradient-based neural architecture search techniques.

Additionally, the paper does not provide a detailed analysis of the network architectures found by NeuroLGP-SM, nor does it explore the generalization of the approach to other domains beyond the specific benchmarks evaluated. Further research would be needed to fully understand the strengths and weaknesses of the method.

Overall, NeuroLGP-SM represents an interesting step forward in the field of neuroevolution, but there are still opportunities for improvement and further investigation.

Conclusion

The NeuroLGP-SM method introduced in this paper demonstrates the potential of combining surrogate-assisted optimization with linear genetic programming for efficient neural network architecture search. By leveraging a surrogate model to quickly evaluate candidate designs, the approach can find high-performing network architectures more quickly than traditional neuroevolution methods.

This work could have important implications for accelerating the development of powerful deep learning models, particularly in domains where computational resources are limited. Further research to address the method's limitations and expand its applicability to other tasks and domains would be valuable contributions to the field.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Large Language Models as Surrogate Models in Evolutionary Algorithms: A Preliminary Study

Hao Hao, Xiaoqun Zhang, Aimin Zhou

Large Language Models (LLMs) have achieved significant progress across various fields and have exhibited strong potential in evolutionary computation, such as generating new solutions and automating algorithm design. Surrogate-assisted selection is a core step in evolutionary algorithms to solve expensive optimization problems by reducing the number of real evaluations. Traditionally, this has relied on conventional machine learning methods, leveraging previously evaluated solutions to predict the performance of new solutions. In this work, we propose a novel surrogate model based purely on LLM inference capabilities, eliminating the need for training. Specifically, we formulate model-assisted selection as a classification and regression problem, utilizing LLMs to directly evaluate the quality of new solutions based on historical data. This involves predicting whether a solution is good or bad, or approximating its value. This approach is then integrated into evolutionary algorithms, termed LLM-assisted EA (LAEA). Detailed experiments compared the visualization results of 2D data from 9 mainstream LLMs, as well as their performance on optimization problems. The experimental results demonstrate that LLMs have significant potential as surrogate models in evolutionary computation, achieving performance comparable to traditional surrogate models only using inference. This work offers new insights into the application of LLMs in evolutionary computation. Code is available at: https://github.com/hhyqhh/LAEA.git

6/18/2024

cs.NE

A First Look at Kolmogorov-Arnold Networks in Surrogate-assisted Evolutionary Algorithms

Hao Hao, Xiaoqun Zhang, Bingdong Li, Aimin Zhou

Surrogate-assisted Evolutionary Algorithm (SAEA) is an essential method for solving expensive problems. Utilizing surrogate models to substitute the optimization function can significantly reduce reliance on the function evaluations during the search process, thereby lowering the optimization costs. The construction of surrogate models is a critical component in SAEAs, with numerous machine learning algorithms playing a pivotal role in the model-building phase. This paper introduces Kolmogorov-Arnold Networks (KANs) as surrogate models within SAEAs, examining their application and effectiveness. We employ KANs for regression and classification tasks, focusing on the selection of promising solutions during the search process, which consequently reduces the number of expensive function evaluations. Experimental results indicate that KANs demonstrate commendable performance within SAEAs, effectively decreasing the number of function calls and enhancing the optimization efficiency. The relevant code is publicly accessible and can be found in the GitHub repository.

5/28/2024

cs.NE

Towards evolution of Deep Neural Networks through contrastive Self-Supervised learning

Adriano Vinhas, João Correia, Penousal Machado

Deep Neural Networks (DNNs) have been successfully applied to a wide range of problems. However, two main limitations are commonly pointed out. The first one is that they require a long time to design. The other is that they heavily rely on labelled data, which can sometimes be costly and hard to obtain. In order to address the first problem, neuroevolution has proven to be a plausible option to automate the design of DNNs. As for the second problem, self-supervised learning has been used to leverage unlabelled data to learn representations. Our goal is to study how neuroevolution can help self-supervised learning to bridge the gap to supervised learning in terms of performance. In this work, we propose a framework that is able to evolve deep neural networks using self-supervised learning. Our results on the CIFAR-10 dataset show that it is possible to evolve adequate neural networks while reducing the reliance on labelled data. Moreover, an analysis of the structure of the evolved networks suggests that the amount of labelled data fed to them has less effect on the structure of networks that learned via self-supervised learning, when compared to individuals that relied on supervised learning.

6/21/2024

cs.NE cs.AI cs.LG

Neuroevolving Electronic Dynamical Networks

Derek Whitley

Neuroevolution is a powerful method of applying an evolutionary algorithm to refine the performance of artificial neural networks through natural selection; however, the fitness evaluation of these networks can be time-consuming and computationally expensive, particularly for continuous time recurrent neural networks (CTRNNs) that necessitate the simulation of differential equations. To overcome this challenge, field programmable gate arrays (FPGAs) have emerged as an increasingly popular solution, due to their high performance and low power consumption. Further, their ability to undergo dynamic and partial reconfiguration enables the extremely rapid evaluation of the fitness of CTRNNs, effectively addressing the bottleneck associated with conventional methods. By incorporating fitness evaluation directly upon the programmable logic of the FPGA, hyper-parallel evaluation becomes feasible, dramatically reducing the time required for assessment. This inherent parallelism of FPGAs accelerates the entire neuroevolutionary process by several orders of magnitude, facilitating faster convergence to an optimal solution. The work presented in this study demonstrates the potential of utilizing dynamic and partial reconfiguration on capable FPGAs as a powerful platform for neuroevolving dynamic neural networks.

4/9/2024

cs.NE cs.AI cs.AR