Efficient Learning of Accurate Surrogates for Simulations of Complex Systems

2207.12855

Published 5/20/2024 by A. Diaw, M. McKerns, I. Sagert, L. G. Stanton, M. S. Murillo

🐍

Abstract

Machine learning methods are increasingly used to build computationally inexpensive surrogates for complex physical models. The predictive capability of these surrogates suffers when data are noisy, sparse, or time-dependent. As we are interested in finding a surrogate that provides valid predictions of any potential future model evaluations, we introduce an online learning method empowered by optimizer-driven sampling. The method has two advantages over current approaches. First, it ensures that all turning points on the model response surface are included in the training data. Second, after any new model evaluations, surrogates are tested and retrained (updated) if the score drops below a validity threshold. Tests on benchmark functions reveal that optimizer-directed sampling generally outperforms traditional sampling methods in terms of accuracy around local extrema, even when the scoring metric favors overall accuracy. We apply our method to simulations of nuclear matter to demonstrate that highly accurate surrogates for the nuclear equation of state can be reliably auto-generated from expensive calculations using a few model evaluations.

Create account to get full access

Overview

Machine learning is being used to create simplified models that can predict the outputs of complex physical simulations
These simplified models, called "surrogates", can make predictions much faster than the original simulations
However, the accuracy of these surrogates suffers when the data used to train them is noisy, sparse, or changes over time
This paper introduces a new method to train surrogates that addresses these issues by using optimization techniques to intelligently sample the input space and continuously update the surrogate model

Plain English Explanation

The paper describes a new technique for building machine learning models that can accurately predict the outputs of complex physical simulations. These predictive models, called "surrogates," are computationally much cheaper to run than the original simulations, which is useful when the simulations are slow or expensive.

However, surrogate models can struggle to make accurate predictions if the data used to train them is noisy, sparse, or changes over time. To address these issues, the researchers developed a method that uses optimization algorithms to intelligently choose which inputs to simulate and test, ensuring that the training data captures all the important features of the model's behavior, even around local extrema.

After each new simulation, the surrogate model is tested to make sure its predictions are still valid. If the accuracy drops below a certain threshold, the model is automatically updated and retrained. This continuous monitoring and updating helps the surrogate stay accurate even as the underlying simulation changes over time.

The researchers tested this method on some standard benchmark problems as well as a real-world simulation of nuclear matter. They found that the optimizer-guided sampling approach outperformed traditional sampling methods, producing highly accurate surrogates that could reliably predict the original simulations using just a few data points.

Technical Explanation

This paper introduces an "online learning" method for building surrogate models that can accurately predict the outputs of complex physical simulations. The key innovations are:

Optimizer-Driven Sampling: The method uses optimization algorithms to intelligently choose which input conditions to simulate and use for training the surrogate model. This ensures that the training data captures all the important features of the model's behavior, including local extrema.
Continuous Model Validation and Updating: After each new simulation is added to the training data, the surrogate model is tested to ensure its predictions are still valid. If the accuracy drops below a specified threshold, the model is automatically updated and retrained. This helps the surrogate stay accurate even as the underlying simulation changes over time.

The researchers tested this approach on both benchmark functions and a real-world simulation of the nuclear equation of state. They found that the optimizer-guided sampling generally outperformed traditional sampling methods, producing surrogates that were more accurate around local extrema, even when the evaluation metric favored overall accuracy.

Critical Analysis

The paper presents a compelling approach for building reliable surrogate models, particularly when dealing with complex, time-dependent physical simulations. The use of optimization-driven sampling to ensure coverage of important model features is a clever idea, and the continuous model validation and updating is an effective way to maintain accuracy.

That said, the paper does not address some potential limitations of the method. For example, the optimization process used to select training points could become computationally expensive for very high-dimensional input spaces. Additionally, the paper does not explore how the method might perform when dealing with discontinuities or other challenging features in the model response surface.

Further research could investigate strategies for scaling the optimization-based sampling to higher dimensions, as well as techniques for handling more complex model behaviors. Exploring ways to incorporate multi-fidelity information could also be a fruitful avenue for improving the surrogate modeling approach.

Overall, this paper represents an interesting and valuable contribution to the field of surrogate modeling, offering a novel approach that can produce highly accurate predictive models from limited data. The continuous validation and updating mechanism is a particularly compelling feature that could have wide-ranging applications.

Conclusion

This paper presents a new method for building accurate surrogate models of complex physical simulations. The key innovations are the use of optimization-driven sampling to ensure the training data captures important model features, and the continuous validation and updating of the surrogate to maintain accuracy over time.

Tests on benchmark problems and a nuclear matter simulation show that this approach can produce highly reliable surrogates using just a few model evaluations. This is a significant advance over traditional sampling methods, which can struggle to capture local extrema and other important details.

The ability to automatically generate accurate predictive models from expensive simulations has wide-ranging implications. It could enable faster design exploration, sensitivity analysis, and optimization in fields like engineering, materials science, and energy research. Further development and application of this technique could lead to substantial gains in computational efficiency and scientific insights.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Enhancing Multi-Objective Optimization through Machine Learning-Supported Multiphysics Simulation

Diego Botache, Jens Decke, Winfried Ripken, Abhinay Dornipati, Franz Gotz-Hahn, Mohamed Ayeb, Bernhard Sick

This paper presents a methodological framework for training, self-optimising, and self-organising surrogate models to approximate and speed up multiobjective optimisation of technical systems based on multiphysics simulations. At the hand of two real-world datasets, we illustrate that surrogate models can be trained on relatively small amounts of data to approximate the underlying simulations accurately. Including explainable AI techniques allow for highlighting feature relevancy or dependencies and supporting the possible extension of the used datasets. One of the datasets was created for this paper and is made publicly available for the broader scientific community. Extensive experiments combine four machine learning and deep learning algorithms with an evolutionary optimisation algorithm. The performance of the combined training and optimisation pipeline is evaluated by verifying the generated Pareto-optimal results using the ground truth simulations. The results from our pipeline and a comprehensive evaluation strategy show the potential for efficiently acquiring solution candidates in multiobjective optimisation tasks by reducing the number of simulations and conserving a higher prediction accuracy, i.e., with a MAPE score under 5% for one of the presented use cases.

4/4/2024

cs.LG

Adaptive Gradient Enhanced Gaussian Process Surrogates for Inverse Problems

Phillip Semler, Martin Weiser

Generating simulated training data needed for constructing sufficiently accurate surrogate models to be used for efficient optimization or parameter identification can incur a huge computational effort in the offline phase. We consider a fully adaptive greedy approach to the computational design of experiments problem using gradient-enhanced Gaussian process regression as surrogates. Designs are incrementally defined by solving an optimization problem for accuracy given a certain computational budget. We address not only the choice of evaluation points but also of required simulation accuracy, both of values and gradients of the forward model. Numerical results show a significant reduction of the computational effort compared to just position-adaptive and static designs as well as a clear benefit of including gradient information into the surrogate training.

4/3/2024

cs.NA

Simulating, Fast and Slow: Learning Policies for Black-Box Optimization

Fabio Valerio Massoli, Tim Bakker, Thomas Hehn, Tribhuvanesh Orekondy, Arash Behboodi

In recent years, solving optimization problems involving black-box simulators has become a point of focus for the machine learning community due to their ubiquity in science and engineering. The simulators describe a forward process $f_{mathrm{sim}}: (psi, x) rightarrow y$ from simulation parameters $psi$ and input data $x$ to observations $y$, and the goal of the optimization problem is to find parameters $psi$ that minimize a desired loss function. Sophisticated optimization algorithms typically require gradient information regarding the forward process, $f_{mathrm{sim}}$, with respect to the parameters $psi$. However, obtaining gradients from black-box simulators can often be prohibitively expensive or, in some cases, impossible. Furthermore, in many applications, practitioners aim to solve a set of related problems. Thus, starting the optimization ``ab initio, i.e. from scratch, each time might be inefficient if the forward model is expensive to evaluate. To address those challenges, this paper introduces a novel method for solving classes of similar black-box optimization problems by learning an active learning policy that guides a differentiable surrogate's training and uses the surrogate's gradients to optimize the simulation parameters with gradient descent. After training the policy, downstream optimization of problems involving black-box simulators requires up to $sim$90% fewer expensive simulator calls compared to baselines such as local surrogate-based approaches, numerical optimization, and Bayesian methods.

6/7/2024

cs.LG

A Framework for Strategic Discovery of Credible Neural Network Surrogate Models under Uncertainty

Pratyush Kumar Singh, Kathryn A. Farrell-Maupin, Danial Faghihi

The widespread integration of deep neural networks in developing data-driven surrogate models for high-fidelity simulations of complex physical systems highlights the critical necessity for robust uncertainty quantification techniques and credibility assessment methodologies, ensuring the reliable deployment of surrogate models in consequential decision-making. This study presents the Occam Plausibility Algorithm for surrogate models (OPAL-surrogate), providing a systematic framework to uncover predictive neural network-based surrogate models within the large space of potential models, including various neural network classes and choices of architecture and hyperparameters. The framework is grounded in hierarchical Bayesian inferences and employs model validation tests to evaluate the credibility and prediction reliability of the surrogate models under uncertainty. Leveraging these principles, OPAL-surrogate introduces a systematic and efficient strategy for balancing the trade-off between model complexity, accuracy, and prediction uncertainty. The effectiveness of OPAL-surrogate is demonstrated through two modeling problems, including the deformation of porous materials for building insulation and turbulent combustion flow for the ablation of solid fuels within hybrid rocket motors.

5/15/2024

cs.CE cs.LG