Hyperparameter Importance Analysis for Multi-Objective AutoML

arXiv:2405.07640

Published 5/16/2024 by Daphne Theodorakopoulos, Frederic Stahl, Marius Lindauer

Abstract

Hyperparameter optimization plays a pivotal role in enhancing the predictive performance and generalization capabilities of ML models. However, in many applications, we do not only care about predictive performance but also about objectives such as inference time, memory, or energy consumption. In such MOO scenarios, determining the importance of hyperparameters poses a significant challenge due to the complex interplay between the conflicting objectives. In this paper, we propose the first method for assessing the importance of hyperparameters in the context of multi-objective hyperparameter optimization. Our approach leverages surrogate-based hyperparameter importance (HPI) measures, i.e. fANOVA and ablation paths, to provide insights into the impact of hyperparameters on the optimization objectives. Specifically, we compute the a-priori scalarization of the objectives and determine the importance of the hyperparameters for different objective tradeoffs. Through extensive empirical evaluations on diverse benchmark datasets with three different objectives paired with accuracy, namely time, demographic parity, and energy consumption, we demonstrate the effectiveness and robustness of our proposed method. Our findings not only offer valuable guidance for hyperparameter tuning in MOO tasks but also contribute to advancing the understanding of HPI in complex optimization scenarios.

Overview

Hyperparameter optimization is crucial for enhancing the performance and generalization of machine learning (ML) models.
In many real-world applications, we care about not just predictive accuracy but also other objectives like inference time, memory usage, or energy consumption.
When optimizing multiple, potentially conflicting objectives (multi-objective optimization, or MOO), determining the importance of hyperparameters becomes a significant challenge.
This paper proposes the first method for assessing hyperparameter importance in the context of multi-objective hyperparameter optimization.

Plain English Explanation

Hyperparameters are the settings we tweak when training a machine learning model, like the learning rate or the number of layers in a neural network. Choosing the right hyperparameters is crucial for getting a model to perform well and generalize to new data, although related work shows that hyperparameter optimization can even be harmful in some settings, such as off-policy learning.

However, in many real-world applications, we care about more than just how accurate the model is. We might also want it to be fast, compact, or energy-efficient. This creates a challenge, because optimizing for one of these objectives can hurt the others. Related work on leveraging AutoML for sustainable deep learning and on common pitfalls to avoid in multi-objective optimization is relevant here.

The paper proposes a new way to understand which hyperparameters are most important for each of these different objectives and for the tradeoffs between them. This can help guide hyperparameter tuning in multi-objective optimization tasks. Related ideas include enhancing multi-objective optimization through machine learning and a simpler, better importance metric.

Technical Explanation

The key innovation in this paper is a method for computing hyperparameter importance (HPI) measures in the context of multi-objective optimization. The authors leverage surrogate-based HPI approaches like fANOVA and ablation paths, which provide insights into how much each hyperparameter impacts the different optimization objectives.

Specifically, the authors first compute a scalar (single-valued) version of the multiple objectives by applying a scalarization function. They then use the fANOVA and ablation path techniques to determine which hyperparameters are most important for this scalarized objective.
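To make the scalarize-then-analyze step concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. It assumes a weighted-sum scalarization over min-max normalized objectives (the summary does not say which scalarization function the paper uses), and it replaces the surrogate-based fANOVA with a simplified first-order variance decomposition on a small, fully enumerated grid. The toy objective values and the names `scalarize` and `main_effect_importance` are hypothetical.

```python
from statistics import fmean, pvariance


def scalarize(objective_rows, weights):
    """Weighted sum of min-max normalized objectives (all to be minimized)."""
    normalized_columns = []
    for column in zip(*objective_rows):
        lowest, highest = min(column), max(column)
        span = (highest - lowest) or 1.0  # constant objective: avoid dividing by zero
        normalized_columns.append([(value - lowest) / span for value in column])
    return [
        sum(w * value for w, value in zip(weights, row))
        for row in zip(*normalized_columns)
    ]


def main_effect_importance(configs, costs):
    """Share of the cost variance explained by each hyperparameter on its own.

    Simplified stand-in for first-order fANOVA; only valid on a full grid
    where every value of a hyperparameter appears equally often.
    """
    total_variance = pvariance(costs)
    importances = []
    for index in range(len(configs[0])):
        groups = {}
        for config, cost in zip(configs, costs):
            groups.setdefault(config[index], []).append(cost)
        marginal = [fmean(group) for group in groups.values()]
        share = pvariance(marginal) / total_variance if total_variance > 0 else 0.0
        importances.append(share)
    return importances


# Hypothetical toy data: error depends only on the learning rate,
# training time depends only on the number of layers.
error_by_lr = {0.001: 0.30, 0.01: 0.10, 0.1: 0.20}
time_by_layers = {1: 1.0, 2: 2.0, 4: 4.0}
configs = [(lr, n) for lr in error_by_lr for n in time_by_layers]
objectives = [(error_by_lr[lr], time_by_layers[n]) for lr, n in configs]

importance_by_weight = {}
for w in (1.0, 0.5, 0.0):  # weight on error; the rest goes to time
    costs = scalarize(objectives, [w, 1.0 - w])
    importance_by_weight[w] = main_effect_importance(configs, costs)
    lr_share, layer_share = importance_by_weight[w]
    print(f"w_error={w:.1f}  learning_rate={lr_share:.2f}  n_layers={layer_share:.2f}")
```

In this toy setup the learning rate carries all the importance when only error counts, the number of layers carries all of it when only time counts, and the two split the variance for a mixed weighting. A real analysis would fit a surrogate model to observed configurations instead of enumerating a grid.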

By repeating this process for different scalarization functions, which encode different tradeoffs between the original objectives, the authors are able to get a comprehensive understanding of hyperparameter importance under various multi-objective scenarios.
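The sweep over tradeoffs can be illustrated with the other importance measure mentioned above, ablation paths. The sketch below is an assumption-laden toy, not the paper's code: `toy_objectives` is a hypothetical stand-in for a surrogate model, the scalarization is a plain weighted sum, and the incumbent for each weighting is found by brute force over a tiny search space. The ablation path starts at the default configuration and greedily switches one hyperparameter at a time to its incumbent value, recording how much each switch lowers the scalarized cost.

```python
from itertools import product

# Hypothetical search space and default configuration.
SPACE = {"learning_rate": [0.1, 0.01], "n_layers": [2, 4], "batch_size": [32, 256]}
DEFAULT = {"learning_rate": 0.1, "n_layers": 2, "batch_size": 32}


def toy_objectives(config):
    """Stand-in for a surrogate: returns (error, time), both to be minimized."""
    error = 0.40
    if config["learning_rate"] == 0.01:
        error -= 0.20
    if config["n_layers"] == 4:
        error -= 0.10
    time = 0.30
    if config["n_layers"] == 4:
        time += 0.30
    if config["batch_size"] == 256:
        time -= 0.20
    return error, time


def make_cost(weight):
    """Weighted-sum scalarization with `weight` on error and the rest on time."""
    def cost(config):
        error, time = toy_objectives(config)
        return weight * error + (1.0 - weight) * time
    return cost


def ablation_path(cost, default, incumbent):
    """Greedy path from default to incumbent as (hyperparameter, cost reduction) pairs."""
    current = dict(default)
    remaining = [name for name in default if default[name] != incumbent[name]]
    path = []
    while remaining:
        cost_before = cost(current)
        # Pick the single switch that gives the lowest cost from the current point.
        best = min(remaining, key=lambda name: cost({**current, name: incumbent[name]}))
        current[best] = incumbent[best]
        path.append((best, cost_before - cost(current)))
        remaining.remove(best)
    return path


candidates = [dict(zip(SPACE, values)) for values in product(*SPACE.values())]
paths = {}
for weight in (0.9, 0.6, 0.1):
    cost = make_cost(weight)
    incumbent = min(candidates, key=cost)
    paths[weight] = ablation_path(cost, DEFAULT, incumbent)
    print(weight, [(name, round(gain, 3)) for name, gain in paths[weight]])
```

With most of the weight on error, the learning rate comes first on the path; with most of the weight on time, the batch size does, and the number of layers drops off the path entirely because the incumbent keeps its default value. This is the kind of shift in importance across tradeoffs that repeating the analysis for several weightings is meant to expose.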

The authors evaluate their approach on several benchmark datasets, optimizing for accuracy along with three other objectives: inference time, demographic parity, and energy consumption. Their results demonstrate the effectiveness and robustness of their proposed method for providing valuable guidance on hyperparameter tuning in multi-objective optimization tasks.
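Of the three secondary objectives, demographic parity is probably the least familiar. One common way to turn it into a number to minimize is the gap between groups in the rate of positive predictions. The sketch below uses that definition as an assumption; the summary does not state which exact formulation the authors used, and the function name and data are illustrative only.

```python
def demographic_parity_difference(predictions, groups):
    """Largest gap in positive-prediction rate between any two groups (0 means parity)."""
    outcomes_by_group = {}
    for prediction, group in zip(predictions, groups):
        outcomes_by_group.setdefault(group, []).append(prediction)
    rates = [sum(outcomes) / len(outcomes) for outcomes in outcomes_by_group.values()]
    return max(rates) - min(rates)


# Hypothetical binary predictions for two groups of four people each.
predictions = [1, 1, 0, 1, 0, 0, 0, 1]
groups = ["a", "a", "a", "a", "b", "b", "b", "b"]

gap = demographic_parity_difference(predictions, groups)
print(gap)  # group "a" rate 0.75, group "b" rate 0.25 -> 0.5
```

A value like this can be treated as a second objective alongside the error rate and passed through the same scalarization as any other cost.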

Critical Analysis

The authors acknowledge several limitations of their work. First, their method relies on the availability of accurate surrogate models, which may not always be easy to obtain. Additionally, the choice of scalarization function can significantly impact the resulting hyperparameter importance estimates, and the authors do not provide clear guidance on how to select the most appropriate scalarization function.

Furthermore, the authors' experiments are limited to a relatively small number of hyperparameters and objectives. It's unclear how well their approach would scale to settings with a larger number of hyperparameters or objectives, which are common in real-world applications.

Finally, the authors do not address the potential bias and fairness implications of their approach. By optimizing for objectives like demographic parity, they may be implicitly encoding certain societal biases into the hyperparameter importance estimates, which could have downstream consequences.

Despite these limitations, the authors' work represents an important step towards understanding hyperparameter importance in the context of multi-objective optimization. Further research is needed to address the challenges and expand the applicability of this approach to a wider range of practical scenarios.

Conclusion

This paper presents a novel method for assessing the importance of hyperparameters in multi-objective optimization (MOO) settings. By leveraging surrogate-based importance measures and scalarization techniques, the authors provide valuable insights into the complex interplay between hyperparameters and conflicting optimization objectives.

The authors' findings offer guidance for hyperparameter tuning in MOO tasks and contribute to advancing the understanding of hyperparameter importance in complex optimization scenarios. As machine learning models are increasingly deployed in real-world applications with multiple, often competing objectives, this work represents an important step towards more holistic hyperparameter optimization and model development.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Multi-Objective Hyperparameter Optimization in Machine Learning -- An Overview

Florian Karl, Tobias Pielok, Julia Moosbauer, Florian Pfisterer, Stefan Coors, Martin Binder, Lennart Schneider, Janek Thomas, Jakob Richter, Michel Lang, Eduardo C. Garrido-Merchán, Juergen Branke, Bernd Bischl

Hyperparameter optimization constitutes a large part of typical modern machine learning workflows. This arises from the fact that machine learning methods and corresponding preprocessing steps often only yield optimal performance when hyperparameters are properly tuned. But in many applications, we are not only interested in optimizing ML pipelines solely for predictive accuracy; additional metrics or constraints must be considered when determining an optimal configuration, resulting in a multi-objective optimization problem. This is often neglected in practice, due to a lack of knowledge and readily available software implementations for multi-objective hyperparameter optimization. In this work, we introduce the reader to the basics of multi-objective hyperparameter optimization and motivate its usefulness in applied ML. Furthermore, we provide an extensive survey of existing optimization strategies, both from the domain of evolutionary algorithms and Bayesian optimization. We illustrate the utility of MOO in several specific ML applications, considering objectives such as operating conditions, prediction time, sparseness, fairness, interpretability and robustness.

6/7/2024

cs.LG stat.ML

Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It

Yuta Saito, Masahiro Nomura

There has been a growing interest in off-policy evaluation in the literature such as recommender systems and personalized medicine. We have so far seen significant progress in developing estimators aimed at accurately estimating the effectiveness of counterfactual policies based on biased logged data. However, there are many cases where those estimators are used not only to evaluate the value of decision making policies but also to search for the best hyperparameters from a large candidate space. This work explores the latter hyperparameter optimization (HPO) task for off-policy learning. We empirically show that naively applying an unbiased estimator of the generalization performance as a surrogate objective in HPO can cause an unexpected failure, merely pursuing hyperparameters whose generalization performance is greatly overestimated. We then propose simple and computationally efficient corrections to the typical HPO procedure to deal with the aforementioned issues simultaneously. Empirical investigations demonstrate the effectiveness of our proposed HPO algorithm in situations where the typical procedure fails severely.

4/24/2024

cs.LG

Trajectory-Based Multi-Objective Hyperparameter Optimization for Model Retraining

Wenyu Wang, Zheyi Fan, Szu Hui Ng

Training machine learning models inherently involves a resource-intensive and noisy iterative learning procedure that allows epoch-wise monitoring of the model performance. However, in multi-objective hyperparameter optimization scenarios, the insights gained from the iterative learning procedure typically remain underutilized. We notice that tracking the model performance across multiple epochs under a hyperparameter setting creates a trajectory in the objective space and that trade-offs along the trajectories are often overlooked despite their potential to offer valuable insights to decision-making for model retraining. Therefore, in this study, we propose to enhance the multi-objective hyperparameter optimization problem by having training epochs as an additional decision variable to incorporate trajectory information. Correspondingly, we present a novel trajectory-based multi-objective Bayesian optimization algorithm characterized by two features: 1) an acquisition function that captures the improvement made by the predictive trajectory of any hyperparameter setting and 2) a multi-objective early stopping mechanism that determines when to terminate the trajectory to maximize epoch efficiency. Numerical experiments on diverse synthetic simulations and hyperparameter tuning benchmarks indicate that our algorithm outperforms the state-of-the-art multi-objective optimizers in both locating better trade-offs and tuning efficiency.

5/27/2024

cs.LG

On the Hyperparameter Loss Landscapes of Machine Learning Models: An Exploratory Study

Mingyu Huang, Ke Li

Previous efforts on hyperparameter optimization (HPO) of machine learning (ML) models predominately focus on algorithmic advances, yet little is known about the topography of the underlying hyperparameter (HP) loss landscape, which plays a fundamental role in governing the search process of HPO. While several works have conducted fitness landscape analysis (FLA) on various ML systems, they are limited to properties of isolated landscape without interrogating the potential structural similarities among them. The exploration of such similarities can provide a novel perspective for understanding the mechanism behind modern HPO methods, but has been missing, possibly due to the expensive cost of large-scale landscape construction, and the lack of effective analysis methods. In this paper, we mapped 1,500 HP loss landscapes of 6 representative ML models on 63 datasets across different fidelity levels, with 11M+ configurations. By conducting exploratory analysis on these landscapes with fine-grained visualizations and dedicated FLA metrics, we observed a similar landscape topography across a wide range of models, datasets, and fidelities, and shed light on several central topics in HPO.

5/27/2024

cs.LG