Deep Ensemble Shape Calibration: Multi-Field Post-hoc Calibration in Online Advertising

Read original: arXiv:2401.09507 - Published 5/22/2024 by Shuai Yang, Hao Yang, Zhuang Zou, Linhe Xu, Shuo Yuan, Yifan Zeng

Overview

• This paper proposes a method called "Deep Ensemble Shape Calibration" for multi-field post-hoc calibration in online advertising.
• The method aims to improve the calibration of machine learning models used in online advertising by accounting for the effects of different feature fields on the model's output.
• The paper introduces a Basis Calibration Function that learns a field-aware attention mechanism to adjust the model's outputs for better calibration.

Plain English Explanation

Online advertising platforms use machine learning models to predict the likelihood that a user will engage with an ad. However, these models can sometimes be over-confident or under-confident in their predictions, which can negatively impact the effectiveness of the advertising campaigns.

The Deep Ensemble Shape Calibration method proposed in this paper aims to address this issue by improving the calibration of these models. Calibration refers to the alignment between a model's predicted probabilities and the actual observed probabilities.
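To make the definition of calibration concrete, here is a minimal sketch of one common way to measure it: the expected calibration error (ECE), which bins predictions and compares the mean predicted probability with the observed positive rate in each bin. This is a generic metric, not necessarily the one used in the paper; the function name and the choice of 10 equal-width bins are assumptions made for illustration.

```python
from typing import Sequence


def expected_calibration_error(
    probs: Sequence[float], labels: Sequence[int], n_bins: int = 10
) -> float:
    """Weighted average, over equal-width bins, of the absolute gap between
    the mean predicted probability and the observed positive rate."""
    if len(probs) != len(labels):
        raise ValueError("probs and labels must have the same length")
    if len(probs) == 0:
        return 0.0

    pred_sum = [0.0] * n_bins
    label_sum = [0.0] * n_bins
    count = [0] * n_bins
    for p, y in zip(probs, labels):
        # Predictions are assumed to lie in [0, 1]; p == 1.0 goes in the last bin.
        b = min(int(p * n_bins), n_bins - 1)
        pred_sum[b] += p
        label_sum[b] += y
        count[b] += 1

    total = len(probs)
    ece = 0.0
    for b in range(n_bins):
        if count[b] == 0:
            continue  # empty bins contribute nothing
        gap = abs(pred_sum[b] / count[b] - label_sum[b] / count[b])
        ece += (count[b] / total) * gap
    return ece


# Ten predictions of 0.9: nine positives is well calibrated, six is over-confident.
well = expected_calibration_error([0.9] * 10, [1] * 9 + [0])
over = expected_calibration_error([0.9] * 10, [1] * 6 + [0] * 4)
print(f"well-calibrated: {well:.3f}, over-confident: {over:.3f}")
```

In this toy data the first score is essentially zero and the second is about 0.3, the gap between the predicted 0.9 and the observed rate of 0.6.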

The key idea is that different features (or "fields") used by the model, such as user demographics, device information, or browsing history, can have varying effects on the model's outputs. The Basis Calibration Function learns to account for these field-specific effects, adjusting the model's outputs to be better calibrated.
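A small sketch can show why field-level effects matter: a model can look calibrated overall while being off within individual field values. The example below computes the ratio of predicted to observed clicks (often called PCOC) per field value. The function name, the "device" field, and the toy data are all hypothetical and chosen only for illustration; they do not come from the paper.

```python
from collections import defaultdict
from typing import Dict, Hashable, Sequence


def pcoc_by_field_value(
    probs: Sequence[float],
    labels: Sequence[int],
    field_values: Sequence[Hashable],
) -> Dict[Hashable, float]:
    """Ratio of summed predictions to summed positives for each field value.
    A ratio of 1.0 is calibrated; above 1.0 over-predicts, below 1.0 under-predicts."""
    totals = defaultdict(lambda: [0.0, 0.0])  # value -> [sum of preds, sum of labels]
    for p, y, v in zip(probs, labels, field_values):
        totals[v][0] += p
        totals[v][1] += y
    return {
        v: (pred / pos if pos > 0 else float("inf"))
        for v, (pred, pos) in totals.items()
    }


# Hypothetical data: every impression gets the same predicted CTR of 0.5.
probs = [0.5] * 8
labels = [1, 1, 1, 0, 1, 0, 0, 0]
device = ["mobile"] * 4 + ["desktop"] * 4

overall = pcoc_by_field_value(probs, labels, ["all"] * 8)["all"]
per_device = pcoc_by_field_value(probs, labels, device)
print(overall, per_device)
```

Here the overall ratio is 1.0, yet the model under-predicts for "mobile" and over-predicts for "desktop", which is the kind of per-field miscalibration that multi-field calibration targets.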

Imagine a used-car salesperson who tells every customer there's a 90% chance their car will last for 5 more years. If, in reality, only 60% of those cars last that long, the salesperson's predictions are over-confident. The Basis Calibration Function aims to prevent this kind of miscalibration in online advertising models.

By improving calibration, the Deep Ensemble Shape Calibration method can help advertisers make more informed decisions about ad placement and budget allocation, ultimately leading to more effective and efficient advertising campaigns.

Technical Explanation

The Deep Ensemble Shape Calibration method consists of two main components:

Field-aware Attention: The model learns a field-specific attention mechanism that adjusts the contribution of each feature field to the final prediction. This allows the calibration function to account for the varying effects of different fields on the model's outputs.
Basis Calibration Function: This is a neural network that learns to map the original model's outputs to better-calibrated probabilities. The Basis Calibration Function takes into account the field-specific attention weights to perform the calibration.
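The two components above can be sketched roughly as a weighted combination of simple calibration curves, with the weights depending on the field value. This is only an illustration under stated assumptions, not the authors' implementation: the particular basis functions (identity, power, logit shift), the softmax weighting, the hand-written scores, and every name below are hypothetical. In a real system the scores would be learned from data.

```python
import math
from typing import Callable, Dict, List


def _clip(p: float, eps: float = 1e-6) -> float:
    """Keep probabilities away from 0 and 1 so the logit stays finite."""
    return min(max(p, eps), 1.0 - eps)


def power(gamma: float) -> Callable[[float], float]:
    """Basis function p -> p ** gamma (monotone on [0, 1] for gamma > 0)."""
    return lambda p: _clip(p) ** gamma


def logit_shift(bias: float) -> Callable[[float], float]:
    """Basis function that adds a constant bias in logit space."""
    def f(p: float) -> float:
        p = _clip(p)
        z = math.log(p / (1.0 - p)) + bias
        return 1.0 / (1.0 + math.exp(-z))
    return f


# Hypothetical set of basis calibration functions, each mapping [0, 1] to [0, 1].
BASIS: List[Callable[[float], float]] = [
    lambda p: p,        # leave the prediction unchanged
    power(2.0),         # shrink predictions
    power(0.5),         # inflate predictions
    logit_shift(-1.0),  # shift down in logit space
]


def softmax(scores: List[float]) -> List[float]:
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def calibrate(p: float, field_value: str, scores: Dict[str, List[float]]) -> float:
    """Convex combination of the basis functions, weighted per field value.
    Unseen field values fall back to the uncalibrated prediction."""
    if field_value not in scores:
        return p
    weights = softmax(scores[field_value])
    return sum(w * f(p) for w, f in zip(weights, BASIS))


# Hand-written scores standing in for learned, field-specific weights.
scores = {"mobile": [0.0, 0.0, 2.0, 0.0], "desktop": [0.0, 2.0, 0.0, 0.0]}
print(calibrate(0.3, "mobile", scores), calibrate(0.3, "desktop", scores))
```

Because each basis function is monotone and maps into [0, 1], and the weights are non-negative and sum to one, the combined output is also a monotone map into [0, 1]. That keeps the ranking of predictions within a field value unchanged, which is a design choice of this sketch rather than a claim about the paper.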

The authors evaluate their method on public datasets as well as a large-scale industrial dataset, and report significant improvements over existing calibration methods on both. In online experiments, they observe a +2.5% increase in CVR and a +4.0% increase in GMV (Gross Merchandise Volume).

Critical Analysis

The Deep Ensemble Shape Calibration method provides a compelling approach to improving the calibration of machine learning models in online advertising. The field-aware attention mechanism and Basis Calibration Function are well-designed and effectively address the issue of heterogeneous feature effects on model outputs.

However, the paper does not discuss the potential computational overhead of the method, which may be a concern for real-time advertising applications. Additionally, the authors do not explore the interpretability of the field-specific attention weights, which could be valuable for understanding the model's decision-making process.

Furthermore, the paper focuses on post-hoc calibration, which may not be as effective as multi-calibration approaches that aim to produce well-calibrated outputs during the training process. Future research could investigate the potential benefits of incorporating multi-calibration techniques into the Deep Ensemble Shape Calibration framework.

Conclusion

The Deep Ensemble Shape Calibration method presented in this paper offers a promising solution for improving the calibration of machine learning models used in online advertising. By accounting for the heterogeneous effects of different feature fields, the method can help advertisers make more informed decisions and achieve more effective and efficient advertising campaigns.

While the paper provides a solid technical foundation, future research could explore the computational efficiency, interpretability, and the potential benefits of integrating multi-calibration techniques into the proposed framework. Overall, this work contributes to the ongoing efforts to enhance the reliability and trustworthiness of machine learning models in the high-stakes domain of online advertising.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Deep Ensemble Shape Calibration: Multi-Field Post-hoc Calibration in Online Advertising

Shuai Yang, Hao Yang, Zhuang Zou, Linhe Xu, Shuo Yuan, Yifan Zeng

In the e-commerce advertising scenario, estimating the true probabilities (known as a calibrated estimate) on Click-Through Rate (CTR) and Conversion Rate (CVR) is critical. Previous research has introduced numerous solutions for addressing the calibration problem. These methods typically involve the training of calibrators using a validation set and subsequently applying these calibrators to correct the original estimated values during online inference. However, what sets e-commerce advertising scenarios apart is the challenge of multi-field calibration. Multi-field calibration requires achieving calibration in each field. In order to achieve multi-field calibration, it is necessary to have a strong data utilization ability. Because the quantity of pCTR specified range for a single field-value (such as user ID and item ID) sample is relatively small, this makes the calibrator more difficult to train. However, existing methods have difficulty effectively addressing these issues. To solve these problems, we propose a new method named Deep Ensemble Shape Calibration (DESC). In terms of business understanding and interpretability, we decompose multi-field calibration into value calibration and shape calibration. We introduce innovative basis calibration functions, which enhance both function expression capabilities and data utilization by combining these basis calibration functions. A significant advancement lies in the development of an allocator capable of allocating the most suitable calibrators to different estimation error distributions within diverse fields and values. We achieve significant improvements in both public and industrial datasets. In online experiments, we observe a +2.5% increase in CVR and +4.0% in GMV (Gross Merchandise Volume). Our code is now available at: https://github.com/HaoYang0123/DESC.

5/22/2024

Confidence-Aware Multi-Field Model Calibration

Yuang Zhao, Chuhan Wu, Qinglin Jia, Hong Zhu, Jia Yan, Libin Zong, Linxuan Zhang, Zhenhua Dong, Muyu Zhang

Accurately predicting the probabilities of user feedback, such as clicks and conversions, is critical for advertisement ranking and bidding. However, there often exist unwanted mismatches between predicted probabilities and true likelihoods due to the rapid shift of data distributions and intrinsic model biases. Calibration aims to address this issue by post-processing model predictions, and field-aware calibration can adjust model output on different feature field values to satisfy fine-grained advertising demands. Unfortunately, the observed samples corresponding to certain field values can be seriously limited to make confident calibrations, which may yield bias amplification and online disturbance. In this paper, we propose a confidence-aware multi-field calibration method, which adaptively adjusts the calibration intensity based on confidence levels derived from sample statistics. It also utilizes multiple fields for joint model calibration according to their importance to mitigate the impact of data sparsity on a single field. Extensive offline and online experiments show the superiority of our method in boosting advertising performance and reducing prediction deviations.

5/22/2024

Posterior Probability Matters: Doubly-Adaptive Calibration for Neural Predictions in Online Advertising

Penghui Wei, Weimin Zhang, Ruijie Hou, Jinquan Liu, Shaoguo Liu, Liang Wang, Bo Zheng

Predicting user response probabilities is vital for ad ranking and bidding. We hope that predictive models can produce accurate probabilistic predictions that reflect true likelihoods. Calibration techniques aim to post-process model predictions to posterior probabilities. Field-level calibration -- which performs calibration w.r.t. a specific field value -- is fine-grained and more practical. In this paper we propose a doubly-adaptive approach AdaCalib. It learns an isotonic function family to calibrate model predictions with the guidance of posterior statistics, and field-adaptive mechanisms are designed to ensure that the posterior is appropriate for the field value to be calibrated. Experiments verify that AdaCalib achieves significant improvement on calibration performance. It has been deployed online and beats the previous approach.

5/28/2024

Calibration-then-Calculation: A Variance Reduced Metric Framework in Deep Click-Through Rate Prediction Models

Yewen Fan, Nian Si, Xiangchen Song, Kun Zhang

The adoption of deep learning across various fields has been extensive, yet there is a lack of focus on evaluating the performance of deep learning pipelines. Typically, with the increased use of large datasets and complex models, the training process is run only once and the result is compared to previous benchmarks. This practice can lead to imprecise comparisons due to the variance in neural network evaluation metrics, which stems from the inherent randomness in the training process. Traditional solutions, such as running the training process multiple times, are often infeasible due to computational constraints. In this paper, we introduce a novel metric framework, the Calibrated Loss Metric, designed to address this issue by reducing the variance present in its conventional counterpart. Consequently, this new metric enhances the accuracy in detecting effective modeling improvements. Our approach is substantiated by theoretical justifications and extensive experimental validations within the context of Deep Click-Through Rate Prediction Models.

5/21/2024