Counteracting Duration Bias in Video Recommendation via Counterfactual Watch Time

Read original: arXiv:2406.07932 - Published 6/14/2024 by Haiyuan Zhao, Guohao Cai, Jieming Zhu, Zhenhua Dong, Jun Xu, Ji-Rong Wen

Counteracting Duration Bias in Video Recommendation via Counterfactual Watch Time

Overview

This paper addresses the problem of duration bias in video recommendation systems, where longer videos are often recommended over shorter ones due to the focus on maximizing total watch time.
The authors propose a novel approach called "Counteracting Duration Bias in Video Recommendation via Counterfactual Watch Time" (CDVR), which uses counterfactual modeling to estimate the potential watch time of a user for a given video, rather than relying solely on the actual watch time.
By incorporating this counterfactual watch time signal, the recommendation system can better balance the trade-off between user engagement and video duration, leading to more diverse and relevant recommendations.

Plain English Explanation

The paper tackles an issue called "duration bias" in video recommendation systems. Duration bias occurs when recommendation algorithms favor longer videos over shorter ones, as they aim to maximize the total amount of time users spend watching videos. This can lead to a skewed selection of content, where users may not see the most relevant or interesting videos.

To address this problem, the researchers developed a new approach called CDVR. Instead of just looking at how long users actually watched a video, CDVR uses a "counterfactual" model to estimate how long a user might have watched a video if it had been recommended. This counterfactual watch time signal is then incorporated into the recommendation algorithm, helping it to better balance user engagement and video duration.

By using this counterfactual approach, the recommendation system can make more balanced and relevant suggestions, providing users with a more diverse and engaging selection of videos. This could help improve the overall user experience and satisfaction with the video platform.

Technical Explanation

The key technical components of the CDVR approach are:

Counterfactual Watch Time Estimation: The authors develop a model to estimate the potential watch time of a user for a given video, based on factors such as the user's past engagement, the video's characteristics, and the user-video interaction. This counterfactual watch time signal is used in addition to the actual observed watch time.
Recommendation Model: The authors integrate the counterfactual watch time signal into a deep learning-based video recommendation model. This allows the model to better balance the trade-off between maximizing user engagement (as measured by the counterfactual watch time) and minimizing video duration bias.
Offline and Online Evaluation: The authors conduct both offline experiments on historical data and online A/B testing on a real-world video platform to evaluate the effectiveness of the CDVR approach. The results show that CDVR can significantly improve recommendation performance and user satisfaction compared to traditional duration-biased approaches.

Critical Analysis

The authors acknowledge some limitations of their work, including the potential for the counterfactual model to be biased or inaccurate, and the challenge of generalizing the approach to different video platforms or domains. Additionally, the paper does not address potential ethical concerns around the use of counterfactual modeling in recommendation systems, such as issues of fairness and transparency.

While the CDVR approach appears promising, further research could explore ways to enhance the counterfactual modeling, incorporate additional signals (e.g., causal contrastive learning, time-aware modeling), and address the ethical implications of such techniques in real-world video recommendation systems.

Conclusion

This paper presents a novel approach, CDVR, to counteract duration bias in video recommendation systems. By incorporating counterfactual watch time estimation, the recommendation model can better balance user engagement and video duration, leading to more diverse and relevant recommendations. The results demonstrate the effectiveness of this approach in improving recommendation performance and user satisfaction.

The CDVR technique represents an important step forward in addressing the challenges of duration bias in video recommendation, and its underlying principles may have broader applications in other time-series modeling tasks or areas of causal inference. As video platforms continue to play an increasingly central role in our digital lives, techniques like CDVR will become increasingly important in ensuring that recommendation systems serve users' interests and preferences in a fair and transparent manner.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Counteracting Duration Bias in Video Recommendation via Counterfactual Watch Time

Haiyuan Zhao, Guohao Cai, Jieming Zhu, Zhenhua Dong, Jun Xu, Ji-Rong Wen

In video recommendation, an ongoing effort is to satisfy users' personalized information needs by leveraging their logged watch time. However, watch time prediction suffers from duration bias, hindering its ability to reflect users' interests accurately. Existing label-correction approaches attempt to uncover user interests through grouping and normalizing observed watch time according to video duration. Although effective to some extent, we found that these approaches regard completely played records (i.e., a user watches the entire video) as equally high interest, which deviates from what we observed on real datasets: users have varied explicit feedback proportion when completely playing videos. In this paper, we introduce the counterfactual watch time(CWT), the potential watch time a user would spend on the video if its duration is sufficiently long. Analysis shows that the duration bias is caused by the truncation of CWT due to the video duration limitation, which usually occurs on those completely played records. Besides, a Counterfactual Watch Model (CWM) is proposed, revealing that CWT equals the time users get the maximum benefit from video recommender systems. Moreover, a cost-based transform function is defined to transform the CWT into the estimation of user interest, and the model can be learned by optimizing a counterfactual likelihood function defined over observed user watch times. Extensive experiments on three real video recommendation datasets and online A/B testing demonstrated that CWM effectively enhanced video recommendation accuracy and counteracted the duration bias.

6/14/2024

$SWaT: Statistical Modeling of Video Watch Time through User Behavior Analysis$

SWaT: Statistical Modeling of Video Watch Time through User Behavior Analysis

Shentao Yang, Haichuan Yang, Linna Du, Adithya Ganesh, Bo Peng, Boying Liu, Serena Li, Ji Liu

The significance of estimating video watch time has been highlighted by the rising importance of (short) video recommendation, which has become a core product of mainstream social media platforms. Modeling video watch time, however, has been challenged by the complexity of user-video interaction, such as different user behavior modes in watching the recommended videos and varying watching probabilities over the video horizon. Despite the importance and challenges, existing literature on modeling video watch time mostly focuses on relatively black-box mechanical enhancement of the classical regression/classification losses, without factoring in user behavior in a principled manner. In this paper, we for the first time take on a user-centric perspective to model video watch time, from which we propose a white-box statistical framework that directly translates various user behavior assumptions in watching (short) videos into statistical watch time models. These behavior assumptions are portrayed by our domain knowledge on users' behavior modes in video watching. We further employ bucketization to cope with user's non-stationary watching probability over the video horizon, which additionally helps to respect the constraint of video length and facilitate the practical compatibility between the continuous regression event of watch time and other binary classification events. We test our models extensively on two public datasets, a large-scale offline industrial dataset, and an online A/B test on a short video platform with hundreds of millions of daily-active users. On all experiments, our models perform competitively against strong relevant baselines, demonstrating the efficacy of our user-centric perspective and proposed framework.

8/16/2024

Conditional Quantile Estimation for Uncertain Watch Time in Short-Video Recommendation

Chengzhi Lin, Shuchang Liu, Chuyuan Wang, Yongqi Liu

Accurately predicting watch time is crucial for optimizing recommendations and user experience in short video platforms. However, existing methods that estimate a single average watch time often fail to capture the inherent uncertainty and diversity in user engagement patterns. In this paper, we propose the Conditional Quantile Estimation (CQE) framework to model the entire conditional distribution of watch time. Using quantile regression, CQE characterizes the complex watch-time distribution for each user-video pair, providing a flexible and comprehensive approach to understanding user behavior. We further design multiple strategies to combine the quantile estimates, adapting to different recommendation scenarios and user preferences. Extensive offline experiments and online A/B tests demonstrate the superiority of CQE in watch time prediction and user engagement modeling. In particular, the online deployment of CQE in KuaiShow has led to significant improvements in key evaluation metrics, including active days, active users, engagement duration, and video view counts. These results highlight the practical impact of our proposed approach in enhancing the user experience and overall performance of the short video recommendation system. The code will be released after publication.

8/1/2024

Interest Clock: Time Perception in Real-Time Streaming Recommendation System

Yongchun Zhu, Jingwu Chen, Ling Chen, Yitan Li, Feng Zhang, Zuotao Liu

User preferences follow a dynamic pattern over a day, e.g., at 8 am, a user might prefer to read news, while at 8 pm, they might prefer to watch movies. Time modeling aims to enable recommendation systems to perceive time changes to capture users' dynamic preferences over time, which is an important and challenging problem in recommendation systems. Especially, streaming recommendation systems in the industry, with only available samples of the current moment, present greater challenges for time modeling. There is still a lack of effective time modeling methods for streaming recommendation systems. In this paper, we propose an effective and universal method Interest Clock to perceive time information in recommendation systems. Interest Clock first encodes users' time-aware preferences into a clock (hour-level personalized features) and then uses Gaussian distribution to smooth and aggregate them into the final interest clock embedding according to the current time for the final prediction. By arming base models with Interest Clock, we conduct online A/B tests, obtaining +0.509% and +0.758% improvements on user active days and app duration respectively. Besides, the extended offline experiments show improvements as well. Interest Clock has been deployed on Douyin Music App.

5/1/2024