SWaT: Statistical Modeling of Video Watch Time through User Behavior Analysis

Read original: arXiv:2408.07759 - Published 8/16/2024 by Shentao Yang, Haichuan Yang, Linna Du, Adithya Ganesh, Bo Peng, Boying Liu, Serena Li, Ji Liu

$SWaT: Statistical Modeling of Video Watch Time through User Behavior Analysis$

Overview

The paper focuses on statistical modeling of video watch time through user behavior analysis.
It proposes a novel approach called SWaT to predict video watch time using various user features.
The model aims to help video platforms better understand user engagement and improve their recommendation systems.

Plain English Explanation

The paper looks at how users interact with online videos and how long they watch them. The researchers developed a new method called SWaT to predict how long a user will watch a video based on their behavior and other factors.

The goal is to help video platforms, like YouTube or Netflix, better understand how engaged their users are. This information can then be used to improve their video recommendation systems and provide a better experience for viewers. For example, if the model predicts a user is likely to stop watching a video after a few minutes, the platform could recommend a shorter video that the user may be more likely to watch all the way through.

Technical Explanation

The paper proposes the SWaT (Statistical Watch Time) model to predict a user's video watch time. The model uses various user features, such as their past viewing history, demographic information, and engagement signals like comments and likes.

The researchers collected a large dataset of user interactions with online videos and used statistical techniques like survival analysis to train the SWaT model. This allows the model to not only predict the total watch time, but also the probability of a user stopping watching at different points in the video.

The paper evaluates the SWaT model's performance on real-world video platform data and compares it to other watch time prediction approaches. The results show SWaT outperforms existing methods, demonstrating its effectiveness at modeling user engagement and video consumption patterns.

Critical Analysis

The paper provides a comprehensive and rigorous approach to predicting video watch time through user behavior analysis. The use of survival analysis techniques is a novel and well-justified approach to modeling the time-dependent nature of video engagement.

However, the paper does not address potential biases or limitations in the dataset used to train the model. The generalizability of the SWaT model to different video platforms or content types is also not explored.

Additionally, while the paper discusses the practical applications of watch time prediction for improving recommendation systems, it does not delve into the ethical implications of using such models, such as the potential for reinforcing user biases or influencing viewer behavior in undesirable ways.

Conclusion

The SWaT model proposed in this paper represents a significant advancement in understanding and predicting user engagement with online videos. By leveraging various user features and statistical techniques, the model can provide video platforms with valuable insights to enhance their recommendation systems and improve the user experience.

While the paper highlights the technical merits of the approach, further research is needed to address potential limitations and explore the broader implications of using such models in real-world video platforms.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

$SWaT: Statistical Modeling of Video Watch Time through User Behavior Analysis$

SWaT: Statistical Modeling of Video Watch Time through User Behavior Analysis

Shentao Yang, Haichuan Yang, Linna Du, Adithya Ganesh, Bo Peng, Boying Liu, Serena Li, Ji Liu

The significance of estimating video watch time has been highlighted by the rising importance of (short) video recommendation, which has become a core product of mainstream social media platforms. Modeling video watch time, however, has been challenged by the complexity of user-video interaction, such as different user behavior modes in watching the recommended videos and varying watching probabilities over the video horizon. Despite the importance and challenges, existing literature on modeling video watch time mostly focuses on relatively black-box mechanical enhancement of the classical regression/classification losses, without factoring in user behavior in a principled manner. In this paper, we for the first time take on a user-centric perspective to model video watch time, from which we propose a white-box statistical framework that directly translates various user behavior assumptions in watching (short) videos into statistical watch time models. These behavior assumptions are portrayed by our domain knowledge on users' behavior modes in video watching. We further employ bucketization to cope with user's non-stationary watching probability over the video horizon, which additionally helps to respect the constraint of video length and facilitate the practical compatibility between the continuous regression event of watch time and other binary classification events. We test our models extensively on two public datasets, a large-scale offline industrial dataset, and an online A/B test on a short video platform with hundreds of millions of daily-active users. On all experiments, our models perform competitively against strong relevant baselines, demonstrating the efficacy of our user-centric perspective and proposed framework.

8/16/2024

Counteracting Duration Bias in Video Recommendation via Counterfactual Watch Time

Haiyuan Zhao, Guohao Cai, Jieming Zhu, Zhenhua Dong, Jun Xu, Ji-Rong Wen

In video recommendation, an ongoing effort is to satisfy users' personalized information needs by leveraging their logged watch time. However, watch time prediction suffers from duration bias, hindering its ability to reflect users' interests accurately. Existing label-correction approaches attempt to uncover user interests through grouping and normalizing observed watch time according to video duration. Although effective to some extent, we found that these approaches regard completely played records (i.e., a user watches the entire video) as equally high interest, which deviates from what we observed on real datasets: users have varied explicit feedback proportion when completely playing videos. In this paper, we introduce the counterfactual watch time(CWT), the potential watch time a user would spend on the video if its duration is sufficiently long. Analysis shows that the duration bias is caused by the truncation of CWT due to the video duration limitation, which usually occurs on those completely played records. Besides, a Counterfactual Watch Model (CWM) is proposed, revealing that CWT equals the time users get the maximum benefit from video recommender systems. Moreover, a cost-based transform function is defined to transform the CWT into the estimation of user interest, and the model can be learned by optimizing a counterfactual likelihood function defined over observed user watch times. Extensive experiments on three real video recommendation datasets and online A/B testing demonstrated that CWM effectively enhanced video recommendation accuracy and counteracted the duration bias.

6/14/2024

Conditional Quantile Estimation for Uncertain Watch Time in Short-Video Recommendation

Chengzhi Lin, Shuchang Liu, Chuyuan Wang, Yongqi Liu

Accurately predicting watch time is crucial for optimizing recommendations and user experience in short video platforms. However, existing methods that estimate a single average watch time often fail to capture the inherent uncertainty and diversity in user engagement patterns. In this paper, we propose the Conditional Quantile Estimation (CQE) framework to model the entire conditional distribution of watch time. Using quantile regression, CQE characterizes the complex watch-time distribution for each user-video pair, providing a flexible and comprehensive approach to understanding user behavior. We further design multiple strategies to combine the quantile estimates, adapting to different recommendation scenarios and user preferences. Extensive offline experiments and online A/B tests demonstrate the superiority of CQE in watch time prediction and user engagement modeling. In particular, the online deployment of CQE in KuaiShow has led to significant improvements in key evaluation metrics, including active days, active users, engagement duration, and video view counts. These results highlight the practical impact of our proposed approach in enhancing the user experience and overall performance of the short video recommendation system. The code will be released after publication.

8/1/2024

Interest Clock: Time Perception in Real-Time Streaming Recommendation System

Yongchun Zhu, Jingwu Chen, Ling Chen, Yitan Li, Feng Zhang, Zuotao Liu

User preferences follow a dynamic pattern over a day, e.g., at 8 am, a user might prefer to read news, while at 8 pm, they might prefer to watch movies. Time modeling aims to enable recommendation systems to perceive time changes to capture users' dynamic preferences over time, which is an important and challenging problem in recommendation systems. Especially, streaming recommendation systems in the industry, with only available samples of the current moment, present greater challenges for time modeling. There is still a lack of effective time modeling methods for streaming recommendation systems. In this paper, we propose an effective and universal method Interest Clock to perceive time information in recommendation systems. Interest Clock first encodes users' time-aware preferences into a clock (hour-level personalized features) and then uses Gaussian distribution to smooth and aggregate them into the final interest clock embedding according to the current time for the final prediction. By arming base models with Interest Clock, we conduct online A/B tests, obtaining +0.509% and +0.758% improvements on user active days and app duration respectively. Besides, the extended offline experiments show improvements as well. Interest Clock has been deployed on Douyin Music App.

5/1/2024