SoMeR: Multi-View User Representation Learning for Social Media

Read original: arXiv:2405.05275 - Published 5/10/2024 by Siyi Guo, Keith Burghardt, Valeria Pant`e, Kristina Lerman

SoMeR: Multi-View User Representation Learning for Social Media

Overview

This paper presents SoMeR, a multi-view user representation learning framework for social media
SoMeR jointly learns user representations from various data modalities on social media, including text, image, and social network structure
The authors demonstrate that SoMeR outperforms state-of-the-art methods on several social media analysis tasks, such as user profiling and recommendation

Plain English Explanation

SoMeR is a new approach for understanding users on social media platforms. Social media data contains a lot of different information about users, like the text they write, the images they share, and how they are connected to other users. SoMeR uses all of these different "views" of a user to learn a comprehensive representation, or summary, of that user. This representation can then be used for tasks like figuring out a user's interests and preferences, or recommending content they might like.

The key idea behind SoMeR is that by considering all the different types of information about a user, we can get a more complete and accurate understanding of who they are and what they care about. This is better than just looking at one type of data, like the text they write. The authors show that SoMeR outperforms other methods that don't take this multi-view approach, demonstrating the benefits of their technique.

Technical Explanation

The SoMeR: Multi-View User Representation Learning for Social Media paper presents a novel framework for learning comprehensive representations of social media users. The key innovation is the multi-view learning approach, which jointly optimizes user representations from text, image, and social network structure data.

Specifically, the authors first extract features from each data modality using pre-trained neural networks. They then use a multi-view fusion module to combine these features into a unified user representation. This fusion module learns to weight the different views based on their relevance for the target task, such as user profiling or recommendation.

The authors evaluate SoMeR on several benchmark social media datasets and show that it outperforms state-of-the-art methods across a range of tasks. For example, SoMeR achieves higher accuracy in predicting user demographics and interests compared to baselines that only use a single data view.

Critical Analysis

The SoMeR paper makes a compelling case for the benefits of multi-view user representation learning for social media analysis. By considering diverse data sources, the approach is able to capture richer user profiles that lead to performance gains on downstream tasks.

One limitation noted by the authors is the computational complexity of jointly optimizing representations from multiple modalities. This could be a challenge for real-world deployment, especially for large-scale social media platforms. The authors mention plans to explore more efficient architectures and training strategies in future work.

Additionally, while SoMeR demonstrated strong empirical results, the paper does not provide much insight into the relative importance of the different data views. It would be interesting to see ablation studies or interpretability analysis to understand which modalities are most crucial for different applications.

Overall, the SoMeR framework represents an important step forward in leveraging the richness of social media data for user understanding. As social platforms continue to evolve, techniques like this will become increasingly vital for powering personalized experiences and content recommendations.

Conclusion

The SoMeR: Multi-View User Representation Learning for Social Media paper introduces a novel approach for learning comprehensive representations of social media users. By jointly modeling text, image, and social network data, SoMeR is able to capture richer user profiles that lead to improved performance on tasks like user profiling and recommendation.

The multi-view learning framework represents an important advance in social media analytics, demonstrating the value of integrating diverse data sources to gain a more holistic understanding of online users. As social platforms continue to grow, techniques like SoMeR will become increasingly crucial for powering personalized experiences and understanding user preferences at scale.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

SoMeR: Multi-View User Representation Learning for Social Media

Siyi Guo, Keith Burghardt, Valeria Pant`e, Kristina Lerman

User representation learning aims to capture user preferences, interests, and behaviors in low-dimensional vector representations. These representations have widespread applications in recommendation systems and advertising; however, existing methods typically rely on specific features like text content, activity patterns, or platform metadata, failing to holistically model user behavior across different modalities. To address this limitation, we propose SoMeR, a Social Media user Representation learning framework that incorporates temporal activities, text content, profile information, and network interactions to learn comprehensive user portraits. SoMeR encodes user post streams as sequences of timestamped textual features, uses transformers to embed this along with profile data, and jointly trains with link prediction and contrastive learning objectives to capture user similarity. We demonstrate SoMeR's versatility through two applications: 1) Identifying inauthentic accounts involved in coordinated influence operations by detecting users posting similar content simultaneously, and 2) Measuring increased polarization in online discussions after major events by quantifying how users with different beliefs moved farther apart in the embedding space. SoMeR's ability to holistically model users enables new solutions to important problems around disinformation, societal tensions, and online behavior understanding.

5/10/2024

Do We Trust What They Say or What They Do? A Multimodal User Embedding Provides Personalized Explanations

Zhicheng Ren, Zhiping Xiao, Yizhou Sun

With the rapid development of social media, the importance of analyzing social network user data has also been put on the agenda. User representation learning in social media is a critical area of research, based on which we can conduct personalized content delivery, or detect malicious actors. Being more complicated than many other types of data, social network user data has inherent multimodal nature. Various multimodal approaches have been proposed to harness both text (i.e. post content) and relation (i.e. inter-user interaction) information to learn user embeddings of higher quality. The advent of Graph Neural Network models enables more end-to-end integration of user text embeddings and user interaction graphs in social networks. However, most of those approaches do not adequately elucidate which aspects of the data - text or graph structure information - are more helpful for predicting each specific user under a particular task, putting some burden on personalized downstream analysis and untrustworthy information filtering. We propose a simple yet effective framework called Contribution-Aware Multimodal User Embedding (CAMUE) for social networks. We have demonstrated with empirical evidence, that our approach can provide personalized explainable predictions, automatically mitigating the impact of unreliable information. We also conducted case studies to show how reasonable our results are. We observe that for most users, graph structure information is more trustworthy than text information, but there are some reasonable cases where text helps more. Our work paves the way for more explainable, reliable, and effective social media user embedding which allows for better personalized content delivery.

9/6/2024

⚙️

Scaling User Modeling: Large-scale Online User Representations for Ads Personalization in Meta

Wei Zhang, Dai Li, Chen Liang, Fang Zhou, Zhongke Zhang, Xuewei Wang, Ru Li, Yi Zhou, Yaning Huang, Dong Liang, Kai Wang, Zhangyuan Wang, Zhengxing Chen, Fenggang Wu, Minghai Chen, Huayu Li, Yunnan Wu, Zhan Shu, Mindi Yuan, Sri Reddy

Effective user representations are pivotal in personalized advertising. However, stringent constraints on training throughput, serving latency, and memory, often limit the complexity and input feature set of online ads ranking models. This challenge is magnified in extensive systems like Meta's, which encompass hundreds of models with diverse specifications, rendering the tailoring of user representation learning for each model impractical. To address these challenges, we present Scaling User Modeling (SUM), a framework widely deployed in Meta's ads ranking system, designed to facilitate efficient and scalable sharing of online user representation across hundreds of ads models. SUM leverages a few designated upstream user models to synthesize user embeddings from massive amounts of user features with advanced modeling techniques. These embeddings then serve as inputs to downstream online ads ranking models, promoting efficient representation sharing. To adapt to the dynamic nature of user features and ensure embedding freshness, we designed SUM Online Asynchronous Platform (SOAP), a latency free online serving system complemented with model freshness and embedding stabilization, which enables frequent user model updates and online inference of user embeddings upon each user request. We share our hands-on deployment experiences for the SUM framework and validate its superiority through comprehensive experiments. To date, SUM has been launched to hundreds of ads ranking models in Meta, processing hundreds of billions of user requests daily, yielding significant online metric gains and improved infrastructure efficiency.

5/24/2024

Async Learned User Embeddings for Ads Delivery Optimization

Mingwei Tang, Meng Liu, Hong Li, Junjie Yang, Chenglin Wei, Boyang Li, Dai Li, Rengan Xu, Yifan Xu, Zehua Zhang, Xiangyu Wang, Linfeng Liu, Yuelei Xie, Chengye Liu, Labib Fawaz, Li Li, Hongnan Wang, Bill Zhu, Sri Reddy

In recommendation systems, high-quality user embeddings can capture subtle preferences, enable precise similarity calculations, and adapt to changing preferences over time to maintain relevance. The effectiveness of recommendation systems depends on the quality of user embedding. We propose to asynchronously learn high fidelity user embeddings for billions of users each day from sequence based multimodal user activities through a Transformer-like large scale feature learning module. The async learned user representations embeddings (ALURE) are further converted to user similarity graphs through graph learning and then combined with user realtime activities to retrieval highly related ads candidates for the ads delivery system. Our method shows significant gains in both offline and online experiments.

6/26/2024