CALRec: Contrastive Alignment of Generative LLMs For Sequential Recommendation

2405.02429

Published 5/7/2024 by Yaoyiran Li, Xiang Zhai, Moustafa Alzantot, Keyi Yu, Ivan Vuli'c, Anna Korhonen, Mohamed Hammad

CALRec: Contrastive Alignment of Generative LLMs For Sequential Recommendation

Abstract

Traditional recommender systems such as matrix factorization methods rely on learning a shared dense embedding space to represent both items and user preferences. Sequence models such as RNN, GRUs, and, recently, Transformers have also excelled in the task of sequential recommendation. This task requires understanding the sequential structure present in users' historical interactions to predict the next item they may like. Building upon the success of Large Language Models (LLMs) in a variety of tasks, researchers have recently explored using LLMs that are pretrained on vast corpora of text for sequential recommendation. To use LLMs in sequential recommendations, both the history of user interactions and the model's prediction of the next item are expressed in text form. We propose CALRec, a two-stage LLM finetuning framework that finetunes a pretrained LLM in a two-tower fashion using a mixture of two contrastive losses and a language modeling loss: the LLM is first finetuned on a data mixture from multiple domains followed by another round of target domain finetuning. Our model significantly outperforms many state-of-the-art baselines (+37% in Recall@1 and +24% in NDCG@10) and systematic ablation studies reveal that (i) both stages of finetuning are crucial, and, when combined, we achieve improved performance, and (ii) contrastive alignment is effective among the target domains explored in our experiments.

Get summaries of the top AI research delivered straight to your inbox:

Overview

Contrastive Alignment of Generative LLMs For Sequential Recommendation (CALRec): A novel approach to improve sequential recommendation using large language models (LLMs)
Combines contrastive learning and domain-specific fine-tuning to enhance the performance of LLMs for sequential recommendation tasks
Aims to better capture user preferences and item dependencies through the learned representations

Plain English Explanation

In the age of large language models (LLMs), researchers have been exploring ways to leverage these powerful models for various applications, including sequential recommendation. The paper "CALRec: Contrastive Alignment of Generative LLMs For Sequential Recommendation" introduces a novel approach to improve the performance of LLMs in sequential recommendation tasks.

The key idea is to combine two powerful techniques: contrastive learning and domain-specific fine-tuning. Contrastive learning is a way of training machine learning models to learn useful representations by identifying similarities and differences between data points. By applying this approach to LLMs, the researchers aim to help the models better capture the temporal awareness and user preferences that are crucial for effective sequential recommendation.

The domain-specific fine-tuning step then further adapts the LLM to the specific task of sequential recommendation, drawing on the model's inherent language understanding capabilities to enhance its performance.

Through this combined approach, the researchers hope to create LLMs that can provide more personalized and dynamic recommendations to users, better reflecting their evolving interests and preferences over time.

Technical Explanation

The CALRec framework consists of two main components:

Contrastive Alignment: The researchers first apply contrastive learning to the LLM, training it to learn representations that capture the relationships between successive items in a user's interaction sequence. This helps the model better understand the temporal dynamics and dependencies within the recommendation data.
Domain-specific Fine-tuning: After the contrastive alignment step, the LLM is further fine-tuned on the specific task of sequential recommendation. This allows the model to adapt its language understanding capabilities to the nuances of the recommendation domain, improving its performance on this task.

The researchers evaluate the CALRec framework on several benchmark datasets and compare its performance to state-of-the-art sequential recommendation methods. The results demonstrate that the combined approach of contrastive alignment and domain-specific fine-tuning leads to significant improvements in recommendation accuracy, particularly in capturing long-term user preferences and item dependencies.

Critical Analysis

The CALRec framework presents a promising approach to leveraging the power of LLMs for sequential recommendation tasks. By incorporating contrastive learning, the model is able to better capture the temporal and contextual relationships within user interaction data, which is a crucial aspect of effective sequential recommendation.

However, the paper does not address potential limitations or challenges that may arise in real-world deployment scenarios. For example, the performance of the model may be sensitive to the quality and diversity of the training data, and the researchers do not discuss how the framework might handle sparse or noisy data.

Additionally, while the domain-specific fine-tuning step is intended to adapt the LLM to the recommendation task, the researchers do not provide a detailed analysis of the specific changes or adaptations made to the model during this process. Further insights into these adaptations could help inform future research in this area.

It would also be valuable to explore the generalizability of the CALRec framework to other types of recommendation tasks, such as personalized recommendation or cross-domain recommendation, where the ability to capture user preferences and item dependencies may be equally important.

Conclusion

The "CALRec: Contrastive Alignment of Generative LLMs For Sequential Recommendation" paper presents a novel approach to leveraging the power of large language models for the task of sequential recommendation. By combining contrastive learning and domain-specific fine-tuning, the researchers have demonstrated significant improvements in recommendation accuracy, particularly in capturing long-term user preferences and item dependencies.

This work highlights the potential of LLMs to serve as powerful foundations for building advanced recommendation systems, providing more personalized and dynamic recommendations to users. As the field of recommender systems continues to evolve, approaches like CALRec may play an increasingly important role in delivering engaging and relevant experiences to users across a wide range of applications.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

💬

Adapting Large Language Models by Integrating Collaborative Semantics for Recommendation

Bowen Zheng, Yupeng Hou, Hongyu Lu, Yu Chen, Wayne Xin Zhao, Ming Chen, Ji-Rong Wen

Recently, large language models (LLMs) have shown great potential in recommender systems, either improving existing recommendation models or serving as the backbone. However, there exists a large semantic gap between LLMs and recommender systems, since items to be recommended are often indexed by discrete identifiers (item ID) out of the LLM's vocabulary. In essence, LLMs capture language semantics while recommender systems imply collaborative semantics, making it difficult to sufficiently leverage the model capacity of LLMs for recommendation. To address this challenge, in this paper, we propose a new LLM-based recommendation model called LC-Rec, which can better integrate language and collaborative semantics for recommender systems. Our approach can directly generate items from the entire item set for recommendation, without relying on candidate items. Specifically, we make two major contributions in our approach. For item indexing, we design a learning-based vector quantization method with uniform semantic mapping, which can assign meaningful and non-conflicting IDs (called item indices) for items. For alignment tuning, we propose a series of specially designed tuning tasks to enhance the integration of collaborative semantics in LLMs. Our fine-tuning tasks enforce LLMs to deeply integrate language and collaborative semantics (characterized by the learned item indices), so as to achieve an effective adaptation to recommender systems. Extensive experiments demonstrate the effectiveness of our method, showing that our approach can outperform a number of competitive baselines including traditional recommenders and existing LLM-based recommenders. Our code is available at https://github.com/RUCAIBox/LC-Rec/.

4/22/2024

cs.IR

Reformulating Sequential Recommendation: Learning Dynamic User Interest with Content-enriched Language Modeling

Junzhe Jiang, Shang Qu, Mingyue Cheng, Qi Liu, Zhiding Liu, Hao Zhang, Rujiao Zhang, Kai Zhang, Rui Li, Jiatong Li, Min Gao

Recommender systems are indispensable in the realm of online applications, and sequential recommendation has enjoyed considerable prevalence due to its capacity to encapsulate the dynamic shifts in user interests. However, previous sequential modeling methods still have limitations in capturing contextual information. The primary reason is the lack of understanding of domain-specific knowledge and item-related textual content. Fortunately, the emergence of powerful language models has unlocked the potential to incorporate extensive world knowledge into recommendation algorithms, enabling them to go beyond simple item attributes and truly understand the world surrounding user preferences. To achieve this, we propose LANCER, which leverages the semantic understanding capabilities of pre-trained language models to generate personalized recommendations. Our approach bridges the gap between language models and recommender systems, resulting in more human-like recommendations. We demonstrate the effectiveness of our approach through a series of experiments conducted on multiple benchmark datasets, showing promising results and providing valuable insights into the influence of our model on sequential recommendation tasks. Furthermore, our experimental codes are publicly available at https://github.com/Gnimixy/lancer.

4/16/2024

cs.IR cs.CL

Recommender Systems in the Era of Large Language Models (LLMs)

Zihuai Zhao, Wenqi Fan, Jiatong Li, Yunqing Liu, Xiaowei Mei, Yiqi Wang, Zhen Wen, Fei Wang, Xiangyu Zhao, Jiliang Tang, Qing Li

With the prosperity of e-commerce and web applications, Recommender Systems (RecSys) have become an important component of our daily life, providing personalized suggestions that cater to user preferences. While Deep Neural Networks (DNNs) have made significant advancements in enhancing recommender systems by modeling user-item interactions and incorporating textual side information, DNN-based methods still face limitations, such as difficulties in understanding users' interests and capturing textual side information, inabilities in generalizing to various recommendation scenarios and reasoning on their predictions, etc. Meanwhile, the emergence of Large Language Models (LLMs), such as ChatGPT and GPT4, has revolutionized the fields of Natural Language Processing (NLP) and Artificial Intelligence (AI), due to their remarkable abilities in fundamental responsibilities of language understanding and generation, as well as impressive generalization and reasoning capabilities. As a result, recent studies have attempted to harness the power of LLMs to enhance recommender systems. Given the rapid evolution of this research direction in recommender systems, there is a pressing need for a systematic overview that summarizes existing LLM-empowered recommender systems, to provide researchers in relevant fields with an in-depth understanding. Therefore, in this paper, we conduct a comprehensive review of LLM-empowered recommender systems from various aspects including Pre-training, Fine-tuning, and Prompting. More specifically, we first introduce representative methods to harness the power of LLMs (as a feature encoder) for learning representations of users and items. Then, we review recent techniques of LLMs for enhancing recommender systems from three paradigms, namely pre-training, fine-tuning, and prompting. Finally, we comprehensively discuss future directions in this emerging field.

4/23/2024

cs.IR cs.AI cs.CL

💬

LLM-Rec: Personalized Recommendation via Prompting Large Language Models

Hanjia Lyu, Song Jiang, Hanqing Zeng, Yinglong Xia, Qifan Wang, Si Zhang, Ren Chen, Christopher Leung, Jiajie Tang, Jiebo Luo

Text-based recommendation holds a wide range of practical applications due to its versatility, as textual descriptions can represent nearly any type of item. However, directly employing the original item descriptions may not yield optimal recommendation performance due to the lack of comprehensive information to align with user preferences. Recent advances in large language models (LLMs) have showcased their remarkable ability to harness commonsense knowledge and reasoning. In this study, we introduce a novel approach, coined LLM-Rec, which incorporates four distinct prompting strategies of text enrichment for improving personalized text-based recommendations. Our empirical experiments reveal that using LLM-augmented text significantly enhances recommendation quality. Even basic MLP (Multi-Layer Perceptron) models achieve comparable or even better results than complex content-based methods. Notably, the success of LLM-Rec lies in its prompting strategies, which effectively tap into the language model's comprehension of both general and specific item characteristics. This highlights the importance of employing diverse prompts and input augmentation techniques to boost the recommendation effectiveness of LLMs.

4/3/2024

cs.CL cs.AI cs.IR