DisCo: Towards Harmonious Disentanglement and Collaboration between Tabular and Semantic Space for Recommendation

Read original: arXiv:2406.00011 - Published 6/5/2024 by Kounianhua Du, Jizheng Chen, Jianghao Lin, Yunjia Xi, Hangyu Wang, Xinyi Dai, Bo Chen, Ruiming Tang, Weinan Zhang

DisCo: Towards Harmonious Disentanglement and Collaboration between Tabular and Semantic Space for Recommendation

Overview

The paper proposes a novel recommendation framework called DisCo that aims to harmoniously disentangle and collaborate between tabular and semantic space for improved recommendation performance.
DisCo leverages both structured tabular data and unstructured semantic data to capture diverse user preferences and item characteristics, and then aligns these two representations to enhance the recommendation process.
The authors demonstrate the effectiveness of DisCo through extensive experiments on various recommendation tasks, showing its superiority over state-of-the-art baselines.

Plain English Explanation

Recommender systems are algorithms that suggest products or services that users might like, based on their past preferences and behaviors. These systems often rely on two main types of data: structured tabular data (like user ratings or item features) and unstructured semantic data (like product descriptions or user reviews).

The paper introduces a new recommender system framework called DisCo that aims to combine these two types of data in a more harmonious way. The key idea is to first "disentangle" the tabular and semantic data into separate representations, and then "collaborate" between these two representations to improve the final recommendations.

By disentangling the data, DisCo can capture diverse user preferences and item characteristics more effectively. And by collaborating between the tabular and semantic spaces, DisCo can leverage the strengths of both types of data to make better recommendations.

The researchers demonstrate the effectiveness of DisCo through experiments on various recommendation tasks, showing that it outperforms other state-of-the-art recommendation approaches. This suggests that harmoniously combining structured and unstructured data can lead to significant improvements in recommendation performance.

Technical Explanation

The paper proposes a framework called DisCo (Disentanglement and Collaboration) that aims to effectively leverage both tabular and semantic data for recommendation tasks. DisCo consists of two key components:

Disentanglement Module: This module takes in the tabular and semantic data separately and learns distinct representations for each. For the tabular data, it uses a multi-layer perceptron to encode user and item features. For the semantic data, it uses a pre-trained language model like BERT to extract contextual embeddings.
Collaboration Module: This module then aligns the tabular and semantic representations by learning a transformation matrix that maps them into a shared latent space. This allows the model to capture the interactions between the two data modalities and exploit their complementary strengths.

The final recommendation score is computed as a weighted sum of the tabular and semantic representations in the shared latent space. The authors train DisCo end-to-end using a combination of recommendation, disentanglement, and alignment losses.

The researchers evaluate DisCo on several benchmark recommendation datasets, comparing it to state-of-the-art baselines that use either tabular or semantic data alone, as well as other multi-modal approaches. The results demonstrate the superiority of DisCo, showcasing its ability to effectively leverage the complementary information from both data sources.

Critical Analysis

The paper presents a well-designed and thorough evaluation of the DisCo framework, considering a range of recommendation tasks and baselines. The authors also discuss several limitations and future research directions:

The current implementation of DisCo assumes the availability of both tabular and semantic data for all items. In real-world scenarios, this may not always be the case, and the model's performance may degrade when facing incomplete data.
The paper does not explore the role of the pre-trained language model used in the semantic encoding component. Investigating different architectures or fine-tuning strategies for this module could potentially further improve performance.
The authors mention that the collaboration module could be extended to handle more than two data modalities, but they do not provide any empirical results or analyses on the scalability of the framework.

Additionally, one could argue that the proposed approach is not entirely novel, as it builds upon the general idea of multi-modal recommendation, which has been explored in previous work. However, the specific design choices and the thorough empirical evaluation make DisCo a valuable contribution to the field.

Conclusion

The DisCo framework presented in this paper offers a promising approach to harmoniously leverage both tabular and semantic data for improved recommendation performance. By disentangling the two data modalities and then collaborating between their representations, the model can effectively capture diverse user preferences and item characteristics.

The strong empirical results demonstrate the potential of this approach, suggesting that future recommendation systems could benefit from similar techniques that seamlessly integrate structured and unstructured data sources. As the authors note, further research is needed to address the limitations and explore the scalability of the framework, but the core ideas presented in this paper represent an important step forward in the field of recommender systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

DisCo: Towards Harmonious Disentanglement and Collaboration between Tabular and Semantic Space for Recommendation

Kounianhua Du, Jizheng Chen, Jianghao Lin, Yunjia Xi, Hangyu Wang, Xinyi Dai, Bo Chen, Ruiming Tang, Weinan Zhang

Recommender systems play important roles in various applications such as e-commerce, social media, etc. Conventional recommendation methods usually model the collaborative signals within the tabular representation space. Despite the personalization modeling and the efficiency, the latent semantic dependencies are omitted. Methods that introduce semantics into recommendation then emerge, injecting knowledge from the semantic representation space where the general language understanding are compressed. However, existing semantic-enhanced recommendation methods focus on aligning the two spaces, during which the representations of the two spaces tend to get close while the unique patterns are discarded and not well explored. In this paper, we propose DisCo to Disentangle the unique patterns from the two representation spaces and Collaborate the two spaces for recommendation enhancement, where both the specificity and the consistency of the two spaces are captured. Concretely, we propose 1) a dual-side attentive network to capture the intra-domain patterns and the inter-domain patterns, 2) a sufficiency constraint to preserve the task-relevant information of each representation space and filter out the noise, and 3) a disentanglement constraint to avoid the model from discarding the unique information. These modules strike a balance between disentanglement and collaboration of the two representation spaces to produce informative pattern vectors, which could serve as extra features and be appended to arbitrary recommendation backbones for enhancement. Experiment results validate the superiority of our method against different models and the compatibility of DisCo over different backbones. Various ablation studies and efficiency analysis are also conducted to justify each model component.

6/5/2024

DaRec: A Disentangled Alignment Framework for Large Language Model and Recommender System

Xihong Yang, Heming Jing, Zixing Zhang, Jindong Wang, Huakang Niu, Shuaiqiang Wang, Yu Lu, Junfeng Wang, Dawei Yin, Xinwang Liu, En Zhu, Defu Lian, Erxue Min

Benefiting from the strong reasoning capabilities, Large language models (LLMs) have demonstrated remarkable performance in recommender systems. Various efforts have been made to distill knowledge from LLMs to enhance collaborative models, employing techniques like contrastive learning for representation alignment. In this work, we prove that directly aligning the representations of LLMs and collaborative models is sub-optimal for enhancing downstream recommendation tasks performance, based on the information theorem. Consequently, the challenge of effectively aligning semantic representations between collaborative models and LLMs remains unresolved. Inspired by this viewpoint, we propose a novel plug-and-play alignment framework for LLMs and collaborative models. Specifically, we first disentangle the latent representations of both LLMs and collaborative models into specific and shared components via projection layers and representation regularization. Subsequently, we perform both global and local structure alignment on the shared representations to facilitate knowledge transfer. Additionally, we theoretically prove that the specific and shared representations contain more pertinent and less irrelevant information, which can enhance the effectiveness of downstream recommendation tasks. Extensive experimental results on benchmark datasets demonstrate that our method is superior to existing state-of-the-art algorithms.

8/16/2024

Disentangling ID and Modality Effects for Session-based Recommendation

Xiaokun Zhang, Bo Xu, Zhaochun Ren, Xiaochen Wang, Hongfei Lin, Fenglong Ma

Session-based recommendation aims to predict intents of anonymous users based on their limited behaviors. Modeling user behaviors involves two distinct rationales: co-occurrence patterns reflected by item IDs, and fine-grained preferences represented by item modalities (e.g., text and images). However, existing methods typically entangle these causes, leading to their failure in achieving accurate and explainable recommendations. To this end, we propose a novel framework DIMO to disentangle the effects of ID and modality in the task. At the item level, we introduce a co-occurrence representation schema to explicitly incorporate cooccurrence patterns into ID representations. Simultaneously, DIMO aligns different modalities into a unified semantic space to represent them uniformly. At the session level, we present a multi-view self-supervised disentanglement, including proxy mechanism and counterfactual inference, to disentangle ID and modality effects without supervised signals. Leveraging these disentangled causes, DIMO provides recommendations via causal inference and further creates two templates for generating explanations. Extensive experiments on multiple real-world datasets demonstrate the consistent superiority of DIMO over existing methods. Further analysis also confirms DIMO's effectiveness in generating explanations.

4/22/2024

Extracting Essential and Disentangled Knowledge for Recommendation Enhancement

Kounianhua Du, Jizheng Chen, Jianghao Lin, Menghui Zhu, Bo Chen, Shuai Li, Ruiming Tang

Recommender models play a vital role in various industrial scenarios, while often faced with the catastrophic forgetting problem caused by the fast shifting data distribution, e.g., the evolving user interests, click signals fluctuation during sales promotions, etc. To alleviate this problem, a common approach is to reuse knowledge from the historical data. However, preserving the vast and fast-accumulating data is hard, which causes dramatic storage overhead. Memorizing old data through a parametric knowledge base is then proposed, which compresses the vast amount of raw data into model parameters. Despite the flexibility, how to improve the memorization and generalization capabilities of the parametric knowledge base is challenging. In this paper, we propose two constraints to extract Essential and Disentangled Knowledge from past data for rational and generalized recommendation enhancement, which improves the capabilities of the parametric knowledge base without increasing the size of it. The essential principle helps to compress the input into representative vectors that capture the task-relevant information and filter out the noisy information. The disentanglement principle reduces the redundancy of stored information and pushes the knowledge base to focus on capturing the disentangled invariant patterns. These two rules together promote rational compression of information for robust and generalized knowledge representations. Extensive experiments on two datasets justify the effectiveness of the proposed method.

6/4/2024