Challenging the Myth of Graph Collaborative Filtering: a Reasoned and Reproducibility-driven Analysis

Read original: arXiv:2308.00404 - Published 5/28/2024 by Vito Walter Anelli, Daniele Malitesta, Claudio Pomo, Alejandro Bellog'in, Tommaso Di Noia, Eugenio Di Sciascio

🛠️

Overview

The paper focuses on the replicability of results from popular graph neural network-based (GNN) recommender system models.
It successfully replicates results from six GNN models (NGCF, DGCF, LightGCN, SGL, UltraGCN, and GFCF) on three benchmark datasets (Gowalla, Yelp 2018, and Amazon Book).
The paper also compares these GNN models with traditional collaborative filtering (CF) models and extends the analysis to two new datasets (Allrecipes and BookCrossing).
The goal is to identify the impact of dataset characteristics on recommendation accuracy and understand which models are influenced by intrinsic features in the dataset structure.

Plain English Explanation

The paper discusses the success of graph neural network-based models in recommender systems. These models effectively represent users and items as a graph, which allows them to capture the relationships between them.

However, the paper notes that many previous studies on these models have simply used results from baseline papers without verifying their validity for the specific configuration being analyzed. To address this issue, the researchers focused on replicating the results of six popular and recent GNN-based recommender models (NGCF, DGCF, LightGCN, SGL, UltraGCN, and GFCF) on three common benchmark datasets (Gowalla, Yelp 2018, and Amazon Book).

The researchers also compared the performance of these GNN models with traditional collaborative filtering (CF) models, which have historically performed well in offline evaluations. Furthermore, they extended the study to two new datasets (Allrecipes and BookCrossing) that have not been extensively studied in the existing literature.

The key insight from this research is that the performance of the GNN models can differ significantly between the benchmark datasets and the new datasets. This suggests that the characteristics of the dataset, such as the structure of the user-item interactions, can have a significant impact on the accuracy of the recommendations. By analyzing the information flow from users' neighborhoods, the researchers aimed to identify which models are influenced by the intrinsic features of the dataset.

Technical Explanation

The paper presents a code that successfully replicates the results of six popular and recent GNN-based recommender models: NGCF, DGCF, LightGCN, SGL, UltraGCN, and GFCF. The experiments were conducted on three common benchmark datasets: Gowalla, Yelp 2018, and Amazon Book.

In addition to the GNN models, the researchers also evaluated the performance of traditional collaborative filtering (CF) models, which have historically performed well in offline evaluations. Furthermore, the study was extended to two new datasets, Allrecipes and BookCrossing, which have not been extensively explored in the existing literature.

By investigating the information flow from users' neighborhoods, the researchers aimed to identify which models are influenced by the intrinsic features of the dataset structure. The results showed that the performance of the GNN models can vary significantly between the benchmark datasets and the new datasets, suggesting that the characteristics of the dataset have a significant impact on the accuracy of the recommendations.

Critical Analysis

The paper makes a valuable contribution by focusing on the replicability of results from popular GNN-based recommender system models. It is important to verify the validity of results across different configurations and datasets, as many previous studies have simply adopted findings from baseline papers without further validation.

One potential limitation of the study is that it only considers a limited set of GNN-based models and benchmark datasets. While the researchers did extend the analysis to two new datasets, there may be other models and datasets that could provide additional insights.

Additionally, the paper does not delve deeply into the specific characteristics of the datasets that influence the performance of the different models. Further analysis of the dataset features, such as the sparsity of user-item interactions, the distribution of ratings, or the presence of cold-start users or items, could help to better understand the factors that contribute to the observed differences in model performance.

Overall, the paper presents a valuable contribution to the field of recommender systems by highlighting the importance of replicability and the impact of dataset characteristics on the performance of GNN-based models. The code made available by the researchers is a valuable resource for the community and can serve as a foundation for further exploration and validation of these models.

Conclusion

The success of graph neural network-based models has significantly advanced recommender systems, but the replicability of results from these models has been a concern. This paper addresses this issue by successfully replicating the results of six popular GNN-based recommender models on three benchmark datasets and comparing their performance to traditional collaborative filtering models.

The key finding is that the performance of the GNN models can vary significantly between the benchmark datasets and new, less-studied datasets. This suggests that the characteristics of the dataset, such as the structure of user-item interactions, can have a significant impact on the accuracy of the recommendations. By analyzing the information flow from users' neighborhoods, the researchers aimed to identify which models are influenced by the intrinsic features of the dataset.

This research highlights the importance of verifying the validity of results across different configurations and datasets, and it provides a valuable resource for the recommender systems community in the form of the publicly available code. As the field continues to evolve, further exploration of dataset characteristics and their impact on model performance will be crucial for developing more robust and reliable recommender systems.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🛠️

Challenging the Myth of Graph Collaborative Filtering: a Reasoned and Reproducibility-driven Analysis

Vito Walter Anelli, Daniele Malitesta, Claudio Pomo, Alejandro Bellog'in, Tommaso Di Noia, Eugenio Di Sciascio

The success of graph neural network-based models (GNNs) has significantly advanced recommender systems by effectively modeling users and items as a bipartite, undirected graph. However, many original graph-based works often adopt results from baseline papers without verifying their validity for the specific configuration under analysis. Our work addresses this issue by focusing on the replicability of results. We present a code that successfully replicates results from six popular and recent graph recommendation models (NGCF, DGCF, LightGCN, SGL, UltraGCN, and GFCF) on three common benchmark datasets (Gowalla, Yelp 2018, and Amazon Book). Additionally, we compare these graph models with traditional collaborative filtering models that historically performed well in offline evaluations. Furthermore, we extend our study to two new datasets (Allrecipes and BookCrossing) that lack established setups in existing literature. As the performance on these datasets differs from the previous benchmarks, we analyze the impact of specific dataset characteristics on recommendation accuracy. By investigating the information flow from users' neighborhoods, we aim to identify which models are influenced by intrinsic features in the dataset structure. The code to reproduce our experiments is available at: https://github.com/sisinflab/Graph-RSs-Reproducibility.

5/28/2024

Cluster-based Graph Collaborative Filtering

Fan Liu, Shuai Zhao, Zhiyong Cheng, Liqiang Nie, Mohan Kankanhalli

Graph Convolution Networks (GCNs) have significantly succeeded in learning user and item representations for recommendation systems. The core of their efficacy is the ability to explicitly exploit the collaborative signals from both the first- and high-order neighboring nodes. However, most existing GCN-based methods overlook the multiple interests of users while performing high-order graph convolution. Thus, the noisy information from unreliable neighbor nodes (e.g., users with dissimilar interests) negatively impacts the representation learning of the target node. Additionally, conducting graph convolution operations without differentiating high-order neighbors suffers the over-smoothing issue when stacking more layers, resulting in performance degradation. In this paper, we aim to capture more valuable information from high-order neighboring nodes while avoiding noise for better representation learning of the target node. To achieve this goal, we propose a novel GCN-based recommendation model, termed Cluster-based Graph Collaborative Filtering (ClusterGCF). This model performs high-order graph convolution on cluster-specific graphs, which are constructed by capturing the multiple interests of users and identifying the common interests among them. Specifically, we design an unsupervised and optimizable soft node clustering approach to classify user and item nodes into multiple clusters. Based on the soft node clustering results and the topology of the user-item interaction graph, we assign the nodes with probabilities for different clusters to construct the cluster-specific graphs. To evaluate the effectiveness of ClusterGCF, we conducted extensive experiments on four publicly available datasets. Experimental results demonstrate that our model can significantly improve recommendation performance.

4/17/2024

Neural Causal Graph Collaborative Filtering

Xiangmeng Wang, Qian Li, Dianer Yu, Wei Huang, Guandong Xu

Graph collaborative filtering (GCF) has gained considerable attention in recommendation systems by leveraging graph learning techniques to enhance collaborative filtering (CF). One classical approach in GCF is to learn user and item embeddings with Graph Convolutional Network (GCN) and utilize these embeddings for CF models. However, existing GCN-based methods are insufficient in generating satisfactory embeddings for CF models. This is because they fail to model complex node dependencies and variable relation dependencies from a given graph, making the learned embeddings fragile to uncover the root causes of user interests. In this work, we propose to integrate causal modeling with the learning process of GCN-based GCF models, leveraging causality-aware graph embeddings to capture complex causal relations in recommendations. We complete the task by 1) Causal Graph conceptualization, 2) Neural Causal Model parameterization and 3) Variational inference for Neural Causal Model. Our Neural Causal Model, called Neural Causal Graph Collaborative Filtering (NCGCF), enables causal modeling for GCN-based GCF to facilitate accurate recommendations. Extensive experiments show that NCGCF provides precise recommendations that align with user preferences. We release our code and processed datasets at https://github.com/Chrystalii/CNGCF.

4/9/2024

➖

A Novel Evaluation Perspective on GNNs-based Recommender Systems through the Topology of the User-Item Graph

Daniele Malitesta, Claudio Pomo, Vito Walter Anelli, Alberto Carlo Maria Mancino, Tommaso Di Noia, Eugenio Di Sciascio

Recently, graph neural networks (GNNs)-based recommender systems have encountered great success in recommendation. As the number of GNNs approaches rises, some works have started questioning the theoretical and empirical reasons behind their superior performance. Nevertheless, this investigation still disregards that GNNs treat the recommendation data as a topological graph structure. Building on this assumption, in this work, we provide a novel evaluation perspective on GNNs-based recommendation, which investigates the impact of the graph topology on the recommendation performance. To this end, we select some (topological) properties of the recommendation data and three GNNs-based recommender systems (i.e., LightGCN, DGCF, and SVD-GCN). Then, starting from three popular recommendation datasets (i.e., Yelp2018, Gowalla, and Amazon-Book) we sample them to obtain 1,800 size-reduced datasets that still resemble the original ones but can encompass a wider range of topological structures. We use this procedure to build a large pool of samples for which data characteristics and recommendation performance of the selected GNNs models are measured. Through an explanatory framework, we find strong correspondences between graph topology and GNNs performance, offering a novel evaluation perspective on these models.

8/22/2024