Networked Inequality: Preferential Attachment Bias in Graph Neural Network Link Prediction

Read original: arXiv:2309.17417 - Published 6/6/2024 by Arjun Subramonian, Levent Sagun, Yizhou Sun

🧠

Overview

This paper explores how the degree bias in networks affects Graph Convolutional Network (GCN) link prediction.
The researchers uncover that GCNs with a symmetric normalized graph filter have a within-group preferential attachment bias, which can lead to unfairness in link prediction.
They propose a new within-group fairness metric and a training-time strategy to alleviate this unfairness in citation, social, and credit networks.

Plain English Explanation

Graph neural networks (GNNs) are increasingly used in various applications like recommending academic papers, suggesting collaborators, and finding friends on social networks. Prior research has looked at fairness in GNN link prediction at the individual level, but the fairness within social groups (e.g., queer women) and the "rich get richer" dynamics have not been well explored.

This is important because these factors can contribute to imbalances in network power and influence. In this paper, the researchers shed light on how the inherent bias towards high-degree nodes in networks affects GCN, a type of GNN, and its link prediction performance.

They show that GCNs with a particular graph filter have a built-in preference for connecting to nodes with many existing connections. This can amplify the differences in the number of connections (degrees) between social groups, leading to unfairness.

To address this, the researchers propose a new metric to quantify the unfairness within social groups. They also develop a simple training technique to reduce this unfairness, which they demonstrate is effective on citation, social, and credit networks.

Technical Explanation

The researchers theoretically analyze Graph Convolutional Networks (GCNs) with a symmetric normalized graph filter and show that they exhibit a within-group preferential attachment bias. This means that GCNs tend to create more links between nodes that already have many connections, exacerbating power imbalances in the network.

To validate their theoretical findings, the researchers conduct experiments on real-world citation, collaboration, and online social networks. They find that the degree bias in these networks does indeed lead to unfairness in GCN link prediction, with certain social groups being disadvantaged.

To address this issue, the researchers propose a new within-group fairness metric that quantifies the disparities in link prediction scores within social groups. This metric can be used to evaluate and improve the fairness of GCN link prediction.

The researchers also introduce a simple training-time strategy to alleviate the within-group unfairness in GCN link prediction. This strategy is shown to be effective in improving fairness on the citation, social, and credit networks studied.

Critical Analysis

The researchers provide a comprehensive theoretical and empirical analysis of the degree bias in GCN link prediction and its implications for fairness. However, the paper does not explore the potential causes of this bias in depth, such as the role of the graph convolutional filter or the properties of the input networks.

Additionally, while the proposed fairness metric and training-time strategy are promising, their effectiveness may be limited to the specific networks and social group definitions used in the experiments. Further research is needed to understand the generalizability of these approaches and their performance on a wider range of networks and group definitions.

The paper also does not discuss the potential trade-offs between fairness and other desirable properties of link prediction, such as accuracy or efficiency. Exploring these trade-offs could provide valuable insights for practitioners seeking to balance these competing objectives.

Conclusion

This paper provides important theoretical and empirical insights into the degree bias in GCN link prediction and its consequences for fairness within social groups. The researchers' findings highlight the need to consider group-level fairness in addition to individual-level fairness when deploying GNNs in real-world applications.

The proposed fairness metric and training-time strategy offer promising approaches to mitigate these biases, but further research is needed to fully understand their limitations and applicability. As GNNs become increasingly ubiquitous, addressing these fairness concerns will be crucial for ensuring that these powerful models benefit all members of society equitably.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🧠

Networked Inequality: Preferential Attachment Bias in Graph Neural Network Link Prediction

Arjun Subramonian, Levent Sagun, Yizhou Sun

Graph neural network (GNN) link prediction is increasingly deployed in citation, collaboration, and online social networks to recommend academic literature, collaborators, and friends. While prior research has investigated the dyadic fairness of GNN link prediction, the within-group (e.g., queer women) fairness and rich get richer dynamics of link prediction remain underexplored. However, these aspects have significant consequences for degree and power imbalances in networks. In this paper, we shed light on how degree bias in networks affects Graph Convolutional Network (GCN) link prediction. In particular, we theoretically uncover that GCNs with a symmetric normalized graph filter have a within-group preferential attachment bias. We validate our theoretical analysis on real-world citation, collaboration, and online social networks. We further bridge GCN's preferential attachment bias with unfairness in link prediction and propose a new within-group fairness metric. This metric quantifies disparities in link prediction scores within social groups, towards combating the amplification of degree and power disparities. Finally, we propose a simple training-time strategy to alleviate within-group unfairness, and we show that it is effective on citation, social, and credit networks.

6/6/2024

Promoting Fairness in Link Prediction with Graph Enhancement

Yezi Liu, Hanning Chen, Mohsen Imani

Link prediction is a crucial task in network analysis, but it has been shown to be prone to biased predictions, particularly when links are unfairly predicted between nodes from different sensitive groups. In this paper, we study the fair link prediction problem, which aims to ensure that the predicted link probability is independent of the sensitive attributes of the connected nodes. Existing methods typically incorporate debiasing techniques within graph embeddings to mitigate this issue. However, training on large real-world graphs is already challenging, and adding fairness constraints can further complicate the process. To overcome this challenge, we propose FairLink, a method that learns a fairness-enhanced graph to bypass the need for debiasing during the link predictor's training. FairLink maintains link prediction accuracy by ensuring that the enhanced graph follows a training trajectory similar to that of the original input graph. Meanwhile, it enhances fairness by minimizing the absolute difference in link probabilities between node pairs within the same sensitive group and those between node pairs from different sensitive groups. Our extensive experiments on multiple large-scale graphs demonstrate that FairLink not only promotes fairness but also often achieves link prediction accuracy comparable to baseline methods. Most importantly, the enhanced graph exhibits strong generalizability across different GNN architectures.

9/16/2024

🔮

Link Prediction in Bipartite Networks

c{S}ukru Demir .Inan Ozer (GSU), Gunce Keziban Orman (GSU), Vincent Labatut (LIA)

Bipartite networks serve as highly suitable models to represent systems involving interactions between two distinct types of entities, such as online dating platforms, job search services, or ecommerce websites. These models can be leveraged to tackle a number of tasks, including link prediction among the most useful ones, especially to design recommendation systems. However, if this task has garnered much interest when conducted on unipartite (i.e. standard) networks, it is far from being the case for bipartite ones. In this study, we address this gap by performing an experimental comparison of 19 link prediction methods able to handle bipartite graphs. Some come directly from the literature, and some are adapted by us from techniques originally designed for unipartite networks. We also propose to repurpose recommendation systems based on graph convolutional networks (GCN) as a novel link prediction solution for bipartite networks. To conduct our experiments, we constitute a benchmark of 3 real-world bipartite network datasets with various topologies. Our results indicate that GCN-based personalized recommendation systems, which have received significant attention in recent years, can produce successful results for link prediction in bipartite networks. Furthermore, purely heuristic metrics that do not rely on any learning process, like the Structural Perturbation Method (SPM), can also achieve success.

6/12/2024

Theoretical and Empirical Insights into the Origins of Degree Bias in Graph Neural Networks

Arjun Subramonian, Jian Kang, Yizhou Sun

Graph Neural Networks (GNNs) often perform better for high-degree nodes than low-degree nodes on node classification tasks. This degree bias can reinforce social marginalization by, e.g., sidelining authors of lowly-cited papers when predicting paper topics in citation networks. While researchers have proposed numerous hypotheses for why GNN degree bias occurs, we find via a survey of 38 degree bias papers that these hypotheses are often not rigorously validated, and can even be contradictory. Thus, we provide an analysis of the origins of degree bias in message-passing GNNs with different graph filters. We prove that high-degree test nodes tend to have a lower probability of misclassification regardless of how GNNs are trained. Moreover, we show that degree bias arises from a variety of factors that are associated with a node's degree (e.g., homophily of neighbors, diversity of neighbors). Furthermore, we show that during training, some GNNs may adjust their loss on low-degree nodes more slowly than on high-degree nodes; however, with sufficiently many epochs of training, message-passing GNNs can achieve their maximum possible training accuracy, which is not significantly limited by their expressive power. Throughout our analysis, we connect our findings to previously-proposed hypotheses for the origins of degree bias, supporting and unifying some while drawing doubt to others. We validate our theoretical findings on 8 common real-world networks, and based on our theoretical and empirical insights, describe a roadmap to alleviate degree bias.

4/5/2024