Heterogeneous Graph Contrastive Learning with Meta-path Contexts and Adaptively Weighted Negative Samples

Read original: arXiv:2212.13847 - Published 4/8/2024 by Jianxiang Yu, Qingqing Ge, Xiang Li, Aoying Zhou

🤿

Overview

This paper introduces a new method called MEOW (Meta-path-Enriched cOntrast learning on Weighted negative samples) for heterogeneous graph contrastive learning.
Existing methods that use meta-paths (sequences of object types) to construct contrastive views often ignore the rich context information describing how objects are connected by those meta-paths.
They also fail to properly distinguish negative samples, which can adversely impact model performance.
MEOW addresses these issues by considering both meta-path contexts and weighted negative samples to learn better node representations.

Plain English Explanation

Heterogeneous graph contrastive learning is a technique used to learn useful representations of nodes (e.g., entities or objects) in complex, multi-typed graphs. The key idea is to create two different "views" of the same node and then train the model to recognize when two views correspond to the same node versus when they correspond to different nodes.

Most existing methods use meta-paths - sequences of node types that capture semantic relationships - to construct these contrastive views. However, they often ignore important information about

how

the nodes are connected by those meta-paths. For example, two nodes might be connected by a meta-path that indicates they are both employees of the same company, but the specific details of their employment (e.g., job titles, departments) are not captured.

Additionally, these methods struggle to effectively distinguish "negative" samples (i.e., pairs of views that should not be considered similar). Properly handling negative samples is crucial for the contrastive learning process to work well and produce high-quality node representations.

To address these limitations, the MEOW method constructs two types of contrastive views: a coarse view that captures which nodes are connected by meta-paths, and a fine-grained view that uses meta-path context information to characterize how the nodes are connected. MEOW also introduces a novel approach to weighting negative samples, based on node clustering and prototypical contrastive learning, to better distinguish them during training.

The authors also propose a variant called AdaMEOW that adaptively learns soft weights for negative samples, further improving node representation quality.

Technical Explanation

The key technical aspects of MEOW are as follows:

Coarse and Fine-grained Views: MEOW constructs two contrastive views of each node. The coarse view reflects which nodes are connected by meta-paths, while the fine-grained view uses meta-path contexts to capture details on how the nodes are connected.
Weighted Negative Samples: The authors recognize limitations in the standard InfoNCE loss for computing gradients of negative samples. To better distinguish negative samples, MEOW learns hard-valued weights for them based on node clustering and prototypical contrastive learning.
AdaMEOW: A variant of MEOW that adaptively learns soft-valued weights of negative samples, further improving node representation quality.

The authors conduct extensive experiments to demonstrate the superiority of MEOW and AdaMEOW compared to other state-of-the-art methods for heterogeneous graph contrastive learning.

Critical Analysis

The paper makes a compelling case for the importance of considering meta-path context information and properly handling negative samples in heterogeneous graph contrastive learning. The MEOW and AdaMEOW methods appear to be novel and effective approaches to addressing these challenges.

That said, the paper does not discuss any major limitations or caveats of the proposed methods. It would be helpful to understand the computational complexity of MEOW and AdaMEOW, as well as any potential performance trade-offs or edge cases where the methods might not perform as well.

Additionally, the authors could have explored the generalizability of their findings by evaluating the methods on a wider range of datasets and tasks, such as evaluating large language models using contrast sets or more realistic evaluation setups for community detection models.

Overall, the paper presents a promising new direction for heterogeneous graph contrastive learning, but further research and analysis could help strengthen the claims and explore the broader implications of the work.

Conclusion

The MEOW and AdaMEOW methods introduced in this paper represent an important advancement in heterogeneous graph contrastive learning. By incorporating meta-path context information and learning more effective negative sample weighting, these approaches demonstrate superior performance compared to existing methods.

The techniques described in this paper could have significant impact on a wide range of applications that rely on learning high-quality representations of nodes in complex, multi-typed graphs, such as personalized federated learning for spatio-temporal forecasting or positive-unlabeled contrastive learning. Further research and real-world deployments of these methods could lead to substantial improvements in graph-based machine learning and data mining.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

🤿

Heterogeneous Graph Contrastive Learning with Meta-path Contexts and Adaptively Weighted Negative Samples

Jianxiang Yu, Qingqing Ge, Xiang Li, Aoying Zhou

Heterogeneous graph contrastive learning has received wide attention recently. Some existing methods use meta-paths, which are sequences of object types that capture semantic relationships between objects, to construct contrastive views. However, most of them ignore the rich meta-path context information that describes how two objects are connected by meta-paths. Further, they fail to distinguish negative samples, which could adversely affect the model performance. To address the problems, we propose MEOW, which considers both meta-path contexts and weighted negative samples. Specifically, MEOW constructs a coarse view and a fine-grained view for contrast. The former reflects which objects are connected by meta-paths, while the latter uses meta-path contexts and characterizes details on how the objects are connected. Then, we theoretically analyze the InfoNCE loss and recognize its limitations for computing gradients of negative samples. To better distinguish negative samples, we learn hard-valued weights for them based on node clustering and use prototypical contrastive learning to pull close embeddings of nodes in the same cluster. In addition, we propose a variant model AdaMEOW that adaptively learns soft-valued weights of negative samples to further improve node representation. Finally, we conduct extensive experiments to show the superiority of MEOW and AdaMEOW against other state-of-the-art methods.

4/8/2024

Meta-GPS++: Enhancing Graph Meta-Learning with Contrastive Learning and Self-Training

Yonghao Liu, Mengyu Li, Ximing Li, Lan Huang, Fausto Giunchiglia, Yanchun Liang, Xiaoyue Feng, Renchu Guan

Node classification is an essential problem in graph learning. However, many models typically obtain unsatisfactory performance when applied to few-shot scenarios. Some studies have attempted to combine meta-learning with graph neural networks to solve few-shot node classification on graphs. Despite their promising performance, some limitations remain. First, they employ the node encoding mechanism of homophilic graphs to learn node embeddings, even in heterophilic graphs. Second, existing models based on meta-learning ignore the interference of randomness in the learning process. Third, they are trained using only limited labeled nodes within the specific task, without explicitly utilizing numerous unlabeled nodes. Finally, they treat almost all sampled tasks equally without customizing them for their uniqueness. To address these issues, we propose a novel framework for few-shot node classification called Meta-GPS++. Specifically, we first adopt an efficient method to learn discriminative node representations on homophilic and heterophilic graphs. Then, we leverage a prototype-based approach to initialize parameters and contrastive learning for regularizing the distribution of node embeddings. Moreover, we apply self-training to extract valuable information from unlabeled nodes. Additionally, we adopt S$^2$ (scaling & shifting) transformation to learn transferable knowledge from diverse tasks. The results on real-world datasets show the superiority of Meta-GPS++. Our code is available here.

7/23/2024

LAMP: Learnable Meta-Path Guided Adversarial Contrastive Learning for Heterogeneous Graphs

Siqing Li, Jin-Duk Park, Wei Huang, Xin Cao, Won-Yong Shin, Zhiqiang Xu

Heterogeneous graph neural networks (HGNNs) have significantly propelled the information retrieval (IR) field. Still, the effectiveness of HGNNs heavily relies on high-quality labels, which are often expensive to acquire. This challenge has shifted attention towards Heterogeneous Graph Contrastive Learning (HGCL), which usually requires pre-defined meta-paths. However, our findings reveal that meta-path combinations significantly affect performance in unsupervised settings, an aspect often overlooked in current literature. Existing HGCL methods have considerable variability in outcomes across different meta-path combinations, thereby challenging the optimization process to achieve consistent and high performance. In response, we introduce textsf{LAMP} (underline{textbf{L}}earnunderline{textbf{A}}ble underline{textbf{M}}eta-underline{textbf{P}}ath), a novel adversarial contrastive learning approach that integrates various meta-path sub-graphs into a unified and stable structure, leveraging the overlap among these sub-graphs. To address the denseness of this integrated sub-graph, we propose an adversarial training strategy for edge pruning, maintaining sparsity to enhance model performance and robustness. textsf{LAMP} aims to maximize the difference between meta-path and network schema views for guiding contrastive learning to capture the most meaningful information. Our extensive experimental study conducted on four diverse datasets from the Heterogeneous Graph Benchmark (HGB) demonstrates that textsf{LAMP} significantly outperforms existing state-of-the-art unsupervised models in terms of accuracy and robustness.

9/11/2024

From Overfitting to Robustness: Quantity, Quality, and Variety Oriented Negative Sample Selection in Graph Contrastive Learning

Adnan Ali, Jinlong Li, Huanhuan Chen, Ali Kashif Bashir

Graph contrastive learning (GCL) aims to contrast positive-negative counterparts to learn the node embeddings, whereas graph data augmentation methods are employed to generate these positive-negative samples. The variation, quantity, and quality of negative samples compared to positive samples play crucial roles in learning meaningful embeddings for node classification downstream tasks. Less variation, excessive quantity, and low-quality negative samples cause the model to be overfitted for particular nodes, resulting in less robust models. To solve the overfitting problem in the GCL paradigm, this study proposes a novel Cumulative Sample Selection (CSS) algorithm by comprehensively considering negative samples' quality, variations, and quantity. Initially, three negative sample pools are constructed: easy, medium, and hard negative samples, which contain 25%, 50%, and 25% of the total available negative samples, respectively. Then, 10% negative samples are selected from each of these three negative sample pools for training the model. After that, a decision agent module evaluates model training results and decides whether to explore more negative samples from three negative sample pools by increasing the ratio or keep exploiting the current sampling ratio. The proposed algorithm is integrated into a proposed graph contrastive learning framework named NegAmplify. NegAmplify is compared with the SOTA methods on nine graph node classification datasets, with seven achieving better node classification accuracy with up to 2.86% improvement.

6/24/2024