Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs

Read original: arXiv:2407.15431 - Published 7/23/2024 by Huanjing Zhao, Beining Yang, Yukuo Cen, Junyu Ren, Chenhui Zhang, Yuxiao Dong, Evgeny Kharlamov, Shu Zhao, Jie Tang

Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs

Overview

This paper explores pre-training and prompting techniques for few-shot node classification on text-attributed graphs.
The authors propose a novel pre-training approach called Pure Transformer Pre-training (PTP) that leverages both graph structure and textual attributes.
They also introduce a prompting-based method called Prompt-based Text-Attribute Graph Classification (PTGC) for few-shot node classification.
The proposed techniques are evaluated on several real-world datasets and show significant performance improvements over existing methods.

Plain English Explanation

The paper focuses on the task of node classification on text-attributed graphs, which are graphs where each node has associated textual information. This is a common scenario in many real-world applications, such as classifying research papers into topics based on their titles and abstracts.

The challenge is that in many cases, we only have a small number of labeled nodes (called "few-shot learning"), making it difficult to train effective models. To address this, the authors propose two key innovations:

Pure Transformer Pre-training (PTP): This is a method for pre-training the model on the graph structure and textual attributes, before fine-tuning it on the few labeled nodes. This helps the model learn useful representations that can be applied to the few-shot task.
Prompt-based Text-Attribute Graph Classification (PTGC): This is a prompting technique that allows the model to leverage the textual attributes of nodes more effectively during the few-shot learning process.

By combining these two techniques, the authors are able to significantly improve the performance of few-shot node classification on text-attributed graphs, outperforming previous state-of-the-art methods.

Technical Explanation

The key technical contributions of the paper are:

Pure Transformer Pre-training (PTP): The authors propose a pre-training approach that jointly learns representations from both the graph structure and the textual attributes of nodes. This is done by training a Transformer-based model to predict the textual attributes of nodes given their graph neighborhoods.
Prompt-based Text-Attribute Graph Classification (PTGC): For the few-shot node classification task, the authors introduce a prompting-based method that conditions the model's predictions on prompts that encode the textual attributes of the nodes. This allows the model to better leverage the rich textual information during the few-shot learning process.
Experiments: The authors evaluate their proposed techniques on several real-world text-attributed graph datasets, including citation networks and social media networks. They show that PTP and PTGC significantly outperform previous state-of-the-art methods for few-shot node classification.

Critical Analysis

The paper presents a well-designed and thorough investigation of pre-training and prompting techniques for few-shot node classification on text-attributed graphs. The authors acknowledge several limitations and areas for further research, such as:

The performance of the proposed methods may be sensitive to the quality and quantity of the textual attributes, which can vary across real-world datasets.
The computational complexity of the Transformer-based models used in PTP and PTGC may be a concern for large-scale graphs.
The paper does not explore the potential synergies between PTP and other self-supervised graph representation learning methods, such as CONGRAT or TAGA.

Overall, this paper makes a valuable contribution to the field of graph machine learning, demonstrating the benefits of leveraging both graph structure and textual attributes for few-shot node classification. The proposed techniques could be further extended and applied to a wider range of text-attributed graph problems.

Conclusion

This paper presents two novel techniques, Pure Transformer Pre-training (PTP) and Prompt-based Text-Attribute Graph Classification (PTGC), for addressing the challenge of few-shot node classification on text-attributed graphs. By combining pre-training on graph structure and textual attributes with prompting-based few-shot learning, the authors are able to significantly improve the performance of node classification models on several real-world datasets. This work highlights the importance of jointly modeling graph structure and textual information for graph machine learning tasks and opens up new directions for future research in this area.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs

Huanjing Zhao, Beining Yang, Yukuo Cen, Junyu Ren, Chenhui Zhang, Yuxiao Dong, Evgeny Kharlamov, Shu Zhao, Jie Tang

The text-attributed graph (TAG) is one kind of important real-world graph-structured data with each node associated with raw texts. For TAGs, traditional few-shot node classification methods directly conduct training on the pre-processed node features and do not consider the raw texts. The performance is highly dependent on the choice of the feature pre-processing method. In this paper, we propose P2TAG, a framework designed for few-shot node classification on TAGs with graph pre-training and prompting. P2TAG first pre-trains the language model (LM) and graph neural network (GNN) on TAGs with self-supervised loss. To fully utilize the ability of language models, we adapt the masked language modeling objective for our framework. The pre-trained model is then used for the few-shot node classification with a mixed prompt method, which simultaneously considers both text and graph information. We conduct experiments on six real-world TAGs, including paper citation networks and product co-purchasing networks. Experimental results demonstrate that our proposed framework outperforms existing graph few-shot learning methods on these datasets with +18.98% ~ +35.98% improvements.

7/23/2024

A Pure Transformer Pretraining Framework on Text-attributed Graphs

Yu Song, Haitao Mao, Jiachen Xiao, Jingzhe Liu, Zhikai Chen, Wei Jin, Carl Yang, Jiliang Tang, Hui Liu

Pretraining plays a pivotal role in acquiring generalized knowledge from large-scale data, achieving remarkable successes as evidenced by large models in CV and NLP. However, progress in the graph domain remains limited due to fundamental challenges such as feature heterogeneity and structural heterogeneity. Recently, increasing efforts have been made to enhance node feature quality with Large Language Models (LLMs) on text-attributed graphs (TAGs), demonstrating superiority to traditional bag-of-words or word2vec techniques. These high-quality node features reduce the previously critical role of graph structure, resulting in a modest performance gap between Graph Neural Networks (GNNs) and structure-agnostic Multi-Layer Perceptrons (MLPs). Motivated by this, we introduce a feature-centric pretraining perspective by treating graph structure as a prior and leveraging the rich, unified feature space to learn refined interaction patterns that generalizes across graphs. Our framework, Graph Sequence Pretraining with Transformer (GSPT), samples node contexts through random walks and employs masked feature reconstruction to capture pairwise proximity in the LLM-unified feature space using a standard Transformer. By utilizing unified text representations rather than varying structures, our framework achieves significantly better transferability among graphs within the same domain. GSPT can be easily adapted to both node classification and link prediction, demonstrating promising empirical success on various datasets.

6/21/2024

🏷️

Prompt Tuning on Graph-augmented Low-resource Text Classification

Zhihao Wen, Yuan Fang

Text classification is a fundamental problem in information retrieval with many real-world applications, such as predicting the topics of online articles and the categories of e-commerce product descriptions. However, low-resource text classification, with no or few labeled samples, presents a serious concern for supervised learning. Meanwhile, many text data are inherently grounded on a network structure, such as a hyperlink/citation network for online articles, and a user-item purchase network for e-commerce products. These graph structures capture rich semantic relationships, which can potentially augment low-resource text classification. In this paper, we propose a novel model called Graph-Grounded Pre-training and Prompting (G2P2) to address low-resource text classification in a two-pronged approach. During pre-training, we propose three graph interaction-based contrastive strategies to jointly pre-train a graph-text model; during downstream classification, we explore handcrafted discrete prompts and continuous prompt tuning for the jointly pre-trained model to achieve zero- and few-shot classification, respectively. Moreover, we explore the possibility of employing continuous prompt tuning for zero-shot inference. Specifically, we aim to generalize continuous prompts to unseen classes while leveraging a set of base classes. To this end, we extend G2P2 into G2P2$^*$, hinging on a new architecture of conditional prompt tuning. Extensive experiments on four real-world datasets demonstrate the strength of G2P2 in zero- and few-shot low-resource text classification tasks, and illustrate the advantage of G2P2$^*$ in dealing with unseen classes.

8/20/2024

Hound: Hunting Supervision Signals for Few and Zero Shot Node Classification on Text-attributed Graph

Yuxiang Wang, Xiao Yan, Shiyu Jin, Quanqing Xu, Chuanhui Yang, Yuanyuan Zhu, Chuang Hu, Bo Du, Jiawei Jiang

Text-attributed graph (TAG) is an important type of graph structured data with text descriptions for each node. Few- and zero-shot node classification on TAGs have many applications in fields such as academia and social networks. However, the two tasks are challenging due to the lack of supervision signals, and existing methods only use the contrastive loss to align graph-based node embedding and language-based text embedding. In this paper, we propose Hound to improve accuracy by introducing more supervision signals, and the core idea is to go beyond the node-text pairs that come with data. Specifically, we design three augmentation techniques, i.e., node perturbation, text matching, and semantics negation to provide more reference nodes for each text and vice versa. Node perturbation adds/drops edges to produce diversified node embeddings that can be matched with a text. Text matching retrieves texts with similar embeddings to match with a node. Semantics negation uses a negative prompt to construct a negative text with the opposite semantics, which is contrasted with the original node and text. We evaluate Hound on 5 datasets and compare with 13 state-of-the-art baselines. The results show that Hound consistently outperforms all baselines, and its accuracy improvements over the best-performing baseline are usually over 5%.

9/4/2024