tnGPS: Discovering Unknown Tensor Network Structure Search Algorithms via Large Language Models (LLMs)

Read original: arXiv:2402.02456 - Published 6/4/2024 by Junhua Zeng, Chao Li, Zhun Sun, Qibin Zhao, Guoxu Zhou

Overview

Tensor networks are a powerful tool for representing high-dimensional data, but selecting the right tensor network structure is a challenging problem.
Most existing algorithms for tensor network structure search (TN-SS) are manually crafted heuristics that perform poorly and suffer from the curse of dimensionality and local convergence.
This paper proposes an approach called tnGPS that uses large language models (LLMs) to automatically discover new TN-SS algorithms, replacing the human experts who would otherwise design them.

Plain English Explanation

Tensor networks are a way to efficiently represent and work with extremely complex, high-dimensional data. Imagine you have a huge, multi-dimensional dataset that's too complex for a regular computer to handle. Tensor networks can break that data down into smaller, more manageable pieces, making it much easier to work with.
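To make the "smaller pieces" idea concrete, here is a minimal NumPy sketch of one common tensor network, the tensor train, built with sequential truncated SVDs. This summary does not say which decompositions the paper uses, so the function names `tt_svd` and `tt_reconstruct`, the 6x6x6x6 shape, and the rank of 2 are all illustrative assumptions.

```python
import numpy as np

def tt_svd(tensor, max_rank):
    """Split `tensor` into tensor-train cores using sequential truncated SVDs."""
    dims = tensor.shape
    cores = []
    rank_prev = 1
    remainder = tensor
    for k in range(len(dims) - 1):
        # Unfold so that rows combine the previous bond with the current mode.
        remainder = remainder.reshape(rank_prev * dims[k], -1)
        u, s, vt = np.linalg.svd(remainder, full_matrices=False)
        rank = min(max_rank, len(s))
        cores.append(u[:, :rank].reshape(rank_prev, dims[k], rank))
        # Carry the truncated singular values into the rest of the tensor.
        remainder = s[:rank, None] * vt[:rank]
        rank_prev = rank
    cores.append(remainder.reshape(rank_prev, dims[-1], 1))
    return cores

def tt_reconstruct(cores):
    """Contract the chain of cores back into the full tensor."""
    result = cores[0]
    for core in cores[1:]:
        result = np.tensordot(result, core, axes=([-1], [0]))
    return result.reshape([core.shape[1] for core in cores])

# Illustrative data: a tensor that has an exact rank-2 tensor-train form.
rng = np.random.default_rng(0)
core_shapes = [(1, 6, 2), (2, 6, 2), (2, 6, 2), (2, 6, 1)]
tensor = tt_reconstruct([rng.standard_normal(shape) for shape in core_shapes])

cores = tt_svd(tensor, max_rank=2)
num_params = sum(core.size for core in cores)
print("full tensor entries:", tensor.size)
print("tensor-train entries:", num_params)
```

In this toy case the four cores hold 72 numbers in total, while the full tensor holds 1296. Real data is rarely exactly low rank, so truncation normally trades some accuracy for the smaller representation.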

However, choosing the right tensor network structure for a given problem is a major challenge. Most existing algorithms for this "tensor network structure search" (TN-SS) problem are manually created by human experts, and they often perform poorly, especially as the data gets more complex.
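One way to see why the search is hard is to count the candidates. The sketch below assumes a simple encoding that is not taken from this summary: a structure over N cores is a symmetric matrix whose entry (i, j) is the rank of the bond between cores i and j, with rank 1 meaning "no connection". The helper names `core_sizes` and `search_space_size` are hypothetical.

```python
from math import comb

import numpy as np

def core_sizes(rank_matrix, mode_dims):
    """Number of entries in each core for a given bond-rank matrix."""
    ranks = np.asarray(rank_matrix)
    sizes = []
    for i, dim in enumerate(mode_dims):
        bonds = np.delete(ranks[i], i)  # ranks of the edges touching core i
        sizes.append(int(dim * np.prod(bonds)))
    return sizes

def search_space_size(num_cores, max_rank):
    """Count structures when every pair of cores may take a rank in 1..max_rank."""
    return max_rank ** comb(num_cores, 2)

# A ring of four cores with bond rank 2 (assumed example; diagonal is ignored).
ring = [
    [1, 2, 1, 2],
    [2, 1, 2, 1],
    [1, 2, 1, 2],
    [2, 1, 2, 1],
]
print(core_sizes(ring, [5, 5, 5, 5]))
for n in (4, 6, 8):
    print(n, "cores:", search_space_size(n, 3), "candidate structures")
```

Under this encoding the number of candidates grows exponentially in the number of core pairs, which is one way to read the "curse of dimensionality" that hand-designed search heuristics run into.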

In this paper, the researchers propose a new approach called tnGPS that uses large language models (LLMs) to automatically discover new TN-SS algorithms, without relying on human experts. By modeling how human experts innovate in this area, the researchers built a system that can generate and refine new TN-SS algorithms on its own, potentially outperforming the existing, manually crafted methods.

Technical Explanation

The key innovation in this paper is the use of large language models (LLMs) to automatically discover new tensor network structure search (TN-SS) algorithms. The researchers observed that human experts often follow a common workflow when innovating in this area, and they have modeled this workflow in an "automatic algorithm discovery framework" called tnGPS.

tnGPS is an elaborate prompting pipeline that instructs LLMs to generate new TN-SS algorithms through iterative refinement and enhancement. The framework takes inspiration from how human experts approach this problem, breaking down the task into smaller steps and using the LLM to generate, evaluate, and improve candidate algorithms.
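The generate, evaluate, and improve cycle can be sketched as a small loop. This is not the tnGPS pipeline: the summary does not give its prompts or selection rules, so everything here is an assumption made for illustration. In particular, `stub_llm` is a hypothetical stand-in for a real LLM call that only perturbs one constant, and the "benchmark" is a toy one-dimensional objective rather than a TN-SS task.

```python
import random
import re

# A candidate "algorithm" is Python source text, as an LLM would return it.
SOURCE_TEMPLATE = """STEP = {step!r}

def search(objective, rng, iterations=200):
    x = 0.0
    best = objective(x)
    for _ in range(iterations):
        candidate = x + rng.uniform(-STEP, STEP)
        value = objective(candidate)
        if value < best:
            x, best = candidate, value
    return best
"""

def evaluate(source):
    """Run a candidate on a toy benchmark; lower scores are better."""
    namespace = {}
    exec(source, namespace)  # acceptable here only because the source is our own
    return namespace["search"](lambda x: (x - 3.0) ** 2, random.Random(0))

def stub_llm(prompt, rng):
    """Hypothetical stand-in for an LLM: rescale the parent's step size."""
    parent_step = float(re.search(r"STEP = ([0-9.eE+-]+)", prompt).group(1))
    return SOURCE_TEMPLATE.format(step=parent_step * rng.uniform(0.5, 2.0))

def discover(llm, seed_source, rounds, keep, rng):
    """Iteratively ask the LLM to refine the best candidate found so far."""
    population = [(evaluate(seed_source), seed_source)]
    for _ in range(rounds):
        score, parent = min(population)
        prompt = (
            f"This search algorithm scored {score:.4g} (lower is better):\n"
            f"{parent}\nPropose an improved variant."
        )
        child = llm(prompt, rng)
        population.append((evaluate(child), child))
        population = sorted(population)[:keep]  # keep only the best few
    return population[0]

seed = SOURCE_TEMPLATE.format(step=0.5)
seed_score = evaluate(seed)
best_score, best_source = discover(stub_llm, seed, rounds=10, keep=3, rng=random.Random(1))
print("seed score:", seed_score)
print("best score:", best_score)
```

Because the seed stays in the population until something better displaces it, the best score can only stay the same or improve from round to round. A real pipeline would replace `stub_llm` with a model call and would need to sandbox generated code before executing it.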

The researchers evaluated the performance of the algorithms discovered by tnGPS on benchmark tasks, and found that they outperformed the current state-of-the-art TN-SS methods. This suggests that leveraging the capabilities of LLMs can be a powerful approach for automating the discovery of new algorithms in this domain, potentially overcoming the limitations of manually crafted heuristics.

Critical Analysis

While the tnGPS framework shows promising results, the paper acknowledges that there are still some limitations and areas for further research. For example, the paper does not provide detailed insights into the inner workings of the LLM-generated algorithms, making it difficult to fully understand their strengths and weaknesses.

Additionally, the tnGPS-discovered algorithms were evaluated only on a limited set of benchmark tasks. Testing them on a wider range of real-world problems would show how well they generalize.

Another potential area for further exploration is the scalability of the tnGPS approach as the complexity of the target algorithms increases. The paper does not address how the framework would handle the discovery of more sophisticated TN-SS algorithms that might be required for extremely high-dimensional or complex datasets.

Despite these caveats, the overall concept of leveraging large language models (LLMs) to automate the discovery of new algorithms is a promising direction that could have significant implications for fields like neural architecture search (NAS) and other areas of combinatorial optimization.

Conclusion

This paper presents tnGPS, an approach that uses large language models to automatically discover new tensor network structure search (TN-SS) algorithms in place of manually crafted heuristics. In the reported experiments, the algorithms discovered by tnGPS outperform existing state-of-the-art methods on benchmarks.

While there are still some limitations and areas for further research, the overall concept of leveraging LLMs to automate algorithm discovery is a significant step forward in the field of tensor networks and high-dimensional representation. If successfully scaled and refined, this approach could have far-reaching implications for a wide range of optimization and problem-solving tasks in the future.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

tnGPS: Discovering Unknown Tensor Network Structure Search Algorithms via Large Language Models (LLMs)

Junhua Zeng, Chao Li, Zhun Sun, Qibin Zhao, Guoxu Zhou

Tensor networks are efficient for extremely high-dimensional representation, but their model selection, known as tensor network structure search (TN-SS), is a challenging problem. Although several works have targeted TN-SS, most existing algorithms are manually crafted heuristics with poor performance, suffering from the curse of dimensionality and local convergence. In this work, we jump out of the box, studying how to harness large language models (LLMs) to automatically discover new TN-SS algorithms, replacing the involvement of human experts. By observing how human experts innovate in research, we model their common workflow and propose an automatic algorithm discovery framework called tnGPS. The proposed framework is an elaborate prompting pipeline that instructs LLMs to generate new TN-SS algorithms through iterative refinement and enhancement. The experimental results demonstrate that the algorithms discovered by tnGPS exhibit superior performance in benchmarks compared to the current state-of-the-art methods.

6/4/2024

Computation-friendly Graph Neural Network Design by Accumulating Knowledge on Large Language Models

Jialiang Wang, Shimin Di, Hanmo Liu, Zhili Wang, Jiachuan Wang, Lei Chen, Xiaofang Zhou

Graph Neural Networks (GNNs), like other neural networks, have shown remarkable success but are hampered by the complexity of their architecture designs, which heavily depend on specific data and tasks. Traditionally, designing proper architectures involves trial and error, which requires intensive manual effort to optimize various components. To reduce human workload, researchers try to develop automated algorithms to design GNNs. However, both experts and automated algorithms suffer from two major issues in designing GNNs: 1) the substantial computational resources expended in repeatedly trying candidate GNN architectures until a feasible design is achieved, and 2) the intricate and prolonged processes required for humans or algorithms to accumulate knowledge of the interrelationship between graphs, GNNs, and performance. To further enhance the automation of GNN architecture design, we propose a computation-friendly way to empower Large Language Models (LLMs) with specialized knowledge in designing GNNs, thereby drastically shortening the computational overhead and development cycle of designing GNN architectures. Our framework begins by establishing a knowledge retrieval pipeline that comprehends the intercorrelations between graphs, GNNs, and performance. This pipeline converts past model design experiences into structured knowledge for LLM reference, allowing it to quickly suggest initial model proposals. Subsequently, we introduce a knowledge-driven search strategy that emulates the exploration-exploitation process of human experts, enabling quick refinement of initial proposals within a promising scope. Extensive experiments demonstrate that our framework can efficiently deliver promising (e.g., Top-5.77%) initial model proposals for unseen datasets within seconds and without any prior training and achieve outstanding search performance in a few iterations.

8/14/2024

Large Language Models as Topological Structure Enhancers for Text-Attributed Graphs

Shengyin Sun, Yuxiang Ren, Chen Ma, Xuecang Zhang

The latest advancements in large language models (LLMs) have revolutionized the field of natural language processing (NLP). Inspired by the success of LLMs in NLP tasks, some recent work has begun investigating the potential of applying LLMs in graph learning tasks. However, most of the existing work focuses on utilizing LLMs as powerful node feature augmenters, leaving employing LLMs to enhance graph topological structures an understudied problem. In this work, we explore how to leverage the information retrieval and text generation capabilities of LLMs to refine/enhance the topological structure of text-attributed graphs (TAGs) under the node classification setting. First, we propose using LLMs to help remove unreliable edges and add reliable ones in the TAG. Specifically, we first let the LLM output the semantic similarity between node attributes through delicate prompt designs, and then perform edge deletion and edge addition based on the similarity. Second, we propose using pseudo-labels generated by the LLM to improve graph topology, that is, we introduce the pseudo-label propagation as a regularization to guide the graph neural network (GNN) in learning proper edge weights. Finally, we incorporate the two aforementioned LLM-based methods for graph topological refinement into the process of GNN training, and perform extensive experiments on four real-world datasets. The experimental results demonstrate the effectiveness of LLM-based graph topology refinement (achieving a 0.15%--2.47% performance gain on public benchmarks).

7/25/2024

Exploring Combinatorial Problem Solving with Large Language Models: A Case Study on the Travelling Salesman Problem Using GPT-3.5 Turbo

Mahmoud Masoud, Ahmed Abdelhay, Mohammed Elhenawy

Large Language Models (LLMs) are deep learning models designed to generate text based on textual input. Although researchers have been developing these models for more complex tasks such as code generation and general reasoning, few efforts have explored how LLMs can be applied to combinatorial problems. In this research, we investigate the potential of LLMs to solve the Travelling Salesman Problem (TSP). Utilizing GPT-3.5 Turbo, we conducted experiments employing various approaches, including zero-shot in-context learning, few-shot in-context learning, and chain-of-thoughts (CoT). Consequently, we fine-tuned GPT-3.5 Turbo to solve a specific problem size and tested it using a set of various instance sizes. The fine-tuned models demonstrated promising performance on problems identical in size to the training instances and generalized well to larger problems. Furthermore, to improve the performance of the fine-tuned model without incurring additional training costs, we adopted a self-ensemble approach to improve the quality of the solutions.

5/6/2024