Graph Automorphism Group Equivariant Neural Networks

2307.07810

Published 5/29/2024 by Edward Pearce-Crump, William J. Knottenbelt

🧠

Abstract

Permutation equivariant neural networks are typically used to learn from data that lives on a graph. However, for any graph $G$ that has $n$ vertices, using the symmetric group $S_n$ as its group of symmetries does not take into account the relations that exist between the vertices. Given that the actual group of symmetries is the automorphism group Aut$(G)$, we show how to construct neural networks that are equivariant to Aut$(G)$ by obtaining a full characterisation of the learnable, linear, Aut$(G)$-equivariant functions between layers that are some tensor power of $mathbb{R}^{n}$. In particular, we find a spanning set of matrices for these layer functions in the standard basis of $mathbb{R}^{n}$. This result has important consequences for learning from data whose group of symmetries is a finite group because a theorem by Frucht (1938) showed that any finite group is isomorphic to the automorphism group of a graph.

Create account to get full access

Overview

This paper presents a new approach to constructing neural networks that are equivariant to the automorphism group of a graph, rather than just the symmetric group.
The authors show how to characterize the learnable, linear, automorphism group-equivariant functions between layers of the neural network.
This has important implications for learning from data whose group of symmetries is a finite group, as any finite group can be represented as the automorphism group of a graph.

Plain English Explanation

Neural networks are a type of machine learning model that can be used to learn from data that has certain symmetries, such as data that lives on a graph. Typically, these neural networks are designed to be equivariant to the symmetric group, which represents all possible permutations of the vertices in the graph.

However, the authors of this paper argue that this approach does not fully capture the relationships between the vertices in the graph. Instead, they propose constructing neural networks that are equivariant to the automorphism group of the graph, which is the group of symmetries that preserve the structure of the graph.

To do this, the authors provide a full characterization of the learnable, linear, automorphism group-equivariant functions that can be used as the layer functions in the neural network. This means they have identified a set of matrices that can be used to transform the data in a way that is equivariant to the automorphism group of the graph.

This result is important because it allows for the construction of neural networks that can better capture the underlying structure of the data, which can lead to improved performance on tasks like graph classification or graph generation. Additionally, the authors note that any finite group can be represented as the automorphism group of a graph, so their results have broad implications for learning from data with various symmetries.

Technical Explanation

The key contribution of this paper is the characterization of the learnable, linear, Aut(G)-equivariant functions between layers of a neural network, where G is a graph and Aut(G) is its automorphism group.

The authors show that these layer functions can be expressed as a linear combination of a set of basis matrices, which they derive. This basis captures the full space of learnable, linear, Aut(G)-equivariant functions.

To achieve this, the authors first establish that the space of Aut(G)-equivariant functions between tensor powers of R^n (where n is the number of vertices in the graph) forms a matrix algebra. They then characterize the structure of this matrix algebra, finding a spanning set of matrices that can be used to parameterize any Aut(G)-equivariant function.

This result is significant because it provides a principled way to construct neural networks that are tailored to the symmetries of the data, rather than relying on the more generic symmetric group. The authors note that this is particularly important for learning from data whose group of symmetries is a finite group, as any finite group can be realized as the automorphism group of some graph.

Critical Analysis

The authors have provided a rigorous mathematical analysis of Aut(G)-equivariant neural networks, but there are a few potential limitations and areas for further research:

The characterization of the Aut(G)-equivariant layer functions assumes linearity, which may not capture all the relevant transformations needed for complex real-world data. Extending this work to nonlinear equivariant functions could be an important next step.
The authors focus on the theoretical characterization of the layer functions, but do not provide extensive experimental validation of the performance of Aut(G)-equivariant neural networks in practice. Empirical studies on real-world tasks would help demonstrate the practical benefits of this approach.
The reliance on the automorphism group Aut(G) may be challenging in cases where the structure of the graph is not fully known or easily captured. Approaches that can handle uncertainty in the graph structure could be an interesting direction for future research.

Overall, this paper provides a solid theoretical foundation for constructing neural networks that are tailored to the symmetries of graph-structured data. Further work is needed to explore the practical implications and potential limitations of this approach.

Conclusion

This paper presents a new approach to constructing neural networks that are equivariant to the automorphism group of a graph, rather than just the symmetric group. The authors provide a full characterization of the learnable, linear, automorphism group-equivariant functions that can be used as the layer functions in these neural networks.

This result is significant because it allows for the construction of neural networks that can better capture the underlying structure of graph-structured data, which can lead to improved performance on tasks like graph classification and generation. Additionally, the authors note that any finite group can be represented as the automorphism group of a graph, so their findings have broad implications for learning from data with various symmetries.

While there are some potential limitations and areas for further research, this paper lays an important theoretical foundation for the development of more effective and principled neural network architectures for graph-structured data.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

Permutation-equivariant quantum convolutional neural networks

Sreetama Das, Filippo Caruso

The Symmetric group $S_{n}$ manifests itself in large classes of quantum systems as the invariance of certain characteristics of a quantum state with respect to permuting the qubits. The subgroups of $S_{n}$ arise, among many other contexts, to describe label symmetry of classical images with respect to spatial transformations, e.g. reflection or rotation. Equipped with the formalism of geometric quantum machine learning, in this work we propose the architectures of equivariant quantum convolutional neural networks (EQCNNs) adherent to $S_{n}$ and its subgroups. We demonstrate that a careful choice of pixel-to-qubit embedding order can facilitate easy construction of EQCNNs for small subgroups of $S_{n}$. Our novel EQCNN architecture corresponding to the full permutation group $S_{n}$ is built by applying all possible QCNNs with equal probability, which can also be conceptualized as a dropout strategy in quantum neural networks. For subgroups of $S_{n}$, our numerical results using MNIST datasets show better classification accuracy than non-equivariant QCNNs. The $S_{n}$-equivariant QCNN architecture shows significantly improved training and test performance than non-equivariant QCNN for classification of connected and non-connected graphs. When trained with sufficiently large number of data, the $S_{n}$-equivariant QCNN shows better average performance compared to $S_{n}$-equivariant QNN . These results contribute towards building powerful quantum machine learning architectures in permutation-symmetric systems.

4/30/2024

cs.AI cs.CV cs.LG

🧠

Unifying O(3) Equivariant Neural Networks Design with Tensor-Network Formalism

Zimu Li, Zihan Pengmei, Han Zheng, Erik Thiede, Junyu Liu, Risi Kondor

Many learning tasks, including learning potential energy surfaces from ab initio calculations, involve global spatial symmetries and permutational symmetry between atoms or general particles. Equivariant graph neural networks are a standard approach to such problems, with one of the most successful methods employing tensor products between various tensors that transform under the spatial group. However, as the number of different tensors and the complexity of relationships between them increase, maintaining parsimony and equivariance becomes increasingly challenging. In this paper, we propose using fusion diagrams, a technique widely employed in simulating SU($2$)-symmetric quantum many-body problems, to design new equivariant components for equivariant neural networks. This results in a diagrammatic approach to constructing novel neural network architectures. When applied to particles within a given local neighborhood, the resulting components, which we term fusion blocks, serve as universal approximators of any continuous equivariant function defined in the neighborhood. We incorporate a fusion block into pre-existing equivariant architectures (Cormorant and MACE), leading to improved performance with fewer parameters on a range of challenging chemical problems. Furthermore, we apply group-equivariant neural networks to study non-adiabatic molecular dynamics of stilbene cis-trans isomerization. Our approach, which combines tensor networks with equivariant neural networks, suggests a potentially fruitful direction for designing more expressive equivariant neural networks.

5/24/2024

cs.LG cs.AI stat.ML

🧠

Theory for Equivariant Quantum Neural Networks

Quynh T. Nguyen, Louis Schatzki, Paolo Braccia, Michael Ragone, Patrick J. Coles, Frederic Sauvage, Martin Larocca, M. Cerezo

Quantum neural network architectures that have little-to-no inductive biases are known to face trainability and generalization issues. Inspired by a similar problem, recent breakthroughs in machine learning address this challenge by creating models encoding the symmetries of the learning task. This is materialized through the usage of equivariant neural networks whose action commutes with that of the symmetry. In this work, we import these ideas to the quantum realm by presenting a comprehensive theoretical framework to design equivariant quantum neural networks (EQNN) for essentially any relevant symmetry group. We develop multiple methods to construct equivariant layers for EQNNs and analyze their advantages and drawbacks. Our methods can find unitary or general equivariant quantum channels efficiently even when the symmetry group is exponentially large or continuous. As a special implementation, we show how standard quantum convolutional neural networks (QCNN) can be generalized to group-equivariant QCNNs where both the convolution and pooling layers are equivariant to the symmetry group. We then numerically demonstrate the effectiveness of a SU(2)-equivariant QCNN over symmetry-agnostic QCNN on a classification task of phases of matter in the bond-alternating Heisenberg model. Our framework can be readily applied to virtually all areas of quantum machine learning. Lastly, we discuss about how symmetry-informed models such as EQNNs provide hopes to alleviate central challenges such as barren plateaus, poor local minima, and sample complexity.

5/14/2024

cs.LG stat.ML

Scale Equivariant Graph Metanetworks

Ioannis Kalogeropoulos, Giorgos Bouritsas, Yannis Panagakis

This paper pertains to an emerging machine learning paradigm: learning higher-order functions, i.e. functions whose inputs are functions themselves, $textit{particularly when these inputs are Neural Networks (NNs)}$. With the growing interest in architectures that process NNs, a recurring design principle has permeated the field: adhering to the permutation symmetries arising from the connectionist structure of NNs. $textit{However, are these the sole symmetries present in NN parameterizations}$? Zooming into most practical activation functions (e.g. sine, ReLU, tanh) answers this question negatively and gives rise to intriguing new symmetries, which we collectively refer to as $textit{scaling symmetries}$, that is, non-zero scalar multiplications and divisions of weights and biases. In this work, we propose $textit{Scale Equivariant Graph MetaNetworks - ScaleGMNs}$, a framework that adapts the Graph Metanetwork (message-passing) paradigm by incorporating scaling symmetries and thus rendering neuron and edge representations equivariant to valid scalings. We introduce novel building blocks, of independent technical interest, that allow for equivariance or invariance with respect to individual scalar multipliers or their product and use them in all components of ScaleGMN. Furthermore, we prove that, under certain expressivity conditions, ScaleGMN can simulate the forward and backward pass of any input feedforward neural network. Experimental results demonstrate that our method advances the state-of-the-art performance for several datasets and activation functions, highlighting the power of scaling symmetries as an inductive bias for NN processing.

6/18/2024

cs.LG