Neural networks that respect local gauge symmetry learn lattice gauge physics more directly
Researchers introduce a new kind of graph neural network that builds local gauge symmetry into its core. The network works with matrix-valued features that transform correctly under independent symmetry changes at each lattice site. By enforcing these rules at every layer, the model can learn both local and nonlocal physical quantities in systems governed by gauge symmetry without throwing away important information.
The work targets lattice gauge theory, a discrete framework used to describe fields with local symmetry. In that setting the basic variables live on links between sites and are group-valued matrices Uij. Under a local gauge change, each link is transformed as Uij → gi Uij g†j, where gi and gj are independent transformations at the two sites. The authors design message-passing on a graph so that node and edge features transform in the same way. Messages and updates are restricted to tensor operations that are compatible with these site-dependent transformations. This keeps the symmetry exact inside the network rather than enforcing it only on the final output.
At a high level, message passing in the new architecture acts like gauge-covariant transport along lattice paths. Repeated local updates let loop-like structures and extended correlations—objects such as Wilson lines and Wilson loops that physicists use to measure flux—emerge from the network’s internal representations. Because the model works directly with the link matrices instead of reducing them immediately to scalar invariants, it can represent non-Abelian effects where matrix multiplication does not commute and path dependence matters.
Why this matters: many previous machine-learning models either built in only global symmetries (like rotations) or tried to remove gauge freedom up front by computing invariant loop variables. The invariant strategy can work for Abelian (commuting) gauge groups but is incomplete for non-Abelian groups, where no finite set of local loops fully characterizes the field. By making the local symmetry part of the architecture itself, the approach aims to be more data efficient and physically consistent. The authors report validation across pure gauge, gauge–matter, and dynamical regimes, suggesting the method applies to a range of lattice-gauge problems and related quantum models.