Towards Understanding the Expressive Power of GNNs with Global Readout

arXiv cs.LG / 4/28/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper analyzes the expressive power of message-passing aggregate-combine-readout graph neural networks (ACR-GNNs), focusing on which first-order (FO) properties they can represent.
  • It shows that using sum aggregation together with global readout enables ACR-GNNs to capture FO properties that cannot be expressed in logical framework C2, strengthening earlier 2026 results with specialized aggregation/readout.
  • The authors propose two approaches to restore a tight logical characterisation relative to C2: restricting local aggregation (while keeping global readout unrestricted) or bounding graph degree (while allowing unbounded graph size).
  • Under either restriction, the FO properties captured by ACR-GNNs correspond exactly to those definable in graded modal logic with global counting modalities, clarifying how unbounded interaction between aggregation and readout expands beyond C2.
  • Overall, the work provides both lower and upper bounds on how C2 fragments relate to GNN expressiveness, framing C2 as a baseline that becomes insufficient when aggregation and global readout interact without limits.

Abstract

We study the expressive power of message-passing aggregate-combine-readout graph neural networks (ACR-GNNs). Particularly, we focus on the first-order (FO) properties expressible by this formalism. While a tight logical characterisation remains a difficult open question, we make two contributions towards answering it. First, we show that sum aggregation and readout suffice for GNNs to capture FO properties that cannot be expressed in the logic C2 on both directed and undirected graphs. This strengthens known results by Hauke and Wa{\l}{\k e}ga (2026) where aggregation and readout functions are specially crafted for the task. Second, we identify two natural ways of restoring characterisability (with regard to C2) for ACR-GNNs. One option is to limit local aggregation (without imposing restrictions on global readout), whilst the second is to run ACR-GNNs over graphs of bounded degree (but unbounded size). In both cases, the FO properties captured by GNNs are exactly those definable by a formula in graded modal logic with global counting modalities. Our results thus establish an innate lower- and upper-bound in terms of how far (fragments of) C2 can be taken to characterise GNNs, and imply that is indeed the unbounded interaction of aggregation and readout that pushes the logical expressive power of GNNs above C2.