Probably Approximately Consensus: On the Learning Theory of Finding Common Ground

arXiv cs.LG / 4/24/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper models “consensus” as an interval in a one-dimensional opinion space, aiming to capture not only users’ expressed statements but also the relative importance (salience) of different topics.
  • It derives the low-dimensional opinion space from potentially high-dimensional data using embedding and dimensionality reduction, then defines an objective that maximizes expected agreement over a distribution of issues.
  • The authors propose an efficient Empirical Risk Minimization (ERM) algorithm and provide PAC-learning theoretical guarantees for learning an optimal consensus region.
  • Initial experiments evaluate the proposed method and explore faster ways to identify optimal consensus regions, showing that selectively asking users based on an existing set of statements can significantly reduce the number of queries.
  • Overall, the work connects learning theory with consensus elicitation for online deliberation, offering both a principled modeling approach and learnability guarantees.

Abstract

A primary goal of online deliberation platforms is to identify ideas that are broadly agreeable to a community of users through their expressed preferences. Yet, consensus elicitation should ideally extend beyond the specific statements provided by users and should incorporate the relative salience of particular topics. We address this issue by modelling consensus as an interval in a one-dimensional opinion space derived from potentially high-dimensional data via embedding and dimensionality reduction. We define an objective that maximizes expected agreement within a hypothesis interval where the expectation is over an underlying distribution of issues, implicitly taking into account their salience. We propose an efficient Empirical Risk Minimization (ERM) algorithm and establish PAC-learning guarantees. Our initial experiments demonstrate the performance of our algorithm and examine more efficient approaches to identifying optimal consensus regions. We find that through selectively querying users on an existing sample of statements, we can reduce the number of queries needed to a practical number.