ARC-AGI-2 Technical Report

arXiv cs.CL / 3/10/2026

Ideas & Deep AnalysisModels & Research

Key Points

  • The ARC-AGI-2 system advances performance on the Abstraction and Reasoning Corpus (ARC) by combining neural inference with structure-aware priors and online task adaptation.
  • It reformulates ARC reasoning as a sequence modeling problem using a compact 125-token task encoding processed by a modified LongT5 transformer architecture.
  • A novel data augmentation framework enforcing invariance through group symmetries, grid traversals, and automata perturbations broadens the hypothesis space for reasoning.
  • The system employs test-time training with lightweight LoRA adaptation to specialize on unseen tasks by learning transformation logic from demonstrations.
  • A symmetry-aware decoding and scoring mechanism aggregates likelihoods from augmented task views to achieve multi-perspective reasoning, resulting in performance surpassing previous neural ARC solvers and approaching human-level generalization.

Computer Science > Computation and Language

arXiv:2603.06590 (cs)
[Submitted on 4 Feb 2026]

Title:ARC-AGI-2 Technical Report

View a PDF of the paper titled ARC-AGI-2 Technical Report, by Wallyson Lemes de Oliveira and 12 other authors
View PDF HTML (experimental)
Abstract:The Abstraction and Reasoning Corpus (ARC) is designed to assess generalization beyond pattern matching, requiring models to infer symbolic rules from very few examples. In this work, we present a transformer-based system that advances ARC performance by combining neural inference with structure-aware priors and online task adaptation. Our approach is built on four key ideas. First, we reformulate ARC reasoning as a sequence modeling problem using a compact task encoding with only 125 tokens, enabling efficient long-context processing with a modified LongT5 architecture. Second, we introduce a principled augmentation framework based on group symmetries, grid traversals, and automata perturbations, enforcing invariance to representation changes. Third, we apply test-time training (TTT) with lightweight LoRA adaptation, allowing the model to specialize to each unseen task by learning its transformation logic from demonstrations. Fourth, we design a symmetry-aware decoding and scoring pipeline that aggregates likelihoods across augmented task views, effectively performing ``multi-perspective reasoning'' over candidate solutions. We demonstrate that these components work synergistically: augmentations expand hypothesis space, TTT sharpens local reasoning, and symmetry-based scoring improves solution consistency. Our final system achieves a significant improvement over transformer baselines and surpasses prior neural ARC solvers, closing the gap toward human-level generalization.
Comments:
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as: arXiv:2603.06590 [cs.CL]
  (or arXiv:2603.06590v1 [cs.CL] for this version)
  https://doi.org/10.48550/arXiv.2603.06590
Focus to learn more
arXiv-issued DOI via DataCite

Submission history

From: Matteo Caorsi [view email]
[v1] Wed, 4 Feb 2026 10:03:56 UTC (11,173 KB)
Full-text links:

Access Paper:

Current browse context:
cs.CL
< prev   |   next >
Change to browse by:

References & Citations

export BibTeX citation Loading...

BibTeX formatted citation

×
Data provided by:

Bookmark

BibSonomy logo Reddit logo
Bibliographic Tools

Bibliographic and Citation Tools

Bibliographic Explorer Toggle
Bibliographic Explorer (What is the Explorer?)
Connected Papers Toggle
Connected Papers (What is Connected Papers?)
Litmaps Toggle
Litmaps (What is Litmaps?)
scite.ai Toggle
scite Smart Citations (What are Smart Citations?)
Code, Data, Media

Code, Data and Media Associated with this Article

alphaXiv Toggle
alphaXiv (What is alphaXiv?)
Links to Code Toggle
CatalyzeX Code Finder for Papers (What is CatalyzeX?)
DagsHub Toggle
DagsHub (What is DagsHub?)
GotitPub Toggle
Gotit.pub (What is GotitPub?)
Huggingface Toggle
Hugging Face (What is Huggingface?)
Links to Code Toggle
Papers with Code (What is Papers with Code?)
ScienceCast Toggle
ScienceCast (What is ScienceCast?)
Demos

Demos

Replicate Toggle
Replicate (What is Replicate?)
Spaces Toggle
Hugging Face Spaces (What is Spaces?)
Spaces Toggle
TXYZ.AI (What is TXYZ.AI?)
Related Papers

Recommenders and Search Tools

Link to Influence Flower
Influence Flower (What are Influence Flowers?)
Core recommender toggle
CORE Recommender (What is CORE?)
About arXivLabs

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.