Trace-Level Analysis of Information Contamination in Multi-Agent Systems
arXiv cs.AI / 5/1/2026
📰 NewsIdeas & Deep AnalysisModels & Research
Key Points
- The paper studies how uncertainty in multi-agent workflows that reason over heterogeneous artifacts (e.g., PDFs, spreadsheets, slide decks) can “contaminate” agent behavior by altering decomposition and routing decisions.
- It treats uncertainty as a controllable variable by injecting structured perturbations into artifact-derived representations and then running fixed workflows with comprehensive logging to measure contamination via trace divergence.
- Across 614 paired runs on 32 GAIA tasks using three language models, the authors find a decoupling where agent traces can diverge substantially while still recovering correct answers, or remain similar while yielding incorrect outputs.
- The work identifies three contamination manifestation types—silent semantic corruption, behavioral detours with recovery, and combined structural disruption—and links each to distinct control-flow signatures.
- The study also evaluates operational costs and explains why common verification guardrails often fail, while contributing a taxonomy and a trace-based framework for detecting and localizing contamination across agent interactions.
💡 Insights using this article
This article is featured in our daily AI news digest — key takeaways and action items at a glance.
Related Articles

Why Autonomous Coding Agents Keep Failing — And What Actually Works
Dev.to

Why Enterprise AI Pilots Fail
Dev.to

The PDF Feature Nobody Asked For (That I Use Every Day)
Dev.to

How to Fix OpenClaw Tool Calling Issues
Dev.to

Mistral's new flagship Medium 3.5 folds chat, reasoning, and code into one model
THE DECODER