Signals – finding the most informative agent traces without LLM judges (arxiv.org)

Reddit r/LocalLLaMA / 4/5/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisTools & Practical UsageModels & Research

Key Points

  • Katanemo Labs (DigitalOcean company) introduces “Signals,” a method for automatically extracting structured indicators from agent interactions to prioritize the most informative agent traces for review.
  • The approach avoids LLM judges and extra costly inspection steps by computing “signals” directly from live trajectories, with the claim that it does not require a GPU.
  • Signals are organized into a taxonomy spanning interaction, execution, and environment patterns, covering issues such as misalignment, stagnation, disengagement, failure, looping, and exhaustion.
  • In an annotation study on τ-bench, signal-based sampling achieved 82% informativeness versus 54% for random sampling, yielding a 1.52× efficiency gain per informative trajectory.
  • The work includes a public implementation in the referenced GitHub project, enabling practitioners to apply the signal extraction to their agent trace workflows.
Signals – finding the most informative agent traces without LLM judges (arxiv.org)

Hello Peeps Salman, Shuguang and Adil here from Katanemo Labs (a DigitalOcean company).

Wanted to introduce our latest research on agentic systems called Signals. If you've been building agents, you've probably noticed that there are far too many agent traces/trajectories to review one by one, and using humans or extra LLM calls to inspect all of them gets expensive really fast. The paper proposes a lightweight way to compute structured “signals” from live agent interactions so you can surface the trajectories most worth looking at, without changing the agent’s online behavior. Computing Signals doesn't require a GPU.

Signals are grouped into a simple taxonomy across interaction, execution, and environment patterns, including things like misalignment, stagnation, disengagement, failure, looping, and exhaustion. In an annotation study on τ-bench, signal-based sampling reached an 82% informativeness rate versus 54% for random sampling, which translated to a 1.52x efficiency gain per informative trajectory.

Paper: arXiv 2604.00356. https://arxiv.org/abs/2604.00356
Project where Signals are already implemented: https://github.com/katanemo/plano

Happy to answer questions on the taxonomy, implementation details, or where this breaks down.

submitted by /u/AdditionalWeb107
[link] [comments]