T-MAP: Red-Teaming LLM Agents with Trajectory-aware Evolutionary Search
arXiv cs.AI / 3/25/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper argues that traditional LLM red-teaming methods miss agent-specific vulnerabilities that only appear during multi-step tool use, especially in tool ecosystems like Model Context Protocol (MCP).
- It introduces T-MAP, a trajectory-aware evolutionary search technique that uses execution trajectories to systematically generate adversarial prompts and attack paths.
- T-MAP can automatically produce attacks that bypass safety guardrails while still achieving harmful objectives through real tool interactions, not just harmful text.
- Experiments across multiple MCP environments show T-MAP significantly improves attack realization rate (ARR) versus baselines and remains effective against multiple frontier models, including GPT-5.2, Gemini-3-Pro, Qwen3.5, and GLM-5.
- The findings suggest autonomous LLM agents have underexplored security weaknesses tied to tool-execution trajectories and agent behavior over time.
Related Articles
Santa Augmentcode Intent Ep.6
Dev.to

Your Agent Hired Another Agent. The Output Was Garbage. The Money's Gone.
Dev.to
ClawRouter vs TeamoRouter: one requires a crypto wallet, one doesn't
Dev.to
Big Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.
Dev.to

Palantir’s billionaire CEO says only two kinds of people will succeed in the AI era: trade workers — ‘or you’re neurodivergent’
Reddit r/artificial