A Comparative Evaluation of AI Agent Security Guardrails
arXiv cs.AI / 4/29/2026
💬 OpinionIdeas & Deep AnalysisModels & Research
Key Points
- The paper compares DKnownAI Guard with AWS Bedrock Guardrails, Azure Content Safety, and Lakera Guard for securing AI agents against real-world risk scenarios.
- Using human annotations as ground truth, it evaluates guardrails on two risk types: attacks targeting the agent (e.g., instruction override, indirect injection, tool abuse) and requests aimed at generating harmful content.
- Results show DKnownAI Guard achieves the top recall rate of 96.5%, indicating it most effectively catches relevant risks.
- DKnownAI Guard also leads overall on true negative rate (TNR) at 90.4%, suggesting fewer false alarms relative to competitors.
- The study concludes DKnownAI Guard delivers the best combined performance across the tested AI agent security guardrails.


