ShieldNet: Network-Level Guardrails against Emerging Supply-Chain Injections in Agentic Systems

arXiv cs.AI / 4/7/2026

💬 OpinionDeveloper Stack & InfrastructureIdeas & Deep AnalysisModels & Research

共有:

Key Points

The paper argues that LLM agent security is expanding beyond prompt injection to supply-chain threats, where malicious behaviors are hidden inside third-party tools or MCP servers that agents call during execution.
It introduces SC-Inject-Bench, a new large-scale benchmark of 10,000+ malicious MCP tools organized by a taxonomy of 25+ supply-chain attack types mapped to MITRE ATT&CK.
The authors report that existing MCP scanners and semantic guardrails underperform on this new benchmark, motivating the need for defenses that go deeper than tool traces.
They propose ShieldNet, a network-level guardrail framework using a MITM proxy plus an event extractor and lightweight classifier to detect supply-chain poisoning by analyzing real network interactions.
Experiments indicate ShieldNet can reach up to 0.995 F1 with about 0.8% false positives while adding little runtime overhead, outperforming prior scanners and LLM-based guardrails.

Abstract

Existing research on LLM agent security mainly focuses on prompt injection and unsafe input/output behaviors. However, as agents increasingly rely on third-party tools and MCP servers, a new class of supply-chain threats has emerged, where malicious behaviors are embedded in seemingly benign tools, silently hijacking agent execution, leaking sensitive data, or triggering unauthorized actions. Despite their growing impact, there is currently no comprehensive benchmark for evaluating such threats. To bridge this gap, we introduce SC-Inject-Bench, a large-scale benchmark comprising over 10,000 malicious MCP tools grounded in a taxonomy of 25+ attack types derived from MITRE ATT&CK targeting supply-chain threats. We observe that existing MCP scanners and semantic guardrails perform poorly on this benchmark. Motivated by this finding, we propose ShieldNet, a network-level guardrail framework that detects supply-chain poisoning by observing real network interactions rather than surface-level tool traces. ShieldNet integrates a man-in-the-middle (MITM) proxy and an event extractor to identify critical network behaviors, which are then processed by a lightweight classifier for attack detection. Extensive experiments show that ShieldNet achieves strong detection performance (up to 0.995 F-1 with only 0.8% false positives) while introducing little runtime overhead, substantially outperforming existing MCP scanners and LLM-based guardrails.

Fully Automated Website 2026-04-11: The Scoreboard — Visual Judge Score Comparison on the Homepage

Dev.to

Human-Aligned Decision Transformers for satellite anomaly response operations with ethical auditability baked in

Dev.to

That Smoking-Gun Video? It's Not Evidence. It's a Suspect.

Dev.to

AI Citation Registries and Website-Based Publishing Constraints

Dev.to

Amazon S3 Files: The End of the Object vs. File War (And Why It Matters in the AI Agent Era)

Dev.to

ShieldNet: Network-Level Guardrails against Emerging Supply-Chain Injections in Agentic Systems

Key Points

Abstract

Related Articles

Fully Automated Website 2026-04-11: The Scoreboard — Visual Judge Score Comparison on the Homepage

Human-Aligned Decision Transformers for satellite anomaly response operations with ethical auditability baked in

That Smoking-Gun Video? It's Not Evidence. It's a Suspect.

AI Citation Registries and Website-Based Publishing Constraints

Amazon S3 Files: The End of the Object vs. File War (And Why It Matters in the AI Agent Era)

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer

Key Points

Abstract

Related Articles

Fully Automated Website 2026-04-11: **The Scoreboard — Visual Judge Score Comparison on the Homepage**

Human-Aligned Decision Transformers for satellite anomaly response operations with ethical auditability baked in

That Smoking-Gun Video? It's Not Evidence. It's a Suspect.

AI Citation Registries and Website-Based Publishing Constraints

Amazon S3 Files: The End of the Object vs. File War (And Why It Matters in the AI Agent Era)

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer

Fully Automated Website 2026-04-11: The Scoreboard — Visual Judge Score Comparison on the Homepage