I built Arc Sentry, a pre-generation guardrail for open-source LLMs that blocks prompt injection before the model generates a response. It works on Mistral, Qwen, and Llama by reading the residual stream rather than filtering outputs.
Prompt injection is #1 on the OWASP Top 10 for LLM Applications. Most defenses scan outputs or text patterns, so by the time they fire, the model has already processed the attack. Arc Sentry blocks before generate() is called.
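To make the "block before generate()" idea concrete, here is a minimal sketch of a generic pre-generation gate. It is not Arc Sentry's actual code. It assumes you already have some function that turns a prompt into a hidden-state vector (for example, a mean-pooled residual-stream activation from one layer), and it scores new prompts by cosine distance to the centroid of known-normal prompts. The names `PreGenerationGate`, `encode`, `generate`, and `margin` are all hypothetical, and the scoring rule is an illustrative stand-in.

```python
import math
from typing import Callable, List, Optional, Sequence

Vector = Sequence[float]


def mean_vector(vectors: List[Vector]) -> List[float]:
    """Element-wise mean of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine_distance(a: Vector, b: Vector) -> float:
    """1 - cosine similarity; 0.0 means the vectors point the same way."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm if norm else 1.0


class PreGenerationGate:
    """Hypothetical gate: scores a prompt's hidden state before generation."""

    def __init__(
        self,
        encode: Callable[[str], Vector],    # assumed: prompt -> hidden-state vector
        generate: Callable[[str], str],     # assumed: wraps the model's generate()
        normal_prompts: List[str],          # a handful of known-benign messages
        margin: float = 1.5,                # assumed slack over the worst normal score
    ) -> None:
        self.encode = encode
        self.generate = generate
        states = [encode(p) for p in normal_prompts]
        self.centroid = mean_vector(states)
        worst = max(cosine_distance(s, self.centroid) for s in states)
        # Small floor so identical calibration prompts do not give a zero threshold.
        self.threshold = max(worst * margin, 1e-6)

    def respond(self, prompt: str) -> Optional[str]:
        """Return the model's reply, or None if the prompt is blocked."""
        score = cosine_distance(self.encode(prompt), self.centroid)
        if score > self.threshold:
            return None  # blocked: generate() is never called
        return self.generate(prompt)
```

The point of the sketch is the control flow: the check runs on an internal representation of the prompt, and generate() is only reached if the check passes. A real detector would need a much more careful choice of layer, pooling, and threshold than this.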
I want to test it on real deployments, so I’m offering 5 free security audits this week.
What I need from you:
• Your system prompt or a description of what your bot does
• 5-10 examples of normal user messages
What you get back within 24 hours:
• Your bot tested against JailbreakBench and Garak attack prompts
• Full report showing what got blocked and what didn’t
• Honest assessment of where it works and where it doesn’t
No call. Email only. 9hannahnine@gmail.com
If it’s useful after seeing the results, it’s $199/month to deploy.