The AI Firewall
Detect unsafe AI-generated code at runtime — prompt injections, backdoors, and leaks — from Claude Code to ChatGPT.
AI Code Threats
Stopped at Runtime
Every AI-generated output is an attack vector — our firewall blocks prompt injections, leaks, and backdoors before they reach your systems or models.
Prompt Injections
Attackers manipulate AI behavior through crafted inputs, leading to unsafe code execution and hijacked system prompts
Data Leaks
Sensitive data like API keys, credentials, and PII leak into AI-generated code outputs or responses, exposing your systems
Backdoors
AI generates malicious code patterns — hidden backdoors that create vulnerabilities or exploits inside your applications
SuperagentLM —
The Defender Model
The firewall is powered by SuperagentLM, a fine-tuned small language model built to defend against unsafe AI-generated code that LLMs can't stop.
Deployment Options
Choose hosted or self-hosted — either way, unsafe AI-generated code never passes unchecked.
Get Started in Seconds
Drop-in security layer for OpenAI, Anthropic, Gemini, and more
from openai import OpenAI

client = OpenAI(
    base_url="YOUR_FIREWALL_LINK_HERE",  # Replace with your firewall link
    api_key="your-openai-api-key"
)

response = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello, world!"}]
)
Works seamlessly with all your favorite tools and SDKs:
Frequently Asked Questions
Everything you need to know about Superagent AI Firewall
What is Superagent AI Firewall?
Superagent is an AI firewall that provides reasoning-driven runtime protection against unsafe AI-generated code. It blocks prompt injections, backdoors, and data leaks in real time with sub-50ms decision making — powered by SuperagentLM.
What threats does the AI Firewall protect against?
The AI Firewall protects against three core AI threats in generated code: prompt injections (leading to unsafe commands), data leaks (sensitive information exposure), and backdoors (malicious code hidden in outputs).
How fast is the protection?
The AI Firewall provides sub-50ms decisions with full reasoning. Our SuperagentLM model is fine-tuned specifically for threat detection, ensuring lightning-fast protection without compromising your AI application's performance.
Is the AI Firewall open source?
Yes, the AI Firewall is open source and released under the MIT License. You can find the code on GitHub and contribute to the project. We believe in transparent security.
Does it only work with the AI apps I build?
No — the AI Firewall protects all AI traffic that passes through it. It is built for AI-generated code scenarios like coding agents (Claude Code, Cursor) and CI/CD pipelines, but it also works with chat assistants like ChatGPT and any third-party AI apps.
What deployment options are available?
The AI Firewall offers both hosted and self-hosted deployment options. The hosted solution is managed with no maintenance required, while self-hosted provides full control and data ownership for enterprise requirements.
How does SuperagentLM detect threats?
SuperagentLM is a fine-tuned small language model that reasons about every request in real time. Unlike static filters, it understands context and intent, allowing it to catch novel unsafe code patterns and sophisticated threats that rule-based systems miss.
How do I integrate the AI Firewall?
Integration is simple: just swap your API URL to route through the AI Firewall. For example, change your OpenAI endpoint from api.openai.com to your Superagent firewall URL. No refactoring is required: add protection with a single change and get started in seconds.
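To illustrate (the firewall URL below is a placeholder), the only change in a typical OpenAI integration is the base URL; everything else stays the same:

from openai import OpenAI

# Before: requests go straight to api.openai.com (the SDK's default base URL)
# client = OpenAI(api_key="your-openai-api-key")

# After: the same requests are routed through the firewall for inspection
client = OpenAI(
    base_url="YOUR_FIREWALL_LINK_HERE",  # your Superagent firewall URL
    api_key="your-openai-api-key"
)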
Your AI has no defenses — until now
Get protection from unsafe AI-generated code in under a minute. No code changes required.