Security-first AI gateway · litellm-compatible

The AI gateway with the firewall built in, not bolted on.

LLMSecure is a private, drop-in replacement for your LLM proxy. The injection, PII and secrets firewall runs inside the gateway process — so prompts never leave your boundary, there is no separate port to route around, and you remove a CVE-prone component instead of adding another.

Private / self-hosted · prompts and responses stay in your environment · detect & mitigate with shadow mode and tunable thresholds.

A firewall in front of a vulnerable gateway is a firewall you can walk around.

The common pattern today is an LLM firewall chained in front of an open-source gateway like litellm. That leaves two gaps the chained model can't close:

Network bypass

Compromise any host on the same segment and you can talk to litellm directly — the front firewall never sees the request. SSRF inside the environment does the same.

Data leaves to scan

External guardrails (SaaS scanners) require shipping the prompt out to a third party to inspect it. For regulated and financial workloads that is often a non-starter.

Infra still exposed

Content guardrails only protect the conversation. The gateway's own SSRF / auth / injection CVEs remain — none of them are defended by a content filter.

LLMSecure collapses gateway and firewall into one hardened, private component. The firewall is inline by architecture — there is no bare-gateway port to skip — and the scanning happens in-process, so nothing is sent to a third party to be inspected.

Two pillars, one binary

The name says it: LLM Secure + Secure LLM. Securing the gateway itself, and securing the calls that pass through it.

Pillar 1

LLM Secure — the gateway's own security

AI infrastructure security.

  • Memory-safe Rust core, minimal dependencies, no Python supply chain.
  • Parameterized queries, enforced authorization, least privilege by default.
  • SSRF egress allow-listing, signed builds, SBOM, isolated CI/CD.
  • Replacing litellm removes a component with a real critical-CVE history.

An add-on firewall can't give you this — it sits outside the thing that's exposed.

Pillar 2

Secure LLM — securing every call

AI runtime security.

  • Prompt-injection and jailbreak detection.
  • PII / DLP, secrets and credential detection, response-leak mitigation.
  • Content moderation and malicious-URL checks.
  • Shadow mode + adjustable thresholds with transparent false-positive/negative reporting.

All in-process — your prompts are never forwarded to an external scanner.

Competitors bolt Pillar 2 onto a fragile Pillar 1 (litellm). We fuse both into a single hardened component — and we solve the infrastructure security that content filters ignore.

Three things only an in-process design can do

01

Inline by architecture

The firewall lives in the gateway process. There is no standalone "bare gateway" listener for an attacker to reach around to. (Honest scope: gateway-host compromise is addressed by Pillar 1 hardening, not by this claim.)

02

Scanning never leaves

Detection runs in-process on your infrastructure. Prompts and responses are not sent to any external service — including ours. The control plane only ever receives metadata and verdicts.

03

Solves infra security

By replacing the gateway rather than wrapping it, you shrink the attack surface instead of growing it — one hardened component instead of two.

How we compare

Against gateways and AI-security vendors, on the dimensions that matter for regulated deployments.

Capability LLMSecure Cloudflare Kong Palo Alto AIRS F5
Private deploy / data stays in your boundaryYesSaaSPartialPartialEnterprise-heavy
Native firewall depthYesLimitedBolt-onYes (chained)Yes (chained)
Solves infrastructure security (Pillar 1)YesNoNoNoPartial
Firewall inline / no port to route aroundIn-processNoNoChainedChained
Scanning never sent to a third partyYesNoNoNoNo
Replaces litellm (shrinks attack surface)YesNoGeneral GWAdd-onPartial
litellm-compatible drop-inYesNoNoNoNo

Comparison reflects typical deployment models as positioned by each vendor; verify against your own requirements.

Gateway parity, plus security

Gateway, aligned with litellm

  • OpenAI-compatible unified API across providers
  • Multi-provider routing with fallbacks and retries
  • Virtual keys, per-key budgets and rate limits
  • Usage metering and cost tracking
  • Streaming (SSE) responses
  • Drop-in base-URL swap — point your SDK at LLMSecure

Security, built in

  • Prompt-injection & jailbreak detection
  • PII / DLP detection and redaction
  • Secrets & credential detection
  • Response-leakage mitigation & content moderation
  • Shadow mode, tunable thresholds, transparent FP/FN reporting
  • Immutable audit trail; one-click bypass for troubleshooting

Two ways to run it

No public self-serve. Start on Pro, move to Enterprise when you're ready. Both are priced per engagement.

Pro

Private data plane, SaaS control plane — close to a SaaS experience, without the data exposure.

  • One-line install: single binary / one docker run
  • Auto-registers with the control plane, pulls policy, demo in minutes
  • Data plane stays private — control plane sees metadata & verdicts only
  • Configurable fail-open / fail-closed, one-click bypass
  • Runs disconnected (cached policy) if the control plane is unreachable
Request a Pro trial

We tell you what it does — and what it doesn't.

Detect & mitigate

Prompt-injection defense is detection and mitigation, not a guarantee. We run in shadow mode first and report false positives and negatives openly.

Data residency by default

Prompts and responses never leave your deployment. The control plane receives metadata and verdicts only; sample sharing is strictly opt-in and redacted.

Defense in depth

Inline means no network port to route around — not "unbreakable." Host hardening, least privilege and signed builds carry the rest.

Request access

Pro and Enterprise are both by application. Tell us about your setup and we'll get you into a Pro trial.

Opens your mail client — no data is captured by this page.