Kong Gateway as Enterprise AI Tollgate
Diving deeper into
Augusto Marietti, CEO of Kong, on the end of tokenmaxxing
the vast majority of that growth is enterprises using us as a tollgate to unlock AI internally without things going wrong
Analyzed 9 sources
Reviewing context
This shows Kong’s AI gateway is landing first as internal control infrastructure, not as a feature layer for customer apps. Big companies are putting a checkpoint in front of employee AI usage so one system can decide which model to call, strip sensitive data, cache repeat prompts, and enforce limits before thousands of workers start sending expensive or risky traffic across many LLMs.
-
The workflow is very concrete. Instead of each team wiring directly into OpenAI, Anthropic, Azure OpenAI, or an internal model, requests pass through Kong first, where semantic routing, prompt guards, and semantic caching can choose a cheaper model, block unsafe prompts, or serve a cached answer.
-
This is a different market from OpenRouter, Cloudflare, or Vercel. OpenRouter is built around public model price arbitrage and takes a brokerage style cut, while Cloudflare and Vercel simplify external model access through one API and billing layer. Kong is selling governance inside the firewall, where the buyer is the CIO, CISO, or platform team.
-
Security is part of why this internal use case is moving first. A March 24, 2026 supply chain compromise hit LiteLLM versions 1.82.7 and 1.82.8, reinforcing the case for enterprises to standardize on a hardened gateway instead of letting many teams run lightweight open source middleware on their own.
The next step is that this gateway stops being just a traffic cop and becomes the default operating layer for enterprise agents. As companies connect more internal APIs, MCP servers, and models behind one governed control point, the winner is likely to own the routing, policy, and billing layer that every serious internal AI workflow passes through.