Consolidation Around Trusted AI Gateways
3T+ token/day Coinbase of the inference economy
Security is turning model routing from a cheapest path problem into a trust market. When routing sits in front of every prompt, every API key, and often sensitive logs, the gateway that stays online, patches fast, and gives enterprises one managed control point can win share even if rivals are cheaper or more flexible. That is the same pattern that favored Coinbase in retail crypto, where custody and reliability mattered more than squeezing out the last bit of yield.
-
The LiteLLM incident showed how much blast radius a gateway can have. Mercor said its breach was tied to the compromised open source LiteLLM project, and Mercor was already operating at a $1B annualized revenue run rate by February 2026. That makes security failures at the routing layer look less like bugs and more like existential vendor risk.
-
The field is splitting between managed and self managed approaches. Vercel moved AI Gateway into general availability in August 2025 as a hosted routing layer, while Merge launched Gateway in April 2026 as a control plane for production AI. Both are selling a simpler answer for teams that do not want to own proxy security, logging, and failover themselves.
-
OpenRouter is already behaving like this category leader. It aggregates 400 plus models through one API, with estimated revenue rising from $1M at the end of 2024 to $50M by March 2026. In practice that means developers can centralize billing, fallback, usage analytics, and model switching in one place, which gets stickier as governance and security reviews get harder.
The next phase of the market favors gateways that become the safe default for production traffic. As more AI apps pass real customer data and larger budgets through a single routing layer, scale will concentrate around a small number of providers that combine broad model access with managed security, compliance controls, and operational reliability.