Turnkey Phone Assistants Threaten Vapi
Vapi
A turnkey product from OpenAI or Microsoft would shift the fight from model quality to distribution and bundling. Vapi wins today by making developers stitch together telephony, speech, models, and workflows through one API, but OpenAI already supports SIP based phone connections in its Realtime API, and Microsoft already has telephony, call automation, and AI embedded across Teams Phone and Azure. That means a big platform entrant could remove much of the setup work Vapi currently abstracts away.
-
Vapi sits in the orchestration layer. It coordinates transcriber, model, and voice components, lets customers swap providers, and charges a $0.05 per minute platform fee on top of telephony and model costs. If a platform owner bundles those layers together, that fee and much of the integration value come under pressure.
-
The closest startup comparables show the likely paths. Retell looks similar to Vapi with no code call flow tooling, while Bland pushes further down stack with its own speech and model systems for lower latency. A Microsoft or OpenAI product would likely look even more integrated than Bland, but with built in customer access through existing clouds and enterprise contracts.
-
Microsoft already has the raw pieces in market. Azure Communication Services offers telephony and call automation, including a sample that connects calling to Azure OpenAI. Teams Phone also already includes Copilot features for PSTN and VoIP calls. OpenAI similarly documents direct SIP based call routing into the Realtime API. The gap to a packaged phone agent is productization, not core capability.
This market is heading toward a split. Horizontal infrastructure will consolidate around a few bundled platforms, while independents that survive will move upward into vertical workflows, analytics, compliance, and deep system integrations. For Vapi, the long term defense is becoming the control plane for production voice operations, not just the fastest way to connect a phone call to a model.