Retell builds AI call orchestration
Retell AI at $60M/year up 650% YoY
Retell’s pivot shows that the biggest value in AI voice moved from making speech sound good to making phone calls actually work inside messy business workflows. Live dubbing was a media tool, but phone agents are an operations tool. The product now lets a developer connect speech recognition, a language model, voice output, business rules, and back office systems so an agent can verify identity, book appointments, update records, and hand off calls without long pauses.
-
The enabling change was infrastructure. Retell emerged as speech and model latency dropped enough for real conversation, with external providers like Deepgram and Cartesia making sub second turn taking possible. That let the company sell a control layer for production phone calls instead of a one off creator tool.
-
Retell sits in the same lane as Vapi, but the market is splitting in two directions. Developer platforms let teams assemble their own stack and swap models by cost or quality, while managed vendors like Sierra and Decagon sell more packaged outcomes to large enterprises that want less building and more service.
-
The best early customers make the wedge obvious. Firms like Sunshine Loans use voice agents on high volume calls where every minute has labor cost and every mistake has real business downside. In that setting, charging a platform fee on top of pass through model costs makes Retell look more like Twilio for AI calls than a classic SaaS seat product.
From here, the control layer is likely to expand upward into the rest of the contact center, including routing, testing, QA, analytics, and compliance. If Retell keeps owning the call orchestration layer while providers above and below it commoditize, it can grow from a voice API into the system companies use to run large parts of phone based customer operations.