AI Avatars as Infrastructure Play
Chris Savage, CEO of Wistia, on the economics of AI avatars
This points to AI avatars becoming a cheap input, not a durable product category. Once avatar generation can be bought through an API, the value shifts up the stack to the software that owns the workflow, where teams script videos, publish them, host them, track views, and connect those videos to training, sales, or marketing systems. That is why incumbents like Wistia can treat avatars as one feature inside a broader video product, while standalone avatar vendors face pressure to keep cutting price and compute cost.
-
The pattern matches older video infrastructure markets. Hosting and delivery started as expensive, specialized services, then competition pushed them toward utility economics, where scale and lower cost won. AI avatar rendering is following the same path as model quality improves and more vendors offer similar output.
-
The winners are already moving away from a pure generation tool. Synthesia has expanded into hosting, analytics, lead capture, and publishing, while HeyGen pairs a self serve studio with developer APIs. That is what companies do when the core generation layer is no longer enough to defend margins.
-
For a platform like Wistia, commoditized avatars are helpful. If the underlying avatar API gets cheaper, Wistia can add talking head creation to its existing video hosting and analytics product without needing to invent the base model, and can capture value through subscription software rather than raw rendering volume.
The market is heading toward a small set of scaled avatar and model providers underneath a larger set of application companies on top. The durable businesses will look less like standalone avatar factories and more like full video workflow systems, where avatar generation is bundled beside editing, distribution, analytics, and collaboration.