Synthesia Adopts Selfie Avatar Capture
Synthesia
HeyGen forced avatar creation from a sales led service into a product led feature race. The shift matters because custom avatars used to mean a planned studio style shoot, while selfie based capture turns it into a few minutes on a phone. Once that workflow proved good enough, Synthesia had to match it to keep personalization from becoming a reason to start with HeyGen instead, even as Synthesia stayed stronger in enterprise packaging and workflow depth.
-
The practical change is speed and who can use it. A trainer, salesperson, or founder can record a short phone video, create a reusable avatar, then generate onboarding, sales, or localization videos without booking cameras, actors, or an editing team.
-
This is what feature parity looks like in AI video. HeyGen moved first on low friction avatar capture, Synthesia matched it, and both continue to converge on the same core stack of avatars, script editing, translation, hosting, and publishing.
-
The harder moat shifts up the stack. Synthesia is larger, at about $145.9M ARR in September 2025 versus HeyGen at $95M, and leans heavily into enterprise accounts, so winning depends less on one avatar feature and more on admin controls, collaboration, integrations, and buying patterns inside large companies.
The next phase is less about who has selfie capture, and more about who turns abundant AI video into a default business workflow. As avatar creation gets standardized, competition moves toward enterprise distribution, interactive experiences, developer access, and broader video suites that keep customers creating, hosting, and measuring video in one place.