Sesame full stack voice advantage
Sesame AI
Building the model and the device together lets Sesame get paid at every layer of the stack, and it makes the product harder to copy with an API wrapper alone. Sesame can sell subscriptions to consumers, hosted inference and fine tuning to enterprises, and eventually smart glasses that put its voice model in a dedicated always available form factor. That combination mirrors the strongest wearable businesses, where hardware brings users in and software keeps monetizing them over time.
-
Pure software voice companies mostly monetize one layer, usually API usage or subscriptions. Sesame adds device revenue on top, while using its own model to tune latency, emotion, and on device performance for glasses, which is difficult for a software only rival to match end to end.
-
The closest lesson from wearables is Oura, where ring sales and membership reinforce each other. In 2025, Oura reached an estimated $1B in revenue with about 80% from hardware and 20% from subscriptions, showing how a device can become both customer acquisition and recurring software monetization.
-
The flip side is that hardware only works when it creates a workflow software cannot. Friend sold a pendant with no subscription, and Limitless paired software with a pendant, but both faced pressure from phones, earbuds, and glasses controlled by bigger platforms. That raises the bar for Sesame to make audio first glasses feel meaningfully better than talking to ChatGPT on a phone.
Going forward, the winners in voice AI are likely to look less like standalone model vendors and more like full stack interaction companies. If Sesame can turn its model advantage into a daily wearable habit, it can own both the conversation engine and the surface where that conversation happens, which is where the most durable economics in this category will sit.