Wispr

View PDF

Valuation

Wispr closed a $25 million Series A extension in November 2025 led by Notable Capital, bringing the company's post-money valuation to $700 million. The round included participation from Flight Fund, backed by Steven Bartlett.

Wispr previously raised a $30 million Series A in June 2025 led by Menlo Ventures, with participation from NEA and 8VC.

Wispr has raised a total of $81 million across all funding rounds.

Product

Wispr Flow operates as a system-level voice dictation tool that works across any application on Mac, Windows, and iOS. Users press a hotkey (Fn on Mac, Ctrl+Win+Alt on Windows, or long-press on iPhone), speak while holding the key, and release to instantly paste polished text wherever their cursor is positioned.

The core differentiator is the automatic editing engine that removes filler words, adds proper punctuation and capitalization, and rewrites speech fragments into coherent sentences. Audio streams to Wispr's cloud infrastructure with sub-700 millisecond latency from end-of-speech to finished text.

The platform includes a Personal Dictionary that learns proper names and jargon, syncing corrections across devices. Teams can share dictionaries for company-specific terminology and acronyms.

Command Mode enables in-place text editing through voice commands like "make this more concise" or "translate to Polish," leveraging large language models to rewrite highlighted text. The Snippet Library lets users speak short cues that expand into full boilerplate text, Calendly links, or legal paragraphs.

For developers, Wispr offers specialized "vibe coding" features that recognize camelCase, snake_case, CLI commands, and file names, making voice coding viable in IDEs like Cursor and terminals like Warp. The system supports over 100 languages with automatic detection and code-switching capabilities.

Sessions can run up to 6 minutes on desktop with warnings at 5 minutes, after which text auto-saves to an in-app History pane. Privacy controls include Zero Data Retention mode that immediately deletes transcripts after processing.

In December 2025, Wispr outlined “Wispr Actions,” a new initiative to extend Flow beyond dictation into voice-driven workflow automation. The feature set will enable Flow to perform tasks on users’ behalf—drafting replies, triggering macros, and executing actions—advancing Wispr’s voice-native computing system toward automation. Actions is framed as a major focus for the next year.

Business Model

Wispr uses a B2B2C SaaS model with individual subscriptions and enterprise licensing. Target users include knowledge workers, developers, and enterprise teams that need accurate voice-to-text conversion across multiple applications and devices.

A freemium structure acquires users via a free tier with basic functionality, then converts to paid plans for advanced features like unlimited sessions, team collaboration, and API access. Enterprise customers pay negotiated rates for bulk licensing, shared dictionaries, and integration capabilities.

Wispr's cost structure includes cloud infrastructure for real-time speech processing and licensing fees for third-party language models, and the company is building proprietary ASR models to reduce external dependencies and improve margins. Cross-application functionality creates switching costs as users integrate voice dictation into daily workflows.

The model benefits from network effects within enterprise teams sharing custom vocabularies and from usage-based expansion as customers increase dictation frequency. Developer-focused features and API access create additional revenue streams and enable use as infrastructure for other applications.

In March 2025, Wispr launched a product partnership with Warp to power native voice input in the Warp terminal. The integration lets developers speak commands and questions, pairing Flow with agent-like capabilities for complex operations. This deepens Wispr’s developer positioning by embedding voice-native workflows directly inside coding environments.

Competition

Platform-integrated dictation

Apple, Microsoft, and Google bundle voice dictation into their operating systems and devices as a retention feature rather than a revenue driver. Apple Intelligence runs small language models on-device across iOS and macOS with automatic punctuation and Live Translation.

Microsoft's Windows 11 Voice Access includes Fluid Dictation with grammar and filler word correction powered by on-device models on Copilot+ PCs. Google's Gboard and Gemini Live offer real-time transcription on Android flagships with sub-100 millisecond latency.

These incumbents create pricing pressure by offering free functionality and raise user expectations for speed and battery efficiency. Wispr must differentiate through superior accuracy and cross-platform consistency to justify paid subscriptions.

Enterprise and vertical specialists

Nuance Dragon dominates regulated industries like healthcare and legal with decades of vocabulary tuning and compliance features. The 2025 Dragon releases added enhanced security and SDK integration, maintaining strong positioning in professional verticals.

Dragon's Windows-centric approach and $600-$1,200 per seat pricing create opportunities for Wispr's cross-platform solution at $12 per month. However, Dragon's deep vertical integration and Microsoft distribution through the Nuance acquisition provide significant competitive advantages in enterprise sales.

Meeting-focused platforms

Otter.ai, Fireflies, and Read AI originally focused on meeting transcription but are expanding into general dictation and productivity workflows. These platforms offer $13-$30 monthly plans with established enterprise customer bases and AI-powered meeting summaries.

TAM Expansion

New products

Wispr is developing proprietary ASR models that achieve 10% word error rates compared to 27% for OpenAI Whisper and 47% for Apple's on-device dictation. Owning the speech recognition stack reduces third-party licensing costs while enabling speaker-adaptive personalization that improves accuracy over time.

The roadmap extends beyond dictation toward voice-controlled workflow automation. Future capabilities include drafting email replies, triggering application macros, and controlling software through voice commands, expanding into productivity automation and RPA markets.

Android launch planned for Q1 2026 will double the addressable device base after the current Mac, Windows, and iOS coverage. Cross-platform parity enables Wispr to compete for enterprise deals requiring comprehensive device support.

Customer base expansion

The newly launched API enables SaaS vendors and enterprises to embed Wispr transcription into internal tools and customer-facing applications. This B2B2B model expands beyond individual subscriptions to platform licensing and white-label implementations.

Developer-focused features like variable recognition and IDE integration tap into the 30 million global professional developer market. Partnerships with development tools like Cursor and Warp position Wispr within daily coding workflows, driving higher retention and usage frequency.

Team and Business tiers target enterprise customers with shared dictionaries, administrative controls, and bulk licensing. The 270 Fortune 500 companies already reached provide expansion opportunities for department-wide and company-wide deployments.

Geographic expansion

International growth funding from the November 2025 round enables localization for non-US markets with region-specific compliance and data residency requirements. Support for 100+ languages and accent-specific models addresses global enterprise customers and emerging market opportunities.

The speech recognition market shows 18% CAGR in the Americas with faster growth in emerging regions. Wispr's cloud-based architecture and cross-platform compatibility position the company to scale internationally without significant infrastructure changes.

Multilingual and code-switching capabilities serve global teams and international companies where employees switch between languages within conversations, creating competitive advantages over region-specific solutions.

Risks

Commoditization pressure: Free voice dictation from Apple, Google, and Microsoft improves through on-device AI models, reducing the performance gap that supports Wispr's subscription pricing. As platform-integrated solutions reach comparable accuracy and feature sets, Wispr must deliver differentiation to defend pricing power.

Infrastructure costs: Real-time speech processing requires substantial cloud infrastructure and bandwidth, creating variable costs that scale with usage. As the company grows and processes more audio, maintaining sub-700 millisecond latency while controlling infrastructure expenses becomes more challenging, especially for international expansion where data residency requirements may require regional deployments.

Read more from

Scott Stevenson, CEO of Spellbook, on building Cursor for contracts

lightningbolt_icon Unlocked Report
Continue Reading
None