Speechify’s Windows app uses local models for transcription and dictation
1 day ago / Read about 8 minute
Source:TechCrunch

Image Credits:Speechify

Voice AI company Speechify just launched a native Windows app that employs locally stored models to enable dictation across apps, and reading aloud articles, documents, or PDFs using its library of voices.

The company is taking on the likes of Wispr Flow, Willow, and Superwhisper, which also provide dictation and transcription apps across platforms.

Speechify said its Windows app does voice processing entirely on-device on Copilot+ PCs (that have NPUs from AMD, Intel, and Qualcomm) and other Windows 11 PCs that have GPUs from Intel and AMD.

The app has three models running on-device: neural text-to-speech, real-time voice activity detection, and Whisper-powered transcription. Users can configure the app to switch to cloud-based models or even change them during usage.

The company, which has over 50 million users, said that VITS Neural can generate audio across seven different speed presets, allowing users to have the app read aloud documents or web pages. The company uses the Silero open source model for voice activity detection.

“Over a billion people on this planet use Windows. With this Windows launch, we’re making sure that reading, and now writing, is never a barrier, no matter what device you use or how you prefer to work. We’re especially excited about the opportunity in the enterprise given how many professionals have asked for Speechify on their PCs,” said Cliff Weitzman, founder and CEO of Speechify, in a statement.

Last month, the company launched Granola-like meeting transcription, but that feature was limited to browser-based meetings. Now that the company has apps across platforms, it will likely bring over this feature to native apps to transcribe meetings on any app or browser.

Techcrunch event

Disrupt 2026: The tech ecosystem, all in one room

Your next round. Your next hire. Your next breakout opportunity. Find it at TechCrunch Disrupt 2026, where 10,000+ founders, investors, and tech leaders gather for three days of 250+ tactical sessions, powerful introductions, and market-defining innovation. Register now to save up to $400.

Save up to $300 or 30% to TechCrunch Founder Summit

1,000+ founders and investors come together at TechCrunch Founder Summit 2026 for a full day focused on growth, execution, and real-world scaling. Learn from founders and investors who have shaped the industry. Connect with peers navigating similar growth stages. Walk away with tactics you can apply immediately

Offer ends March 13.

San Francisco, CA | October 13-15, 2026
REGISTER NOW

Until a few years ago, Speechify largely concentrated on text-to-speech use cases such as reading out articles and emails, and generating podcasts out of documents. Lately, the company has been trying to become a full-stack voice app for users by launching dictation, meeting transcription, and a voice assistant.