Speed of Sound is a voice-typing app for the Linux desktop.
By default, transcription runs entirely on-device, so no audio ever leaves your machine. The app ships with a built-in multilingual Whisper model and supports additional models from the Whisper, Parakeet, and Canary families to improve accuracy.
Please visit speedofsound.io/getting-started after installation.
Added support for NVIDIA Canary and Parakeet ASR model families alongside Whisper, giving more options for on-device speech recognition across multiple languages. New "Stay hidden on activation" setting lets you add the app to system startup without showing the main window. The app now detects when the XDG Remote Desktop Portal is not supported and shows a clear banner instead of failing silently. Fixed typing error propagation so non-Latin script failures surface as notifications instead of being swallowed. Fixed a bug where changing the secondary language in Preferences didn't update the main window. Updated dependencies including Sherpa ONNX, Kotlin, and cloud SDKs.