Buzz transcribes and translates audio to text offline using OpenAI's Whisper. Import audio and video files into Buzz and export them as TXT, SRT, or VTT files. Buzz supports Whisper, Whisper.cpp, Faster Whisper, Whisper-compatible models from the Hugging Face repository, and the OpenAI Whisper API.
Required permissions in Buzz will let you select audio and video files for transcription, from most common file location on your computer. Network permission is used to download transcription model files. Microphone permission lets you transcribe real time speech.
Note: If your system theme is not applied to Buzz, ensure it is in ~/.themes folder. You may need to copy the system themes to this folder cp -r /usr/share/themes/ ~/.themes/.
This release introduces Vulkan GPU support for whisper.cpp making it significantly faster even on laptops. Real-time transcription is possible even with large models on computers with ~5GB RAM video cards. There is now an option to separate voice tracks before the audio is transcribed. This can improve transcript accuracy for audios with background noises or music. Faster whisper was updated to the latest version adding noticeable speed improvement.
Additional improvements: