QWhisper is a Qt6-based desktop application that provides real-time speech-to-text functionality using OpenAI's Whisper model. It offers a user-friendly graphical interface for continuous audio transcription with support for multiple output formats and advanced audio processing features.
This software has a 100% Works on My Box Seal of Approval. YMMV.
- Real-time speech-to-text transcription using Whisper models
- Support for multiple Whisper model sizes (tiny, base, small, medium, large, turbo)
- CPU and GPU (CUDA) acceleration support
- Voice Activity Detection with configurable thresholds
- Audio filtering with bandpass filter for improved accuracy
- Multiple audio input sources (microphone, system audio)
- Interactive transcript editing with search functionality
- Multiple output options:
- Interactive transcript window
- File output with continuous append
- Clipboard integration
- Direct typing to active window (X11/Wayland)
- Automatic model downloading and management
- Persistent configuration settings
- Audio level monitoring and visualization
- Qt6 (Core, Widgets, Multimedia, Network)
- CMake 3.16 or higher
- C++17 compatible compiler
- PulseAudio development libraries (Linux)
- CUDA Toolkit (for GPU acceleration)
- X11 development libraries with XTest extension (for window typing on X11)
- For Wayland window typing:
- (may require xwayland)
ydotool(requires ydotoold daemon) orwtype(alternative to ydotool)
- c++/qt build toolchain
- Clone or extract the source code
- Navigate to the qwhisper directory
- Create and enter build directory:
mkdir -p build cd build - Configure the build:
cmake ..
- Compile the application:
make -j$(nproc)
The build process will automatically download and compile the whisper.cpp library. CUDA support will be automatically detected and enabled if the CUDA toolkit is installed.
To install QWhisper system-wide:
sudo ./install.shTo install for the current user only:
./install.shThe installer will:
- Copy the executable to the appropriate bin directory
- Install the application icon and desktop integration files
- Set up documentation and menu entries
After installation, QWhisper can be launched:
- From the command line:
qwhisper - From your desktop environment's application menu
- By clicking the desktop icon
To run without installation from the build directory:
cd build
./qwhisperQWhisper stores its configuration in ~/.config/qwhisper/config.json following the XDG Base Directory specification. Whisper models are downloaded to ~/.local/share/qwhisper/models/ by default.
On first launch, QWhisper will prompt to download a Whisper model if none are found. The application supports automatic downloading of all official Whisper models from the Hugging Face repository.
Ctrl+F: Search within transcriptCtrl+S: Save transcriptCtrl+O: Open saved transcriptCtrl+E: Export transcript
To remove QWhisper from your system:
./uninstall.shThe uninstaller will:
- Remove all installed application files
- Optionally preserve user configuration files
- Optionally preserve downloaded models
- Clean up desktop integration
If you experience crashes with GPU selection, try:
CUDA_VISIBLE_DEVICES="0,1,2" qwhisperThis limits which GPUs are visible to avoid out-of-memory issues.
If audio capture is not working:
- Check that your audio device is properly configured in system settings
- Verify PulseAudio is running and accessible
- Try different audio input sources in QWhisper settings
This project is licensed under the MIT License. See the LICENSE file for details.
