Author's note

I vibe-coded this project. The keyboard shorcut is binded through GNOME custom shortcut UI, which then is binded to my Logitech mouse's side button through Input Remapper.

This is for personal use. I'm on Ubuntu, X11 (initially, wtype didn't work, so I switched to X11).

Usage: Click mouse's side button, talk, click again, text appears.

Customizable parts:

STT provider.
Activation and stop sound.
Output directory for temporary .wav file.
Logic for knowing is there a recording going on.
Mechanism to type text to where the cursor in is. System-specific.

Dictation CLI

Small Go CLI to transcribe a WAV dropped into the current directory and insert the text at the cursor.

What it does

If no *.wav present: plays a short pip and notifies "Recording ready" so you can record into this folder.
If a *.wav exists: plays a pip, uploads the newest WAV to OpenAI Whisper, copies/transmits the transcription into the active app (xdotool/clipboard), and deletes the WAV.

Requirements

Linux (GNOME/X11 or Wayland)
Tools: xdotool, paplay or aplay, notify-send. For Wayland: wl-copy (preferred) or xclip + xdotool as fallback.
Environment: OPENAI_API_KEY set.

Build

cd /home/kyle/dictation
go build -o dictate

Usage

Bind the dictate binary to a keyboard shortcut.
Press once to prepare recording (hear pip + notification), save a WAV into the folder, then press again to transcribe and insert.

Notes

On Wayland the program will copy to clipboard and notify you to paste if xdotool cannot simulate a paste.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
README.md		README.md
go.mod		go.mod
main.go		main.go
off.mp3		off.mp3
on.mp3		on.mp3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Author's note

Dictation CLI

About

Uh oh!

Releases

Packages

Languages

okoyfoeciov/dictation

Folders and files

Latest commit

History

Repository files navigation

Author's note

Dictation CLI

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages