YouTube Transcript Clipper

clipper.py is a small helper that lets you point yt-dlp at a YouTube video, search the subtitles for a specific line, and download a short clip around that subtitle.

It works by:

Fetching automatic or creator-provided subtitles via yt-dlp (JSON format).
Fuzzy matching your query against the transcript to find the best subtitle segment.
Asking yt-dlp to download only the portion of the video around that timestamp.

Requirements

Python 3.9+
yt-dlp installed as a Python package (pip install yt-dlp)
ffmpeg available on your PATH (required by yt-dlp for cutting clips)

Create a local environment and install dependencies:

python3 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

On Windows PowerShell:

py -3 -m venv .venv
.venv\Scripts\Activate.ps1
pip install -r requirements.txt

Usage

python clipper.py "<youtube-url>" "<query text>" --before 5 --after 10 --lang en --output myclip.mp4

--before / --after control how many seconds to keep before/after the matching subtitle (defaults: 5s each).
--lang selects the subtitle language to search (default: en). The tool will try both manual and automatic subtitles.
--output sets the destination file. If omitted, a name is generated from the query.
--format lets you forward a custom yt-dlp format selector (for example: bestvideo+bestaudio/best).
--verbose surfaces the underlying yt-dlp and ffmpeg logs.
--auto-transcribe falls back to a speech-to-text workflow when no subtitles are available.
--stt-provider selects the speech-to-text provider plugin to use alongside --auto-transcribe.

Example:

python clipper.py "https://www.youtube.com/watch?v=dQw4w9WgXcQ" "never gonna give you up" --before 3 --after 7 --output rickroll.mp4

The script uses yt-dlp's --download-sections "*start-end" support to grab just the requested time span when possible, and saves an .srt subtitle file alongside the clip so you can burn captions later if you prefer. If the line occurs multiple times, it picks the subtitle segment with the highest fuzzy match score, which usually corresponds to the closest textual match. When yt-dlp cannot download just the requested range, it falls back to fetching the full video and trims it locally with ffmpeg.

If no good match is found the script exits with a nonzero status. Try adjusting your query (shorter phrases often match better) or confirm that the video has subtitles in the selected language.

GUI frontend

Prefer not to use the terminal? Launch the Tkinter-based helper to collect options, preview the exact command that will run, and stream clipper.py output in a scrollable panel:

python clipper_gui.py

Fill in the video URL and query text (both required), adjust how many seconds before/after the quote you want, pick a subtitle language and quality preset from the dropdowns, optionally choose an output path, then click Run clipper. The GUI constructs the equivalent CLI command (with verbose logging enabled), runs it on your behalf, and surfaces progress and errors inline so non-technical users can follow along.

At launch the GUI offers to install or update yt-dlp via pip install --upgrade yt-dlp, since staying on the latest release is critical for bypassing YouTube site changes. Accepting the prompt is especially helpful for Windows users who might not update the dependency manually.

Speech-to-text fallback

==EXPERIMENTAL, HERE BE DRAGONS!==

When yt-dlp cannot fetch subtitles, pass --auto-transcribe to download the audio track, send it through a speech-to-text (STT) provider, and continue matching locally. Providers are pluggable: define a subclass of STTProvider in stt_providers.py, decorate it with @register_provider, and select it at runtime via --stt-provider <name>. The repository currently ships with a stub provider that simply documents the interface; replace it with an implementation that uploads audio to your service of choice (for example, OpenAI Whisper API or Google Cloud Speech-to-Text) and returns timestamped segments.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
README.md		README.md
clipper.py		clipper.py
clipper_gui.py		clipper_gui.py
requirements.txt		requirements.txt
stt_providers.py		stt_providers.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

YouTube Transcript Clipper

Requirements

Usage

GUI frontend

Speech-to-text fallback

About

Uh oh!

Releases

Packages

Languages

SecareLupus/YoutubeClipper

Folders and files

Latest commit

History

Repository files navigation

YouTube Transcript Clipper

Requirements

Usage

GUI frontend

Speech-to-text fallback

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages