CapsGAT 1.2 User Manual

1. Introduction

1.1 Overview

CapsGAT is a specialized transcription workstation designed for converting subtitle files into publishable interview transcripts using GAT2 (Gesprächsanalytisches Transkriptionssystem 2) conventions.

1.2 Key Features

Quick Speaker Assignment: Rapidly assign speakers to subtitle segments using keyboard shortcuts
Basic Editing: Split, merge, and edit transcript segments
GAT2 Symbols: Insert conversation analysis symbols for pauses, breathing, and annotations
Audio Synchronization: Link audio files to transcripts with auto-sync functionality
Export Transcripts: Generate formatted transcripts in HTML or plain text

2. Projects

2.1 Basic Project File Management

Start a new transcription project by selecting File → New Project or pressing Ctrl+N.
Open saved projects with File → Open Project or Ctrl+O. CapsGAT uses the .capsgat file format.
Save your work with File → Save Project (Ctrl+S) or Save Project As to create a new file.

2.2 Project Memos

Add project notes and descriptions via Edit → Project Memo. This information can be included in transcript exports.

3. Importing Subtitle Files

3.1 Supported File Formats

SRT (.srt): Standard subtitle format with timestamps
JSON (.json): Various JSON formats including token-based and segment-based
Text (.txt): Plain text files (one block per line)
TSV (.tsv): Tab-separated values with start/end times and text
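
As an illustration of what a timestamped subtitle format contains, the following minimal Python sketch reads SRT text into timed blocks. It is not CapsGAT's importer; the `Block` class and the `parse_srt` function are hypothetical names, and the sketch assumes well-formed cues (optional index line, a `start --> end` line, then one or more text lines).

```python
import re
from dataclasses import dataclass


@dataclass
class Block:
    start: float  # seconds
    end: float    # seconds
    text: str


# Matches SRT timestamps such as 00:01:02,345 (a dot is tolerated as well).
_TIME = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


def _to_seconds(stamp: str) -> float:
    match = _TIME.search(stamp)
    if match is None:
        raise ValueError(f"not an SRT timestamp: {stamp!r}")
    h, m, s, ms = map(int, match.groups())
    return h * 3600 + m * 60 + s + ms / 1000


def parse_srt(content: str) -> list[Block]:
    """Parse SRT text into blocks (illustrative sketch only)."""
    blocks = []
    normalized = content.replace("\r\n", "\n").strip()
    # Cues are separated by one or more blank lines.
    for cue in re.split(r"\n\s*\n", normalized):
        lines = cue.split("\n")
        # Drop the numeric cue index if present.
        if lines and lines[0].strip().isdigit():
            lines = lines[1:]
        if not lines or "-->" not in lines[0]:
            continue  # not a valid cue, skip it
        start, end = lines[0].split("-->")
        text = " ".join(line.strip() for line in lines[1:])
        blocks.append(Block(_to_seconds(start), _to_seconds(end), text))
    return blocks
```

Multi-line cue text is joined into a single line here, which is one possible choice for transcript work; other tools may keep the line breaks.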

3.2 Import Methods

Use File → Import → Subtitles to load subtitle files. When importing audio files, CapsGAT automatically searches for matching subtitle files in the same directory and offers to import them.

3.3 JSON Import Options

When importing JSON files with token data, you can choose from three import methods:

One Continuous Block: Import all text as a single segment
Tokens as Separate Blocks: Create individual blocks for each token
Auto-segment: Automatically detect pauses and create segments accordingly
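
The idea behind pause-based auto-segmentation can be sketched in Python as follows. This is an assumption-laden illustration, not CapsGAT's actual algorithm: the token fields (`text`, `start`, `end`), the function name `auto_segment`, and the 0.5-second default threshold are all hypothetical.

```python
def auto_segment(tokens, pause_threshold=0.5):
    """Group tokens into segments, splitting wherever the silence between
    two consecutive tokens is at least pause_threshold seconds.

    Each token is assumed to be a dict with "text", "start", and "end"
    (times in seconds); this layout is an assumption for illustration.
    """
    segments = []
    current = []
    for token in tokens:
        # A long enough gap after the previous token closes the segment.
        if current and token["start"] - current[-1]["end"] >= pause_threshold:
            segments.append(current)
            current = []
        current.append(token)
    if current:
        segments.append(current)

    return [
        {
            "start": seg[0]["start"],
            "end": seg[-1]["end"],
            "text": " ".join(t["text"] for t in seg),
        }
        for seg in segments
    ]
```

With a threshold of 0.5 seconds, tokens ending at 0.9 s and starting at 2.0 s would land in different segments, while tokens 0.1 s apart stay together.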

4. Basic Formatting Functions

4.1 Speaker Assignment

Assign a speaker to the current block with the number keys 1-4 (configurable up to 8 speakers). Press U to remove the assignment.

4.2 Navigation

Keyboard Shortcuts:

Next block: N or →
Previous block: P or ←
Jump to unassigned: Click a block in the "Unassigned Blocks" list

4.3 Editing Functions

Split Block: Space - Open the split dialog to divide the current block
Merge Blocks: Delete - Combine the current block with the next block
Edit Content: E - Open the text editor for the current block
Insert Empty Line: Enter - Add a blank line for formatting

4.4 GAT2 Symbols

Open the symbols dialog with * or the Symbols button. The dialog includes:

Pauses: Micropause (.), short (-), medium (--), long (---)
Breathing: Inhales (°h, °hh, °hhh) and exhales (h°, hh°, hhh°)
Annotations: Comments ((())), actions (<<>>), overlaps ([ ])
Measured Pauses: Custom timing with placement dialog
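
To show how the pause symbols relate to pause length, here is a small Python sketch that maps a duration in seconds to a symbol. The thresholds follow the ranges commonly cited for GAT2 estimated pauses (up to about 0.2 s, 0.2-0.5 s, 0.5-0.8 s, 0.8-1.0 s), with longer pauses written as measured values such as (2.0). The function name is hypothetical, and CapsGAT's own dialogs may use different boundaries.

```python
def pause_symbol(duration: float) -> str:
    """Return a GAT2-style pause symbol for a pause of the given length.

    Thresholds are the approximate ranges usually given for GAT2 and are
    an assumption here, not values taken from CapsGAT.
    """
    if duration <= 0.2:
        return "(.)"      # micropause
    if duration <= 0.5:
        return "(-)"      # short pause
    if duration <= 0.8:
        return "(--)"     # medium pause
    if duration <= 1.0:
        return "(---)"    # long pause
    # Longer pauses are written as a measured value with one decimal.
    return f"({duration:.1f})"
```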

5. Audio Functions

5.1 Importing Audio

Load audio files via File → Import → Audio File. Supported formats: MP3, WAV, OGG, M4A, FLAC.

5.2 Automatic Subtitle Detection

When importing audio, CapsGAT automatically searches the directory for matching subtitle files (SRT, JSON, TXT, TSV) and offers to import them.
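
A directory search of this kind could look like the following Python sketch. It assumes that "matching" means a file with the same base name as the audio file and one of the supported subtitle extensions; that interpretation, the function name, and the extension order are assumptions rather than documented CapsGAT behavior.

```python
from pathlib import Path

# Subtitle extensions listed in section 3.1, in an arbitrary order.
SUBTITLE_EXTENSIONS = (".srt", ".json", ".txt", ".tsv")


def find_matching_subtitles(audio_path):
    """Return subtitle files next to audio_path that share its base name."""
    audio = Path(audio_path)
    matches = []
    for ext in SUBTITLE_EXTENSIONS:
        # interview.mp3 -> interview.srt, interview.json, ...
        candidate = audio.with_suffix(ext)
        if candidate.is_file():
            matches.append(candidate)
    return matches
```

For example, calling `find_matching_subtitles("recordings/interview.mp3")` would return `recordings/interview.srt` if that file exists. Whether differently cased extensions such as `.SRT` are found depends on the file system.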

5.3 Audio Controls

Playback Controls:

Play/Pause: End or click ⏯ button
Rewind 5s: PgUp or click ⏪ button
Fast Forward 5s: PgDn or click ⏩ button
Jump to Time: Use the "Jump" button for precise navigation
Adjust Speed: Turn the speed knob with the mouse wheel, click the speed buttons (the center button resets the speed), or press the + and - keys

5.4 Additional Audio Features

Auto-sync to Audio: Automatically highlights the current transcript block during playback (navigating with the audio controls PgUp/PgDn is recommended over scrolling)
Autopause: Automatically pauses audio when opening editing dialogs
Progress Tracking: Visual progress bar with time display

6. Exporting Transcripts

6.1 Export Formats

Generate transcripts in these formats:

HTML: Formatted with CSS styling; the closest to a publication-ready transcript
Plain Text: Simple text format for maximum copy-paste compatibility
Subtitle File: An .srt file containing timestamps and speaker labels

6.2 Export Options

Customize your export with these options:

Include Timestamps: Add timecodes to transcript (when available)
Include Diarization (Speaker Labels): Choose whether to include speaker labels (only applicable for subtitle exports)
Project Title: Include project name as header
Project Memo: Include project description
Audio File Info: Include source audio file name
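
To make the timestamp and speaker-label options concrete, the following Python sketch writes blocks as SRT cues. It is not CapsGAT's exporter: the block fields (`start`, `end`, `text`, `speaker`), the function names, and the `Speaker: text` label style are assumptions chosen for illustration.

```python
def format_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def export_srt(blocks, include_speakers=True):
    """Render blocks as SRT text, optionally prefixing speaker labels.

    Each block is assumed to be a dict with "start", "end", "text", and an
    optional "speaker" key (hypothetical layout).
    """
    cues = []
    for index, block in enumerate(blocks, start=1):
        text = block["text"]
        if include_speakers and block.get("speaker"):
            text = f'{block["speaker"]}: {text}'
        start = format_timestamp(block["start"])
        end = format_timestamp(block["end"])
        cues.append(f"{index}\n{start} --> {end}\n{text}\n")
    # A blank line separates consecutive cues.
    return "\n".join(cues)
```

Passing `include_speakers=False` leaves the cue text unchanged, which mirrors turning the diarization option off.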

6.3 Export Preview

Use the preview feature to review your transcript formatting before exporting. The preview shows an approximation of how the final transcript will appear.
