YouTube Transcript

Overview

Extract YouTube video transcripts, metadata, and chapters using youtube-transcript-api and yt-dlp. The output is formatted as Markdown with YAML frontmatter and saved to ~/Brains/brain/ (an Obsidian vault).

Quick Start

To extract a transcript from a YouTube video:

python scripts/extract_transcript.py <youtube_url>

Optionally, specify a custom output filename:

python scripts/extract_transcript.py <youtube_url> custom_filename.md

Output Format

YAML Frontmatter

The generated Markdown file begins with YAML frontmatter containing the following fields:

title - Video title
channel - Channel name
url - YouTube URL
upload_date - Upload date (YYYY-MM-DD)
duration - Video duration (HH:MM:SS)
description - Video description (truncated to 500 chars)
tags - Array of video tags
view_count - View count
like_count - Like count
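
As an illustration of how these fields could be assembled, here is a minimal sketch, not the actual script. It assumes the metadata arrives as the dictionary parsed from yt-dlp's JSON output, using the yt-dlp field names `title`, `channel`, `webpage_url`, `upload_date` (a YYYYMMDD string), `duration` (seconds), `description`, `tags`, `view_count`, and `like_count`. The function names `build_frontmatter`, `format_duration`, and `format_upload_date` are hypothetical.

```python
import json


def format_duration(seconds) -> str:
    """Render a duration in seconds as zero-padded HH:MM:SS."""
    hours, rem = divmod(int(seconds), 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def format_upload_date(raw: str) -> str:
    """Convert a YYYYMMDD string to YYYY-MM-DD; pass anything else through."""
    if len(raw) == 8 and raw.isdigit():
        return f"{raw[:4]}-{raw[4:6]}-{raw[6:]}"
    return raw


def build_frontmatter(info: dict) -> str:
    """Build a YAML frontmatter block from a yt-dlp style info dict (assumed shape)."""
    fields = {
        "title": info.get("title") or "",
        "channel": info.get("channel") or "",
        "url": info.get("webpage_url") or "",
        "upload_date": format_upload_date(info.get("upload_date") or ""),
        "duration": format_duration(info.get("duration") or 0),
        # The description is truncated to 500 characters, as documented above.
        "description": (info.get("description") or "")[:500],
        "tags": info.get("tags") or [],
        "view_count": info.get("view_count") or 0,
        "like_count": info.get("like_count") or 0,
    }
    # JSON strings, numbers, and arrays are valid YAML 1.2 flow syntax, so
    # json.dumps gives safe quoting without adding a YAML dependency.
    lines = [
        f"{key}: {json.dumps(value, ensure_ascii=False)}"
        for key, value in fields.items()
    ]
    return "---\n" + "\n".join(lines) + "\n---\n"
```

Quoting every string through `json.dumps` keeps titles containing colons or quotes from breaking the frontmatter, and `ensure_ascii=False` leaves Korean and Japanese text readable.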

Body Structure

The transcript is organized by video chapter when chapters are available:

## Chapter Title

**00:05:23** Transcript text for this segment.

**00:05:45** Next segment text.

If the video has no chapters, all content appears under a single "## Transcript" heading.

Timestamps are always formatted as HH:MM:SS, including for videos shorter than one hour.
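
The body layout above could be produced along these lines. This is a sketch under assumptions: each segment is taken to be a dictionary with a `start` offset in seconds and a `text` string, and the helper names `format_timestamp` and `render_section` are hypothetical rather than taken from the script.

```python
def format_timestamp(seconds: float) -> str:
    """Render an offset in seconds as zero-padded HH:MM:SS (fractions dropped)."""
    hours, rem = divmod(int(seconds), 3600)
    minutes, secs = divmod(rem, 60)
    return f"{hours:02d}:{minutes:02d}:{secs:02d}"


def render_section(heading: str, segments: list) -> str:
    """Render one heading followed by bold-timestamped segment lines."""
    lines = [f"## {heading}", ""]
    for seg in segments:
        lines.append(f"**{format_timestamp(seg['start'])}** {seg['text']}")
        lines.append("")  # blank line between segments, as in the example above
    return "\n".join(lines)
```

Calling `render_section("Transcript", segments)` covers the no-chapter case with the same code path.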

Workflow

Extract metadata using yt-dlp --dump-json
Extract transcript using youtube-transcript-api (tries Korean → English → Japanese)
Remove duplicate entries (drop any segment whose text is a prefix of the next segment)
Group transcript segments by video chapters (if present)
Format as Markdown with YAML frontmatter
Save to ~/Brains/brain/ with sanitized filename based on video title
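
The chapter-grouping step can be sketched as follows. The sketch assumes chapters come from the `chapters` list in yt-dlp's JSON output (entries with `title` and `start_time`, sorted by start time) and that segments are dictionaries with a `start` offset in seconds. The function name `group_by_chapter` is hypothetical, and placing segments that precede the first chapter into that chapter is a choice made here for illustration.

```python
from bisect import bisect_right


def group_by_chapter(segments, chapters):
    """Return a list of (heading, segments) pairs, one per chapter.

    Falls back to a single "Transcript" group when there are no chapters.
    """
    if not chapters:
        return [("Transcript", list(segments))]

    # Assumes chapters are already sorted by start_time.
    starts = [ch["start_time"] for ch in chapters]
    groups = [(ch["title"], []) for ch in chapters]

    for seg in segments:
        # Index of the last chapter that starts at or before this segment.
        index = max(bisect_right(starts, seg["start"]) - 1, 0)
        groups[index][1].append(seg)
    return groups
```

Using `bisect_right` keeps the lookup logarithmic in the number of chapters, which matters little for typical videos but avoids a nested loop.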

Language Support

The skill tries to extract transcripts in this order:

Korean (ko) - Priority for Korean content
English (en) - Fallback for international content
Japanese (ja) - Additional fallback

If none of the requested languages are available, the script exits with an error message.
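
The fallback order can be illustrated with a small, library-independent sketch. The names `PREFERRED_LANGUAGES`, `pick_language`, and `require_language` are hypothetical, and the error text is illustrative only. youtube-transcript-api also accepts a list of language codes in priority order, so the actual script may simply pass that list to the library instead of selecting a language itself.

```python
import sys

# Priority order described above: Korean, then English, then Japanese.
PREFERRED_LANGUAGES = ["ko", "en", "ja"]


def pick_language(available, preferred=PREFERRED_LANGUAGES):
    """Return the first preferred language code present in `available`, else None."""
    for code in preferred:
        if code in available:
            return code
    return None


def require_language(available, preferred=PREFERRED_LANGUAGES):
    """Like pick_language, but exit with an error message when nothing matches."""
    code = pick_language(available, preferred)
    if code is None:
        # sys.exit with a string prints it to stderr and exits with status 1.
        sys.exit(f"No transcript available in: {', '.join(preferred)}")
    return code
```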

Requirements

Install required dependencies:

# Install yt-dlp for metadata extraction
apt install yt-dlp
# OR
pip install yt-dlp

# Install youtube-transcript-api for transcript extraction
pip install youtube-transcript-api

Why youtube-transcript-api?

The skill uses youtube-transcript-api instead of directly downloading VTT files with yt-dlp because:

More reliable - Fetches caption data directly and tends to avoid the HTTP 429 (Too Many Requests) errors seen when downloading VTT files
Better language support - Easy to specify language preferences
Cleaner data - Returns structured data directly, no VTT parsing needed
Faster - No file download/cleanup overhead
Auto-generated captions - Supported alongside manually created captions

Deduplication

The skill automatically removes duplicate entries where a transcript segment is a prefix of the next segment. This pattern is common in auto-generated captions, where each segment repeats the previous text and appends new words.
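
A minimal sketch of the prefix rule, assuming segments are dictionaries with a `text` field; the function name `remove_prefix_duplicates` is hypothetical and the real scripts may differ in detail, for example in how they treat whitespace.

```python
def remove_prefix_duplicates(segments):
    """Drop every segment whose text is a prefix of the following segment's text."""
    result = []
    for i, seg in enumerate(segments):
        text = seg["text"].strip()
        if i + 1 < len(segments):
            next_text = segments[i + 1]["text"].strip()
            # The next segment already contains this text, so skip this one.
            if next_text.startswith(text):
                continue
        result.append(seg)
    return result
```

Because each segment is compared only with its immediate successor, a run of accumulating captions collapses to its last, most complete entry, and an exact repeat keeps only the later copy.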

To manually deduplicate existing transcript files:

python scripts/deduplicate_transcript.py <markdown_file>

Troubleshooting

"No subtitles available"

The video may not have captions in any of the requested languages (ko/en/ja).
Some videos have captions disabled entirely.
Open the video on YouTube to confirm whether captions are available.