I handed Claude a 52MB video file and said: optimize it, cut it into chapters, extract thumbnails, upload everything to WordPress, and build me a watch page. No external video editing software. No Premiere. No Final Cut. Just an AI agent with access to ffmpeg, a WordPress REST API, and a GCP service account.
It worked. Here is exactly what happened and what it means.
The Starting Point
The video was a 6-minute, 39-second NotebookLM-generated explainer about our AI music pipeline — “The Autonomous Halt: Engineering the Multi-Modal Creative Loop.” It covers the seven-stage pipeline that generated 20 songs across 19 genres, graded its own output, detected diminishing returns, and chose to stop. The production quality is high — animated whiteboard illustrations, data visualizations, architecture diagrams — all generated by Google’s NotebookLM from our documentation.
The file sat on my desktop. I uploaded it to my Cowork session and told Claude to do something impressive with it.
What Claude Actually Did
Step 1: Video Analysis
Claude ran ffprobe to inspect the file — 1280×720, H.264, 30fps, AAC audio, 52.1MB. Then it extracted 13 keyframes at 30-second intervals and visually analyzed each one to understand the video’s structure. No transcript needed. Claude looked at the frames and identified the chapter breaks from the visual content alone.
ffmpeg -vf "fps=1/30" → 13 keyframes extracted
Claude vision → chapter boundaries identified
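The frame-sampling step can be sketched as a small helper that builds the ffmpeg command. The function name and file names here are illustrative; the source only specifies the `fps=1/30` filter and the 30-second interval:

```python
def keyframe_cmd(src: str, out_pattern: str, interval_s: int = 30) -> list[str]:
    """Build an ffmpeg command that grabs one frame every `interval_s` seconds."""
    return [
        "ffmpeg", "-i", src,
        "-vf", f"fps=1/{interval_s}",  # sample rate: 1 frame per interval
        out_pattern,                   # e.g. keyframe_%02d.png numbers the frames
    ]

cmd = keyframe_cmd("explainer.mp4", "keyframe_%02d.png")
# subprocess.run(cmd, check=True)  # requires ffmpeg on PATH
```

On a 6:39 video, a 30-second interval yields 13 sampled frames, matching the count above.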
Step 2: Optimization
The raw file was 52MB — too heavy for web delivery. Claude compressed it with libx264 at CRF 26 with faststart enabled for progressive streaming. Result: 21MB. Same resolution, visually identical, loads in half the time.
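A compression command matching those settings might look like the following. The helper name and the AAC audio passthrough are assumptions; the source specifies only libx264, CRF 26, and faststart:

```python
def compress_cmd(src: str, dst: str, crf: int = 26) -> list[str]:
    """Build an ffmpeg H.264 re-encode tuned for web delivery (illustrative)."""
    return [
        "ffmpeg", "-i", src,
        "-c:v", "libx264",
        "-crf", str(crf),           # quality target: higher CRF = smaller file
        "-c:a", "aac",              # assumed: re-encode audio to AAC
        "-movflags", "+faststart",  # move the moov atom up front for progressive streaming
        dst,
    ]
```

The `+faststart` flag is what lets the browser start playback before the whole file has downloaded.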
Step 3: Chapter Segmentation
Based on the visual analysis, Claude identified six distinct chapters and cut the video into segments using ffmpeg stream copy — no re-encoding, so the cuts are instant and lossless. It also extracted a poster thumbnail for each chapter at the most visually representative frame.
The chapters:
1. The Creative Loop (0:00–0:40) — Overview of the multi-modal engine
2. The Nuance Threshold (0:50–1:30) — The diminishing returns chart
3. Seven-Stage Pipeline (1:30–2:20) — Full architecture walkthrough
4. Multi-Modal Analysis (2:50–3:35) — Vertex AI waveform analysis
5. 20-Song Catalog (4:10–5:10) — The evaluation grid
6. The Autonomous Halt (5:40–6:39) — sys.exit()
6 thumbnail images uploaded
13 WordPress media assets created
All via REST API — zero manual uploads
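The stream-copy cuts and poster extraction described above can be sketched as command builders. The chapter slugs and durations below are illustrative, and note that `-c copy` cuts snap to the nearest keyframe rather than an exact timestamp:

```python
def cut_cmd(src: str, start: str, duration: str, dst: str) -> list[str]:
    # -ss before -i seeks the input; -c copy avoids re-encoding, so the
    # cut is near-instant but lands on the nearest keyframe.
    return ["ffmpeg", "-ss", start, "-i", src, "-t", duration, "-c", "copy", dst]

def poster_cmd(src: str, at: str, dst: str) -> list[str]:
    # Grab a single frame at timestamp `at` to use as the chapter poster.
    return ["ffmpeg", "-ss", at, "-i", src, "-frames:v", "1", dst]

# Hypothetical chapter table: (slug, start, duration in seconds).
chapters = [("creative-loop", "0:00", "40"), ("nuance-threshold", "0:50", "40")]
cmds = [cut_cmd("explainer.mp4", start, dur, f"{slug}.mp4")
        for slug, start, dur in chapters]
```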
Step 4: WordPress Media Upload
Claude uploaded all 13 assets (7 videos + 6 thumbnails) to WordPress via the REST API using multipart binary uploads. Each file got a clean SEO filename. The uploads ran in parallel, with up to six concurrent API calls instead of a sequential queue. Total upload time: under 30 seconds for all assets.
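A minimal sketch of the upload step, assuming WordPress application-password auth. The post describes the uploads as multipart binary; the raw-body form shown here is the common REST API equivalent, and the `seo_filename` helper is hypothetical:

```python
import re
import requests
from requests.auth import HTTPBasicAuth

def seo_filename(title: str, ext: str) -> str:
    # Lowercase, hyphen-separated slug for clean media URLs.
    slug = re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")
    return f"{slug}.{ext}"

def upload_media(base_url, user, app_password, path, filename, mime="video/mp4"):
    # The media endpoint accepts the raw file bytes as the request body;
    # Content-Disposition tells WordPress what to name the attachment.
    with open(path, "rb") as f:
        return requests.post(
            f"{base_url}/wp-json/wp/v2/media",
            headers={
                "Content-Disposition": f'attachment; filename="{filename}"',
                "Content-Type": mime,
            },
            data=f,
            auth=HTTPBasicAuth(user, app_password),
        )
```

The response JSON includes the attachment ID and public URL, which the watch page can then reference.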
Step 5: The Watch Page
With all assets in WordPress, Claude built a full watch page from scratch — dark-themed, responsive, with an HTML5 video player for the full video, a 3-column grid of chapter cards (each with its own embedded player and thumbnail), a seven-stage pipeline breakdown with descriptions, stats counters, and CTAs linking to the music catalog and Machine Room.
12,184 characters of custom HTML, CSS, and JavaScript. Published to tygartmedia.com/autonomous-halt/ via a single REST API call.
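Publishing the page in a single REST call might look like this; the function names and the auth object are assumptions, not the actual session code:

```python
import requests

def page_payload(title: str, slug: str, html: str) -> dict:
    # status="publish" makes the page live immediately; "draft" would stage it.
    return {"title": title, "slug": slug, "status": "publish", "content": html}

def publish_page(base_url, auth, title, slug, html):
    # One POST to the pages endpoint creates and publishes in a single call.
    return requests.post(
        f"{base_url}/wp-json/wp/v2/pages",
        json=page_payload(title, slug, html),
        auth=auth,
    )
```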
The Tools That Made This Possible
Claude did not use any video editing software. The entire pipeline ran on tools that already existed in the session:
ffprobe — File inspection and metadata extraction
ffmpeg — Compression, chapter cutting, thumbnail extraction, format conversion
Claude Vision — Visual analysis of keyframes to identify chapter boundaries
WordPress REST API — Binary media uploads and page publishing
Python requests — API orchestration for large payloads
Bash parallel execution — Concurrent uploads to minimize total time
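The concurrent-upload pattern in the list above can be sketched with Python's standard thread pool. `run_parallel` is an illustrative name; six workers matches the concurrency described earlier:

```python
from concurrent.futures import ThreadPoolExecutor

def run_parallel(items, fn, workers=6):
    # Up to `workers` calls in flight at once; map preserves input order,
    # so results line up with the file list that produced them.
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(fn, items))
```

Threads are the right fit here because uploads are I/O-bound: the interpreter releases the GIL while each request waits on the network.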
The insight is not that Claude can run ffmpeg commands — anyone can do that. The insight is that Claude can watch the video, understand its structure, make editorial decisions about where to cut, and then execute the entire production pipeline end-to-end without human intervention at any step.
What This Means
Video editing has always been one of those tasks that felt immune to AI automation. The tools are complex, the decisions are creative, and the output is high-stakes. But most video editing is not Spielberg-level craft. Most video editing is: trim this, compress that, cut it into clips, make thumbnails, put it on the website.
Claude handled all of that in a single session. The key ingredients were:
Access to the right CLI tools — ffmpeg and ffprobe are the backbone of every professional video pipeline. Claude already knows how to use them.
Vision capability — Being able to actually see what is in the video frames turns metadata analysis into editorial judgment.
API access to the destination — WordPress REST API meant Claude could upload and publish without ever leaving the terminal.
Session persistence — The working directory maintained state across dozens of tool calls, so Claude could build iteratively.
The Bigger Picture
This is one video on one website. But the pattern scales. Connect Claude to a YouTube API and it becomes a channel manager. Connect it to a transcription service and it generates subtitles. Connect it to Vertex AI and it generates chapter summaries from audio. Connect it to a CDN and it handles global distribution.
The video you are watching on the watch page was compressed, segmented, thumbnailed, uploaded, and presented by the same AI that orchestrated the music pipeline the video is about. That is the loop closing.
Claude is not a video editor. Claude is whatever you connect it to.
