The new way to ship video
Agentic video editing
Video editing done by an AI agent — not a human clicking through a timeline. You describe what you want, the agent runs the cuts, the captions, the motion graphics, the thumbnail, the YouTube upload. You review the result and ship.
What "agentic" actually means here
The video-tools industry slapped "AI" on every product around 2023 and most of it was thin: a transcribe button here, an auto-caption pass there, a one-click thumbnail generator with three template variants. Useful, but the human still drives — click this, click that, scrub the timeline, export.
Agentic editing flips that. The AI agent — Claude, GPT, whatever — drives the editor. The human writes a prompt and reviews the output. Concretely: instead of you transcribing, then trimming fillers, then adding chapters, then writing a title, then making a thumbnail, you tell the agent "open my latest recording and make it a YouTube-ready tutorial" and walk away. Five minutes later you have an MP4, captions, title, description, chapter timestamps, and a thumbnail. You review and ship.
This requires a video editor whose every feature is a tool the agent can call — not just a UI. Most editors aren't built that way. PandaStudio is.
Why PandaStudio is built for this
First-class MCP server
30+ static tools cover the editorial workflow plus an escape hatch (pandastudio_call) for the long tail. Cursor, Continue, Cline, and Claude Desktop discover all of them automatically on connect. No copy-paste-tool-list dance.
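For clients that read a JSON config file (Claude Desktop and similar), registering the server is one entry. A sketch, assuming the npm package mentioned below (@writepanda/mcp) is launched via npx — check your client's docs for the exact config file location and invocation:

```json
{
  "mcpServers": {
    "pandastudio": {
      "command": "npx",
      "args": ["-y", "@writepanda/mcp"]
    }
  }
}
```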
Bundled Claude Skill
Claude users get a skill that teaches the model how to use the tool well — pacing rules for motion graphics, when to add zooms, what makes a strong title, when to use lower-thirds. Auto-loaded from ~/.claude/skills/pandastudio/. No prompt engineering required.
60+ verb-noun CLI
Every UI feature is also a CLI command: pandastudio transcript.delete-words, project.add-zoom, export.start. The agent calls these via MCP; you can also call them from a shell script or CI pipeline. No second-class automation surface.
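A sketch of what that CI angle could look like. The verb names come from PandaStudio's verb-noun CLI; the exact arguments and flags here are assumptions for illustration — check pandastudio --help for the real signatures:

```shell
#!/usr/bin/env sh
# Sketch only: verb names are PandaStudio's; the file path and
# --preset flag are illustrative assumptions, not documented syntax.
set -e

pandastudio project.open ./recordings/latest.mp4   # open newest capture
pandastudio transcript.transcribe                  # local transcription
pandastudio transcript.remove-fillers              # cut "um", "uh", ...
pandastudio transcript.remove-silences             # tighten long pauses
pandastudio export.start --preset 1080p-h264       # render the MP4
```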
Local-first runtime
Recordings, edits, transcription all run on your machine. The agent's tool calls go to localhost — your media never leaves your disk unless you explicitly publish. Matters for SaaS demos of internal panels, client work under NDA, anything where cloud upload is a no-go.
What it actually looks like
Three real prompts that work today:
The full pipeline
"Open my latest recording in PandaStudio. Transcribe it locally, remove fillers and long silences, add captions in the bold style, generate a YouTube title, description, chapter timestamps, and thumbnail. Export 1080p H.264, ready for upload."
Agent calls ~12 verbs end-to-end. Typical time: 5-8 minutes for a 30-minute source recording.
Targeted edit
"Add a glass-mode lower third with my name and 'Founder, Acme' at 0:14 to 0:20. Drop a zoom region focused on the dashboard at 1:32 with a swoosh sound effect. Add a chapter divider before the 'Pricing' segment."
Agent calls three verbs precisely: project.add-lower-third, project.add-zoom, project.add-motion-graphic.
Repurposing
"Take my latest YouTube export, identify the most clip-worthy 30-90s segments, and produce vertical Shorts/Reels/TikTok versions of each with captions and a title card."
Calls export.list, then the AI Shorts pipeline. Returns 5-10 vertical exports ready to publish.
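Scripted by hand, the same flow might look like this sketch. Only export.list is a documented verb; export.shorts and the file path are hypothetical stand-ins for the AI Shorts pipeline (the real verb may differ):

```shell
# Sketch only: "export.shorts" and the path are hypothetical;
# export.list is the documented verb.
pandastudio export.list                          # find the latest export
pandastudio export.shorts ./exports/tutorial-final.mp4
```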
Works with every major AI coding agent
Frequently asked questions
What is agentic video editing?
Agentic video editing is video editing done by an AI agent — a model like Claude or GPT that can use tools — rather than by a human clicking through a timeline editor. The agent reads the transcript, decides what to cut, drops in motion graphics, generates captions, writes the YouTube title and description, and exports the finished MP4. The human's role moves up the stack: review the result and ship it. PandaStudio is purpose-built for this loop.
Which AI agents work with PandaStudio?
Claude Code, Claude Desktop, Cursor, Continue, and Cline all work out of the box. PandaStudio ships an MCP (Model Context Protocol) server with 30+ first-class tools and a long-tail escape hatch (`pandastudio_call`) for any verb. Claude users get a bundled Skill that auto-loads from ~/.claude/skills/pandastudio/. Other MCP-compatible clients (any tool that implements the protocol) work via the npm-installed @writepanda/mcp package.
How is agentic editing different from AI features in regular editors?
Most 'AI video editors' bolt on individual AI features — a transcribe button, a caption generator, an auto-zoom button. Agentic editing is the inverse: the agent drives the entire editor through a typed conversation. Instead of you clicking 'transcribe' then 'remove fillers' then 'add captions' then 'generate title' across 20 minutes, you tell Claude 'make this YouTube-ready' and it runs the whole pipeline in one go. The difference shows most clearly on long-form: a 30-minute tutorial that takes 90 minutes to edit by hand takes 5 minutes of supervision in agentic mode.
What does PandaStudio's MCP server expose?
60+ verb-noun commands covering every editor feature: project lifecycle (project.new / open / save / delete), transcript editing (transcript.transcribe / delete-words / remove-fillers / remove-silences), region kinds (project.add-zoom / add-motion-graphic / add-lower-third / add-fx / add-trim / add-speed / add-clip-transform-region), motion graphics (motion.list / generate / render-html), captions (caption.toggle / set-template / set-style), AI metadata (llm.generate-title / generate-description / generate-timestamps), thumbnails (export.generate-thumbnail / edit-thumbnail), YouTube publishing (export.publish-youtube), and exports (export.start). Every UI feature is also a verb — there's no second-class automation surface.
Is agentic editing only for tech-savvy users?
No. The agent does the technical work. You write a prompt in your existing chat tool — Claude Desktop, ChatGPT-via-Cursor, etc. — like 'open my latest recording and make it a tutorial' and the agent figures out what verbs to call. PandaStudio's bundled Claude Skill teaches Claude exactly how to use the tool well, including pacing rules for motion graphics, when to add zooms, what makes a strong YouTube title, how to write captions for vertical content. The skill is auto-loaded — Claude reads it the first time you connect.
What about privacy when an agent edits internal video?
PandaStudio is local-first. Recordings live on your disk; the agent's tool calls run against the desktop app on localhost; no media bytes upload to a third-party server. The only network calls are (a) to whatever LLM API you've configured (Anthropic / OpenAI / etc. — you bring your own key and pay them directly) and (b) to YouTube, only if you explicitly publish. This matters for SaaS founders editing demos of internal admin panels, agencies editing client work, anyone whose recordings shouldn't be on someone else's cloud.
Ship video the agentic way
Free download, three free exports, no credit card. Connect your existing AI agent and run a real edit in under 10 minutes.