Style every word.
Make every caption hit.

The caption editor creators actually use.

Why captions

Sound is off by default, and most viewers never turn it on. Captions carry your message in silence, hold attention in places audio can't reach (the office, the train, a quiet room at midnight), and turn into the watch time every platform rewards with more reach.

The longer people stay, the harder every platform pushes you to the next viewer. Captions are the cheapest way to hold the scroll one second longer, then another. More of your video gets seen, by more people.

From raw clip to viral caption in 3 clicks.

Upload, style, download. That's the whole workflow. No timeline scrubbing, no plugins to install, no hour lost to heavy software for a task this simple.

01 Upload

Drop in your video

Our AI transcribes every word in seconds. Nothing to download, nothing to set up.

02 Style

Make the captions yours

Pick a preset or fine-tune any word: color, size, weight, animation. Match your brand or your mood in a few clicks.

03 Download

Export and share

Choose the quality that fits. Ready to post the moment it lands.

Built for creators

Captions you'll actually want to keep.

Transcription you can trust

Your audio is read with rare accuracy, so words and timing land exactly where they should. Start from a transcript that's right, instead of one you have to fix.

Caption in any language or script

Work in your own language, in romanized text, or a mix of both, all in one place. The captions other tools make you juggle three apps for, you make here in one.

Looks that already go viral

Start from a tap. Presets built for Shorts, Reels, and TikTok hand you a proven, scroll-stopping look in seconds.

Make every word yours

Fonts, colors, weights, spacing, position, layouts, all of it. Tune any detail until the captions feel unmistakably like you.

Captions with motion

Bounce, glitch, scale, slide. Animate a single word when you want the eye to stop right where it matters.

From upload to export in minutes

Edits that update live as you make them. No heavy software, no powerful computer.

Simple, honest pricing.

Start free. Upgrade to Pro when you need more exports and transcription. Cancel anytime.

Free
$0 /mo
  • 2 video exports / month
  • 5 min of auto-transcription / month
  • Every caption style, preset & animation
  • Word-level animated captions
  • Per-word color, font, size & effects
  • Caption in your native language
  • Different script options, native & romanized
  • 70+ languages with auto-detect
  • Export up to 4K, nothing to install
Pro (Recommended)
$7 /mo
  • Everything in Free, plus
  • 15 video exports / month
  • 60 min of auto-transcription / month
  • No watermark, clean exports
  • Cancel anytime, keep what you exported

And we're just getting started.

What we're shipping, in real time.

Next up · 6 in development
  • translate-to-any-language
  • premium-caption-packs
  • object-segmentation
  • 3d-caption-positioning
  • lyrical-video-maker
  • brand-kit

v2026.05 · plutoworld/roadmap

Help us decide

Tell us what to build next, what's broken, or what you'd love to see. Email is optional; leave it if you'd like the changelog.

Frequently asked questions

A caption editor that runs entirely in your browser. Drop in a clip, get a word-level animated caption track, then export. No install, no plugins, no heavy software. It's built for short-form video: TikTok, Reels, and Shorts.
Start free, no card required. The Free plan includes 2 video exports and 5 minutes of transcription per month, with every caption style, preset, and animation unlocked. Pro is $7/month (or $5.83/month billed annually) and raises that to 15 exports and 60 transcription minutes per month, and removes the watermark. Both meters reset every billing period.
Free-plan exports carry a small Pluto World badge in the corner. Upgrade to Pro and it's removed, so your exports come out completely clean, at the same quality.
Transcription is powered by ElevenLabs Scribe, with word-level timestamps accurate enough that most clips need zero edits. When something's off, click the word and fix it. Text, timing, and styling all live in one view.
70+ languages with automatic detection, plus native code-switching like Hinglish or Punglish. For many languages you can caption in the native script or in romanized text, including Hindi (Devanagari or Hinglish), Punjabi (Gurmukhi or Punglish), Arabic, Urdu, Bengali, and Tamil, and switch between them right in the transcript.
Yes, that's the whole point. Click any word and change its color, font, size, weight, stroke, background, animation, or timing without touching the rest of the line. Most editors hide this. We lead with it.
Upload MP4, MOV, or WebM (WebM isn't supported in Safari). Clips up to 2 minutes and 4 GB, sized for short-form. Export anywhere from 720p up to 4K, with no forced upscale or compression artifacts.
You can upload a video and edit captions without signing in. An account is only needed when you transcribe or export, so your usage, quota, and saved exports stay with you.
Ready-made caption styles that bundle font, color, shadow, animation, and effects into one click. Pick a look that matches your edit, then tweak any detail, or leave it as-is and ship.
Most tools give you a text overlay and call it done. We give you per-word control, a preset library, smart auto-segmentation, advanced effects, multiple animation modes, and live preview as you edit. It's closer to a motion graphics tool that anyone can drive.
Chrome, Edge, Firefox, and Safari. Any modern desktop browser gives you the full experience. (WebM uploads aren't supported in Safari; use MP4 or MOV there.)
Mobile is in the works. For now the editor is built for desktop. Captioning at the word level needs precision a phone screen can't really give you yet.

Your next video deserves
better captions.

Free tier with no card. Upload a clip, see your captions in seconds.

Try plutoworld captions