Last updated June 2026
12 Best Subtitle Generators in 2026
Honest picks by use case — with the standards depth (Netflix 17 CPS, BBC 15-17, line length 32-42), 2026 compliance angles (EAA, WCAG 2.1, FCC, ABNT NBR 15290), and format matrix (SRT, VTT, ASS, TTML, SCC, STL, EBU) most listicles skip.
The 2026 subtitle market splits into four categories: AI transcription-first tools (Otter, VexaScribe), AI video-editor combos (Descript, VEED, Kapwing, Submagic, CapCut), human + AI verification services (Rev, Happy Scribe), and open-source desktop editors (Aegisub, Subtitle Edit). No single tool wins for everyone — Submagic dominates TikTok styling but is wrong for a research lab; Aegisub is the broadcast professional's standard but has zero AI features; Rev's human verification is the only safe choice for FCC-regulated broadcast; CapCut is genuinely free for mobile-first creators. Below we compare all 12 ordered by where they fit best in 2026, give you a use-case matrix that's honest about tradeoffs, and tell you explicitly when to pick someone other than us. We make VexaScribe — we built this page — and we'll send you to Submagic, Aegisub, Rev, or CapCut by name when those tools are the right answer. We rank ourselves at position 4 in the list below — strong for accuracy + multilingual + affordability, but not the right pick for short-form social, broadcast compliance, or animated styling.
The essentials
Quick read of the 12 tools. Detailed breakdowns and use-case recommendations follow.
| Tool | Best for | Pricing | Free tier | Formats | Accuracy |
|---|---|---|---|---|---|
| Submagic | TikTok / Reels / Shorts styled captions | $15-49/mo | Limited trial | MP4 burn-in, SRT | 88-93% |
| CapCut | Free mobile/desktop, TikTok-native | Free / $7.99 Pro | Unlimited (free tier) | MP4 burn-in, SRT | 85-92% |
| Descript | Podcast and course editing workflow | $16-50/mo | 1h/mo free | SRT, VTT, TXT, MP4 burn-in | 93-95% |
| VexaScribe | Accuracy + multilingual SRT + EU storage | $2-20/mo | 30 min trial | SRT, VTT, TXT, DOCX, JSON, PDF | 92-95% (Whisper Large-v3) |
| VEED.io | All-in-one browser video editor | $12-59/mo | 10 min/mo (watermarked) | SRT, VTT, TXT, MP4 burn-in | 90-94% |
| Kapwing | Team collaboration, caption presets | $16-50/mo | Watermarked free | SRT, VTT, MP4 burn-in | 90-94% |
| Rev.com | Broadcast-grade human review | $0.25/min AI, $1.99/min human | Pay-per-use only | SRT, VTT, SCC, TTML, MP4 burn-in | 99% (human) |
| Happy Scribe | EU residency + 120+ languages | $17-29/mo or PAYG | 10 min trial total | SRT, VTT, STL, EBU, TTML | 94-97% |
| Aegisub | Anime fansub, broadcast precision | Free (open source) | Free forever | ASS, SSA, SRT, others | Manual entry |
| Subtitle Edit | 300+ format conversion, QA workflow | Free (open source) | Free forever | 300+ formats supported | Tool-dependent |
| Otter.ai | Live meeting captions (English/FR/ES only) | $16.99-30/mo | 300 min/mo free | SRT, VTT, TXT | 90-93% (English only) |
| YouTube Studio | Baseline for YouTube uploads | Free | Unlimited | SRT, VTT (within YouTube) | 75-85% |
Pricing and feature details verified June 2026 on vendor sites. Plans change frequently — always confirm before subscribing.
How we evaluated
Six criteria, weighted differently per use case (one tool can't win all six):
- ●Accuracy. Word Error Rate on clean audio. Whisper Large-v3 baseline ~95% English, ~92% Portuguese. Tools using older models or in-house ASR typically trail by 3-8 points.
- ●Language support. Tier 1 vs Tier 2 vs Tier 3 coverage. Whisper-based tools win here. Otter notably excludes Portuguese.
- ●Formats. SRT, VTT, ASS/SSA, TTML, SCC, MCC, STL, EBU. The format you need depends on destination — most creators only need SRT; broadcast needs TTML/SCC; European broadcast needs STL/EBU.
- ●Compliance posture. EAA (EU), WCAG 2.1 (universal), FCC 47 CFR § 79.1 (US), ABNT NBR 15290 (Brazil). Tools vary from full broadcast compliance (Rev) to no signaling (CapCut).
- ●Pricing fit. Per-minute vs subscription vs free. Effective cost at typical usage. Hidden caps and watermarks.
- ●Use-case fit. A YouTube long-form creator and a TikTok short-form creator need different tools — neither is “better,” they're shaped for different work.
Honesty statement
We make VexaScribe. We've ranked tools on the criteria above and placed ourselves at position 4 — strong for accuracy + multilingual + affordability, but not the right pick for short-form social styling, broadcast compliance, or animated captions. We sell more reliably long-term by being a credible reference than by writing a self-promotion piece that ranks for a week.
What makes a subtitle tool different from transcription
Transcription gives you a text wall. Subtitles are time-boxed text that has to be readable on screen. The difference is craft, and most AI tools don't enforce the craft rules. Here are the standards that separate professional subtitles from raw transcription:
Timing precision
Frame-accurate (Aegisub, Subtitle Edit) vs word-level (Whisper-based tools, VexaScribe) vs sentence-level (older ASR, YouTube auto-captions). Frame-accurate matters for broadcast and dubbed media; word-level is sufficient for web and social; sentence-level loses sync within 10 seconds of dialogue.
Line length standards
32-42 characters per line, maximum 2 lines per subtitle block. Netflix specifies 42 characters per line for English; BBC specifies 37-39. The 2-line maximum is universal — 3+ lines block too much of the frame. Why this matters: line length × reading speed determines whether viewers can finish reading before the subtitle disappears. AI generators routinely produce 60+ character lines that violate this rule.
Reading speed (CPS / characters per second)
Netflix: 17 CPS adult, 20 CPS children. BBC: 15-17 CPS general audience. A 6-second shot allows roughly 100 characters at 17 CPS — about 2.5 standard 42-char lines. Exceed this and viewers can't read in time. CPS is the single most overlooked subtitle quality metric. Aegisub and Subtitle Edit calculate CPS per block and flag violations; most AI tools (Submagic, VEED, CapCut) don't expose CPS metrics.
Styling
Font, color, position, background opacity, fade in/out. SRT supports none of this. VTT supports basic styling. ASS/SSA supports full control (Aegisub workflow). Burned-in MP4 lets you embed any styling — but viewers can't turn it off, which fails WCAG 2.1 SC 1.4.5 (User Control).
Burn-in vs separate file (hardcoded vs softcoded)
Burn-in (hardcoded): permanent pixels in the video, can't be turned off. Use for: Instagram Reels, TikTok native uploads, client-delivered final cuts. Separate file (softcoded SRT/VTT): viewers can toggle, retranslate, restyle. Use for: YouTube, web video, accessibility compliance, multi-language distribution. Most professional workflows ship both — softcoded SRT for accessibility + burned-in version for platforms that strip captions.
Multi-language tracks
Separate SRT file per language. Standard naming: video.en.srt, video.es.srt, video.pt.srt. YouTube and Vimeo auto-detect this pattern. Happy Scribe (120+ languages) and Maestra are the strongest options for multi-language export with translation review; VexaScribe and Descript handle the export workflow for in-house translation. For complex multi-language broadcast (Netflix-style), TTML with embedded language tags is the standard.
Translation workflow
AI translation + human review is the 2026 industry standard for paid distribution. Pure AI translation is acceptable for personal vlogs, internal content, draft pitches. Always have a native speaker verify final subtitles before public release in languages with cultural nuance (anything past Tier 1 European languages).
The 12 best subtitle generation tools in 2026
Roughly ordered by mainstream creator fit in 2026: short-form video specialists first, all-in-one editors and transcription-first tools next, then verified-human and open-source desktop, with live captioning and YouTube's native baseline last. Not a strict ranking — the use-case matrix below is the better guide for who picks what.
Submagic
AI-styled animated captions purpose-built for short-form vertical video.
Best for
TikTok, Instagram Reels, YouTube Shorts creators who need viral-style animated text overlays. Marketing teams producing high-volume social-first content.
Standout
Pre-built caption templates that emulate top-performing creator styles (MrBeast, Alex Hormozi). Word-by-word animation, emoji insertion, B-roll suggestions. Built for engagement metrics, not transcription accuracy.
Pricing
$15-49/month depending on minutes and team seats.
Formats
MP4 with burned-in subtitles primarily. SRT export available.
Free tier
Limited trial — typically a few exports before requiring subscription.
Honest limitations
Not designed for long-form content (lectures, podcasts, interviews). Accuracy is good but not best-in-class — focus is styling, not WER. No multi-language workflow comparable to Happy Scribe or VexaScribe.
CapCut
Free mobile and desktop editor with auto-captions and TikTok-native styling.
Best for
Mobile-first creators on a budget. TikTok creators who want platform-native styling. Anyone testing video creation before committing to paid tools.
Standout
Genuinely free with no watermark and no time limits — the best free option for finished short-form video. Strong AI features (auto-caption, auto-cut, voice changer) typically only in paid tools.
Pricing
Free / $7.99 Pro for advanced AI features.
Formats
MP4 with burn-in primarily. SRT export.
Free tier
Unlimited free use of core editor + auto-captions.
Honest limitations
Owned by ByteDance — data privacy concerns relevant for corporate, government, healthcare, and EU users. Styling presets heavily TikTok-shaped (less polish for LinkedIn/corporate). No GDPR/LGPD signaling.
Descript
Edit video by editing transcript — delete a sentence in text, the video updates.
Best for
Podcasters and course creators who already think of their content as “text with audio attached.” Teams doing audiogram production for social.
Standout
Text-based editing paradigm is genuinely productivity-changing for talking-head content. AI features (Studio Sound, Overdub voice cloning, filler-word removal) are mature. Subtitle export is clean.
Pricing
$16-50/month. 1 hour/month free tier.
Formats
SRT, VTT, TXT, MP4 with burn-in.
Free tier
1 hour/month transcription on free tier.
Honest limitations
Learning curve — the transcript-edits-video paradigm takes practice. Voice cloning (Overdub) raises ethical concerns for some use cases. Subscription model adds up for occasional users.
VexaScribe
Whisper Large-v3 accuracy across 99 languages with EU storage and clean SRT/VTT export.
Best for
Creators and teams who prioritize transcription accuracy and multilingual workflows. Brazilian, European, and LATAM markets needing affordable BRL/EUR pricing. Anyone who wants a clean SRT/VTT file rather than burned-in styled captions.
Standout
Whisper Large-v3 accuracy at $2/month entry pricing — among the cheaper subscription options in this list. 99 languages with strong PT-BR Tier 1 support (rare among English-first competitors). Audio and transcripts stored in AWS eu-west-2 (London).
Pricing
$2-20/month. 30-minute free trial on signup, no credit card.
Formats
SRT, VTT, TXT, DOCX, JSON, PDF. Single-file or bulk batch (up to 50 files).
Free tier
30-minute trial (single-use).
Honest limitations
No burn-in workflow — exports SRT/VTT for use in your video editor. No animated text styling or templates (use Submagic or CapCut for short-form social). No real-time live captioning (use Otter.ai for live meetings). Transcription inference runs via Replicate (US-based) — storage is EU but compute touches US infrastructure. For strict EU-only processing, Happy Scribe is the closer fit.
VEED.io
Browser-based all-in-one video editor with auto-subtitles, burn-in, and translation.
Best for
Solo creators and small teams who want one tool for the full editing workflow — trim, cut, caption, translate, export. No software install.
Standout
Subtitle styling templates, multi-language translation in 100+ languages, brand kit for consistent caption styling across a creator's library. Strong free tier for testing.
Pricing
$12-59/month. Free tier with 10 min/month and watermark.
Formats
SRT, VTT, TXT, MP4 with burn-in.
Free tier
10 minutes/month with VEED watermark on output.
Honest limitations
Subscription gets pricey at scale ($59/mo Pro vs comparable Descript at $50/mo). Browser editor performance limits long-form (1h+) projects compared to desktop tools.
Kapwing
Browser-based collaborative editor with caption styling presets and team workflows.
Best for
Marketing and social teams needing collaboration features. Educators producing classroom content.
Standout
Real-time collaboration is genuinely useful for team-produced content. Caption styling presets and brand kits. Strong template library for repeatable formats.
Pricing
$16-50/month. Free tier with watermark.
Formats
SRT, VTT, MP4 with burn-in.
Free tier
Watermarked free output.
Honest limitations
Performance ceiling on long-form projects (browser-based). Watermark on free tier removes much of the free-tier value. AI features less aggressive than VEED or Descript.
Rev.com
Industry-standard human + AI caption service with broadcast-grade accuracy guarantee.
Best for
Broadcast TV under FCC regulation. Legal proceedings. Medical content. WCAG 2.1 AAA compliance projects. Any context where ~99% accuracy and human verification are required.
Standout
Human-verified captions at 99% accuracy with FCC-compliant SCC and broadcast format support. The default choice for regulated industries. Mature SLA + dedicated account management at enterprise tier.
Pricing
$0.25/min AI, $1.99/min human. No subscription required (PAYG).
Formats
SRT, VTT, SCC, MCC, TTML, MP4 burn-in.
Free tier
Pay-per-use only.
Honest limitations
Per-minute pricing makes it expensive at scale. $1.99/min human × 1h = $119 — fine for one deposition, brutal for a podcast archive. No flat-rate option. US-hosted (Cloud Act considerations for non-US clients).
Happy Scribe
EU-based subtitle service with 120+ languages and broadcast format support.
Best for
European media operations needing EU residency. Documentary and broadcast workflows. Multilingual content with professional translation review.
Standout
STL and EBU format support (European broadcast standards) that few competitors offer. Optional human review tier for compliance projects. Barcelona hosting — EU-GDPR native.
Pricing
$17-29/month subscription or pay-per-minute. Add-on human review extra.
Formats
SRT, VTT, STL, EBU, TTML.
Free tier
10-minute trial — single use, not recurring.
Honest limitations
Trial is small (10 min) — hard to evaluate properly. Pay-per-minute at scale matches Rev's pricing concern. Subscription cap on minutes limits high-volume use.
Aegisub
Free open-source desktop editor — the broadcast and anime fansub standard.
Best for
Anime fansub communities. Broadcast TV professionals. Any project requiring frame-accurate timing and full styling control (font, color, position, fade, rotation).
Standout
Full ASS/SSA styling control with frame-accurate karaoke timing. Audio waveform editor. Free forever, no subscription, no limits, no watermark. The de facto standard for professional subtitle craft.
Pricing
Free (open source under BSD license).
Formats
ASS, SSA, SRT, MicroDVD, SubViewer, and more.
Free tier
Free forever.
Honest limitations
No AI auto-caption — designed for manual entry with audio cue reference. Steep learning curve. UI is dated (last major release 2022). No translation features. Not the right tool for casual creators — built for craftsmen.
Subtitle Edit
Free open-source desktop editor supporting 300+ subtitle formats with waveform editing.
Best for
Format conversion (SRT to TTML for Netflix, SRT to SCC for broadcast, etc). QA workflow for AI-generated subtitles. Any project needing format flexibility.
Standout
Reads and writes 300+ subtitle formats — by far the broadest format library in the industry. Spell check, OCR for image-based subtitles (DVD, Blu-ray), waveform visualization, batch convert.
Pricing
Free (open source under GPL).
Formats
300+ formats supported.
Free tier
Free forever.
Honest limitations
Windows-first (runs on macOS via Mono with rough edges). UI is functional but dated. No AI auto-caption — built around manual or imported subtitle files for editing/conversion.
Otter.ai
Live meeting captions with speaker labels — built for Zoom, Teams, Google Meet.
Best for
Live meeting transcription and captioning (Zoom, Teams, Meet). Real-time accessibility for live events.
Standout
Live caption quality is consistently among the best for English meetings. Voice profiles for recurring meeting participants. Integration with major meeting platforms.
Pricing
$16.99-30/month. 300 min/month free tier.
Formats
SRT, VTT, TXT.
Free tier
300 min/month — but ENGLISH ONLY (also French and Spanish on paid tiers).
Honest limitations
Does NOT support Portuguese in 2026 — English, French, and Spanish only. For Brazilian, Portuguese, or LATAM use, Otter is functionally useless. US-hosted (Cloud Act). Primarily transcription, not subtitle styling — exports basic SRT without timing tuning for visual presentation.
YouTube Studio
Free native auto-captions for YouTube uploads — the universal baseline.
Best for
YouTube creators who want zero-friction captions as a starting point. Any video destined for YouTube only.
Standout
Free, automatic, zero setup. Integrated directly into the YouTube upload workflow. Supports community-contributed translations.
Pricing
Free with YouTube account.
Formats
SRT, VTT export (within YouTube ecosystem).
Free tier
Unlimited free use.
Honest limitations
Quality 75-85% on English, lower on other languages. No speaker identification. No sound effect cues. No closed-caption metadata (insufficient for WCAG 2.1 AAA, FCC, or EAA compliance). Export limited to YouTube ecosystem — can't easily push to other platforms.
Honorable mentions
Tools we considered but didn't rank in the main 12, with the reason:
- ●Maestra (multilingual subtitle + dubbing — voice cloning ethically controversial)
- ●Riverside (podcast studio with caption export)
- ●Opus Clip (AI shorts with auto-generated styled captions)
- ●Adobe Premiere Pro (NLE built-in auto-captions for pro editors)
- ●DaVinci Resolve (free NLE with strong auto-caption in Studio version)
- ●Final Cut Pro (Apple NLE auto-captions)
- ●AssemblyAI (developer API for building subtitle features into apps)
- ●Deepgram (developer API alternative to AssemblyAI)
By use case: who picks what
Use case shapes the right tool more than abstract rankings. Match yours to the row below.
| Persona | Primary | Alternative | Why |
|---|---|---|---|
| YouTube long-form creator | Descript | VexaScribe | Transcript-based editing for talking-head content; VexaScribe for SRT export without the editor overhead |
| TikTok / Reels / Shorts creator | Submagic | CapCut | Animated styled captions optimized for short-form engagement; CapCut free if budget-constrained |
| Podcaster making audiograms | Descript | Riverside or Opus Clip | Descript's transcript-editing paradigm is built for podcast workflows; Riverside if you record remote interviews; Opus Clip for AI-generated shorts with captions |
| Corporate L&D / training | Rev (human) | Happy Scribe | Human-verified accuracy for WCAG/EAA compliance audits in regulated industries |
| Film / TV professional | Aegisub + Subtitle Edit | Premiere Pro | Frame-accurate ASS styling, broadcast format conversion (TTML, SCC, STL, EBU) |
| University educator | Otter.ai (live) | VexaScribe | Otter for live lecture captions; VexaScribe for post-class SRT export in multiple languages |
| Live streamer | Otter.ai (live) | YouTube live auto-captions | Real-time captioning during stream |
| Documentary / foreign-language | Happy Scribe | VexaScribe | Happy Scribe's 120+ languages + EU residency + optional human translation review; VexaScribe is the cheaper alternative for in-house translation workflows |
| Accessibility-first (WCAG/EAA) | Rev (human verified) | Happy Scribe + manual review | Human verification is the safest path for AA/AAA compliance audit |
| Brazil / LATAM creator (PT-BR) | VexaScribe | CapCut (free) | Whisper Tier 1 PT-BR accuracy at BRL-friendly pricing — Otter is non-viable here (English/FR/ES only); CapCut if free is the priority |
Subtitle compliance in 2026: EAA, WCAG, FCC, ABNT
The regulatory environment shifted significantly in 2025-2026. If you produce video content for any regulated market, this section matters.
European Accessibility Act (Directive 2019/882)
Enforcement began 28 June 2025 with phased compliance through 2030. Fines vary by Member State — some up to €50,000 per violation. Affects: e-commerce, banking video customer service, video-on-demand (Netflix-style), e-books, e-readers, e-learning platforms. Practical impact: any company selling video products or services to EU customers must caption their video content to WCAG 2.1 AA-equivalent quality. AI-generated subtitles are permitted; quality must be sufficient to convey audio meaning equivalently.
WCAG 2.1 (Universal)
Success Criterion 1.2.2 (Captions, Prerecorded) — Level A: captions must be provided for all prerecorded audio in synchronized media. SC 1.2.4 (Captions, Live) — Level AA: captions for live audio. SC 1.4.5 (Images of Text) — implicates burn-in subtitles which can't be turned off. Tools that help: Rev (human verified), VexaScribe (high AI accuracy + manual review workflow), Subtitle Edit (QA verification), Aegisub (broadcast precision).
FCC 47 CFR § 79.1 (US)
Applies to broadcast TV, cable, satellite, and OTT services in the US. Requires near-perfect closed captions including speaker identification and sound effects. AI captions alone do NOT satisfy FCC quality standards — human verification is the de facto requirement. Format: SCC for traditional broadcast, MCC for enhanced content. Rev.com is the industry default.
ABNT NBR 15290:2016 (Brazil)
Brazilian standard for accessibility in audiovisual communication. Defines closed-caption requirements for TV broadcast and increasingly enforced for video-on-demand and government communication. Aligned with LBI (Lei 13.146/2015 — Brazilian Inclusion Law). Practical compliance: SRT with proper speaker identification, ABNT-compliant line length (32-42 chars typical), proper reading speed (~17 CPS). We cover Brazilian compliance in depth in our PT-BR subtitle guide.
Format cheat-sheet
Which format you need depends on destination platform and use case. The vast majority of creators only need SRT.
| Format | What it is | When to use | Limitations |
|---|---|---|---|
| SRT | Universal text format with timestamps. No styling. | Default choice. YouTube, social, NLEs, most use cases. | No bold/italic, no positioning, no color. |
| VTT | Web standard, supports basic styling (bold, italic, positioning). | HTML5 video players, web platforms. | Less universal than SRT outside web. |
| ASS / SSA | Advanced SubStation Alpha — full styling control. | Anime fansub, karaoke, broadcast precision, Aegisub workflow. | Not supported by simple players or most social platforms. |
| TTML / DFXP | XML-based, Netflix and broadcast standard. Complex styling. | Netflix delivery, professional broadcast. | Requires specialized tools to produce/edit correctly. |
| SCC | Scenarist Closed Captions — FCC-compliant US broadcast. | US TV broadcast under FCC 47 CFR § 79.1. | US-specific. Not for general web/video. |
| MCC | MacCaption Closed Captions — enhanced SCC variant. | US broadcast TV with extended character set. | Even more US-specific than SCC. |
| STL / EBU | European Broadcasting Union subtitle standards. | European broadcast TV delivery. | Region-specific to European broadcast. |
| MP4 burn-in | Subtitles burned permanently into video pixels. | Social platforms without subtitle support (Reels, TikTok native). Final deliverables. | Can't be turned off; can't be retranslated; doesn't satisfy WCAG 1.4.5 (user control). |
Need to convert between formats? Subtitle Edit reads and writes 300+ formats — it's the free utility every subtitle workflow eventually needs. Need to generate SRT from audio quickly? Try our SRT generator.
When we'd send you elsewhere
We make VexaScribe — we placed ourselves at position 4 in this list, which we think is honest. Here's where we'd explicitly send you to a different tool:
- ●Use Submagic if you produce TikTok, Reels, or YouTube Shorts and need viral-style animated word-by-word captions. We don't do animated text styling.
- ●Use CapCut if you're on a tight budget, work primarily on mobile, and produce short-form content for TikTok or Instagram. CapCut is genuinely free with no watermark.
- ●Use Descript if you're a podcaster or course creator who wants the transcript-edits-video paradigm. We don't have an integrated video editor.
- ●Use Rev.com (human) if you're producing broadcast TV under FCC regulation, legal evidence transcripts, or anything requiring 99% verified accuracy. AI tools including us are not appropriate for FCC-compliant captions.
- ●Use Happy Scribe if you need strict EU-only processing (we currently use Replicate for Whisper inference, which means audio touches US infrastructure during transcription even though storage is in London) or if you need broadcast formats like STL and EBU.
- ●Use Aegisub + Subtitle Edit if you work in anime fansub, professional broadcast, or any project requiring frame-accurate styling with full ASS/SSA control. We export clean SRT/VTT — not designed for fansub craft.
- ●Use Otter.ai if you need real-time live captions during English-language Zoom or Teams meetings. We're file-upload based, not live.
- ●Use AssemblyAI or Deepgram if you're a developer building subtitle features into your own product via API. They have mature batch APIs; we're UI-first.
If your use case isn't above — long-form YouTube without the editor overhead, educators producing multilingual lecture captions, or PT-BR work where Otter doesn't reach — we're a reasonable fit. Try the 30-minute free trial.
Frequently asked questions
What is the best free subtitle generator in 2026?
Depends on the source. For YouTube creators: YouTube Studio's built-in auto-captions are free and adequate as a baseline (~75-85% accuracy). For TikTok/Reels: CapCut is free with no watermark and styled caption templates. For desktop power users: Subtitle Edit (open-source, Windows-first) and Aegisub (open-source, cross-platform) are completely free with no upload limits. For accuracy + multilingual + privacy: VexaScribe's 30-minute free trial uses Whisper Large-v3 — the same model OpenAI charges $0.006/min for via API. Honest caveat: free tiers from VEED, Submagic, and Happy Scribe are usually capped at 10-30 minutes total trial — not a free tier you can keep using monthly.
What's the difference between subtitles and closed captions?
Subtitles assume the viewer can hear the audio and just need translation or text reinforcement — they typically transcribe dialogue only. Closed captions (CC) assume the viewer cannot hear and include speaker identification ([JOHN]), sound effects ([door slams]), and music cues ([upbeat music]). In the US, FCC rules 47 CFR § 79.1 require closed captioning on broadcast TV and many OTT services. In the EU, the European Accessibility Act (Directive 2019/882) phases in similar requirements through 2030. We cover this in depth at /captions-vs-subtitles.
Are AI-generated subtitles accurate enough for broadcast?
For broadcast TV under FCC regulation: no, not without human review. FCC quality standards require near-perfect captions, and Whisper Large-v3 at ~95% accuracy on clean English audio still produces ~50 errors per 1,000 words — unacceptable for compliance. The industry standard is AI-first-pass + human-review-second-pass: services like Rev.com or 3Play Media. For non-broadcast use (YouTube, social media, internal corporate training, courses), AI is widely acceptable and is what most creators ship. For WCAG 2.1 accessibility compliance on websites, AI captions with a brief manual review of speaker labels and proper nouns typically passes audit.
What subtitle format should I use — SRT, VTT, or ASS?
SRT for almost everything — universal compatibility, supported by YouTube, Premiere, DaVinci, Final Cut, CapCut, VLC, and every social platform. Use VTT specifically for HTML5 web video players and when you need basic styling (bold, italic, positioning). Use ASS/SSA only for anime fansub, karaoke, or when you need precise styling control (font, color, position, fade, rotation) — Aegisub is the standard editor. TTML is for Netflix and broadcast TV (XML-based, complex styling). SCC/MCC for US broadcast FCC compliance. STL/EBU for European broadcast. As a creator: start with SRT, branch out only when a destination platform requires something else.
How long should each subtitle line be?
Industry standard: 32-42 characters per line, maximum 2 lines per subtitle block. Netflix specifies 42 characters per line for English; BBC specifies 37-39. The 2-line maximum is universal — 3+ lines block too much visual content. Why this matters: line length × reading speed determines whether viewers can finish reading before the subtitle disappears. If you exceed 42 chars on 2 lines (84 chars) at the typical 2-3 second display duration, viewers are forced to choose between watching the video and reading the subtitle. Most AI subtitle generators don't enforce these limits automatically — manual review or post-processing in Subtitle Edit/Aegisub is often needed for professional output.
What is the Netflix reading speed standard?
Netflix specifies 17 characters per second (CPS) for adult content, 20 CPS for children's content. BBC specifies 15-17 CPS as the accessibility floor for general audiences. The math: a 6-second shot allows roughly 100 characters of subtitle text at 17 CPS — about 2.5 standard 42-char lines. If your subtitle exceeds this, viewers can't read it in time. CPS is the single most overlooked subtitle quality metric. Aegisub and Subtitle Edit both calculate CPS per block and flag violations. Most AI tools (Submagic, VEED, CapCut) don't expose CPS metrics — they prioritize visual styling over reading speed. For accessibility-first projects (educational, healthcare, government), enforce CPS limits manually.
Do YouTube auto-captions meet WCAG 2.1 accessibility standards?
Not reliably. WCAG 2.1 Success Criterion 1.2.2 (Captions, Prerecorded) is Level A — captions must be provided. YouTube auto-captions technically satisfy “captions exist” but typically fail on accuracy (75-85%), speaker identification, sound effects, and music cues. WCAG implicitly requires captions to be accurate enough to convey meaning equivalent to the audio. For most academic, government, and corporate sites under WCAG 2.1 AA audit, YouTube auto-captions alone are insufficient — manual review or a higher-accuracy AI tool (VexaScribe Whisper Large-v3, Rev human review) is the safer choice. For personal vlogs or non-regulated content, YouTube auto-captions are commonly accepted.
What does the European Accessibility Act require for video subtitles?
The European Accessibility Act (Directive 2019/882) requires accessible captioning on video content provided by audiovisual media services, video-on-demand, and e-commerce platforms operating in the EU. Enforcement began 28 June 2025 with phased compliance through 2030. Member state fines vary: some impose up to €50,000 per violation. Affected services include Netflix-style VOD, e-learning platforms, e-commerce product videos, banking video customer service, and e-book audio narration. Practical impact: any company selling video products or services to EU customers must caption their video content to WCAG 2.1 AA-equivalent quality. AI-generated subtitles are permitted; quality must be sufficient to convey audio meaning equivalently.
Can I burn subtitles directly into the video file?
Yes — that's called hardcoded or burned-in subtitles. They become permanent pixels in the video and can't be turned off. Use burn-in when: the destination platform doesn't support separate subtitle tracks (Instagram Reels, TikTok native uploads), you want guaranteed styling consistency across all viewers, or you're producing a final deliverable for clients. Use separate SRT/VTT files when: viewers may want to toggle captions on/off (YouTube, web video), accessibility regulation requires user control over caption display (WCAG 2.1 SC 1.4.5), or you'll later need to translate to additional languages. Tools for burning: VEED, Kapwing, CapCut, Submagic (UI-based); FFmpeg (command line, free); Adobe Premiere, DaVinci Resolve, Final Cut Pro (NLE built-ins).
What's the best subtitle tool for translating videos into multiple languages?
Three tiers. For volume and language coverage: Happy Scribe (120+ languages, EU-based, professional translation review available). For free starting point: VexaScribe (99 languages with Whisper Large-v3, Tier 1 includes major European, Asian, and Arabic markets) + manual review. For full dub + subtitle workflow: Maestra (combines subtitle generation with optional voice cloning — though we recommend skipping voice cloning for ethical reasons documented in our Brazilian Portuguese guide on the Dublagem Viva movement). Honest read: translation quality from any AI tool is roughly equivalent to mid-tier machine translation — Tier 1 languages (English ↔ Spanish/French/German/Portuguese) are good; lower-resource pairs (English ↔ Swahili/Tamil) need manual review. Always have a native speaker verify final subtitles for paid distribution.
Methodology and sources
- ● Pricing and feature details verified June 2026 on each vendor's site. Plans change frequently — confirm before subscribing.
- ● Whisper Large-v3 accuracy benchmarks: OpenAI Whisper paper (Radford et al. 2022); Distil-Whisper PT-BR Common Voice Brasil 8.22% WER (~92% accuracy).
- ● Netflix Timed Text Style Guide — Partner Help Center (17 CPS adult, 20 CPS children, 42 chars/line English).
- ● BBC Subtitle Guidelines — bbc.github.io/subtitle-guidelines (15-17 CPS, 37-39 chars/line, accessibility floor).
- ● W3C WCAG 2.1 — Success Criteria 1.2.2 (Captions Prerecorded, Level A), 1.2.4 (Captions Live, Level AA), 1.4.5 (Images of Text).
- ● European Accessibility Act — Directive (EU) 2019/882. Enforcement 28 June 2025, phased compliance through 2030.
- ● FCC Closed Captioning — 47 CFR § 79.1.
- ● ABNT NBR 15290:2016 — Brazilian audiovisual accessibility standard. Aligned with LBI (Lei 13.146/2015).
- ● WebVTT specification — W3C.
- ● SRT format reference — Matroska SubRip de facto standard.
- ● SSA/ASS specification — Aegisub documentation.
- ● DCMP Captioning Key — Described and Captioned Media Program (US educational standards).
- ● Otter.ai supported languages: verified on otter.ai/languages June 2026 — English, French, Spanish only.
Related guides
SRT generator
Direct SRT/VTT output from audio or video
How to add subtitles to video
Tutorial — burning vs softcoded, tool walkthroughs
Video to SRT
SRT from video file directly
Captions vs subtitles
The legal and practical difference (FCC, WCAG)
Best podcast transcription tools
Sibling listicle for podcast-specific workflows
Best diarization tools
Speaker separation depth across 14 tools
AI transcription
How Whisper Large-v3 actually works
Bulk transcription
50-file batches for agencies and research labs
Transcribe audio to text
General-purpose audio transcription guide
Pricing
VexaScribe plans
All comparisons
Compare hub — all listicles and tool comparisons
Best transcription software 2026
Broader transcription tools listicle