12 Best Subtitle Generators in 2026: Honest Picks by Use Case

The essentials

Quick read of the 12 tools. Detailed breakdowns and use-case recommendations follow.

Tool	Best for	Pricing	Free tier	Formats	Accuracy
Submagic	TikTok / Reels / Shorts styled captions	$15-49/mo	Limited trial	MP4 burn-in, SRT	88-93%
CapCut	Free mobile/desktop, TikTok-native	Free / $7.99 Pro	Unlimited (free tier)	MP4 burn-in, SRT	85-92%
Descript	Podcast and course editing workflow	$16-50/mo	1h/mo free	SRT, VTT, TXT, MP4 burn-in	93-95%
VexaScribe	Accuracy + multilingual SRT + EU storage	$2-20/mo	30 min trial	SRT, VTT, TXT, DOCX, JSON, PDF	92-95% (Whisper Large-v3)
VEED.io	All-in-one browser video editor	$12-59/mo	10 min/mo (watermarked)	SRT, VTT, TXT, MP4 burn-in	90-94%
Kapwing	Team collaboration, caption presets	$16-50/mo	Watermarked free	SRT, VTT, MP4 burn-in	90-94%
Rev.com	Broadcast-grade human review	$0.25/min AI, $1.99/min human	Pay-per-use only	SRT, VTT, SCC, TTML, MP4 burn-in	99% (human)
Happy Scribe	EU residency + 120+ languages	$17-29/mo or PAYG	10 min trial total	SRT, VTT, STL, EBU, TTML	94-97%
Aegisub	Anime fansub, broadcast precision	Free (open source)	Free forever	ASS, SSA, SRT, others	Manual entry
Subtitle Edit	300+ format conversion, QA workflow	Free (open source)	Free forever	300+ formats supported	Tool-dependent
Otter.ai	Live meeting captions (English/FR/ES only)	$16.99-30/mo	300 min/mo free	SRT, VTT, TXT	90-93% (English only)
YouTube Studio	Baseline for YouTube uploads	Free	Unlimited	SRT, VTT (within YouTube)	75-85%

Pricing and feature details verified June 2026 on vendor sites. Plans change frequently — always confirm before subscribing.

How we evaluated

Six criteria, weighted differently per use case (one tool can't win all six):

●Accuracy. Word Error Rate on clean audio. Whisper Large-v3 baseline ~95% English, ~92% Portuguese. Tools using older models or in-house ASR typically trail by 3-8 points.
●Language support. Tier 1 vs Tier 2 vs Tier 3 coverage. Whisper-based tools win here. Otter notably excludes Portuguese.
●Formats. SRT, VTT, ASS/SSA, TTML, SCC, MCC, STL, EBU. The format you need depends on destination — most creators only need SRT; broadcast needs TTML/SCC; European broadcast needs STL/EBU.
●Compliance posture. EAA (EU), WCAG 2.1 (universal), FCC 47 CFR § 79.1 (US), ABNT NBR 15290 (Brazil). Tools vary from full broadcast compliance (Rev) to no signaling (CapCut).
●Pricing fit. Per-minute vs subscription vs free. Effective cost at typical usage. Hidden caps and watermarks.
●Use-case fit. A YouTube long-form creator and a TikTok short-form creator need different tools — neither is “better,” they're shaped for different work.

Honesty statement

We make VexaScribe. We've ranked tools on the criteria above and placed ourselves at position 4 — strong for accuracy + multilingual + affordability, but not the right pick for short-form social styling, broadcast compliance, or animated captions. We sell more reliably long-term by being a credible reference than by writing a self-promotion piece that ranks for a week.

What makes a subtitle tool different from transcription

Transcription gives you a text wall. Subtitles are time-boxed text that has to be readable on screen. The difference is craft, and most AI tools don't enforce the craft rules. Here are the standards that separate professional subtitles from raw transcription:

Timing precision

Frame-accurate (Aegisub, Subtitle Edit) vs word-level (Whisper-based tools, VexaScribe) vs sentence-level (older ASR, YouTube auto-captions). Frame-accurate matters for broadcast and dubbed media; word-level is sufficient for web and social; sentence-level loses sync within 10 seconds of dialogue.

Line length standards

32-42 characters per line, maximum 2 lines per subtitle block. Netflix specifies 42 characters per line for English; BBC specifies 37-39. The 2-line maximum is universal — 3+ lines block too much of the frame. Why this matters: line length × reading speed determines whether viewers can finish reading before the subtitle disappears. AI generators routinely produce 60+ character lines that violate this rule.

Reading speed (CPS / characters per second)

Netflix: 17 CPS adult, 20 CPS children. BBC: 15-17 CPS general audience. A 6-second shot allows roughly 100 characters at 17 CPS — about 2.5 standard 42-char lines. Exceed this and viewers can't read in time. CPS is the single most overlooked subtitle quality metric. Aegisub and Subtitle Edit calculate CPS per block and flag violations; most AI tools (Submagic, VEED, CapCut) don't expose CPS metrics.

Styling

Font, color, position, background opacity, fade in/out. SRT supports none of this. VTT supports basic styling. ASS/SSA supports full control (Aegisub workflow). Burned-in MP4 lets you embed any styling — but viewers can't turn it off, which fails WCAG 2.1 SC 1.4.5 (User Control).

Burn-in vs separate file (hardcoded vs softcoded)

Burn-in (hardcoded): permanent pixels in the video, can't be turned off. Use for: Instagram Reels, TikTok native uploads, client-delivered final cuts. Separate file (softcoded SRT/VTT): viewers can toggle, retranslate, restyle. Use for: YouTube, web video, accessibility compliance, multi-language distribution. Most professional workflows ship both — softcoded SRT for accessibility + burned-in version for platforms that strip captions.

Multi-language tracks

Separate SRT file per language. Standard naming: video.en.srt, video.es.srt, video.pt.srt. YouTube and Vimeo auto-detect this pattern. Happy Scribe (120+ languages) and Maestra are the strongest options for multi-language export with translation review; VexaScribe and Descript handle the export workflow for in-house translation. For complex multi-language broadcast (Netflix-style), TTML with embedded language tags is the standard.

Translation workflow

AI translation + human review is the 2026 industry standard for paid distribution. Pure AI translation is acceptable for personal vlogs, internal content, draft pitches. Always have a native speaker verify final subtitles before public release in languages with cultural nuance (anything past Tier 1 European languages).

The 12 best subtitle generation tools in 2026

Roughly ordered by mainstream creator fit in 2026: short-form video specialists first, all-in-one editors and transcription-first tools next, then verified-human and open-source desktop, with live captioning and YouTube's native baseline last. Not a strict ranking — the use-case matrix below is the better guide for who picks what.

Submagic

AI-styled animated captions purpose-built for short-form vertical video.

Best for

TikTok, Instagram Reels, YouTube Shorts creators who need viral-style animated text overlays. Marketing teams producing high-volume social-first content.

Standout

Pre-built caption templates that emulate top-performing creator styles (MrBeast, Alex Hormozi). Word-by-word animation, emoji insertion, B-roll suggestions. Built for engagement metrics, not transcription accuracy.

Pricing

$15-49/month depending on minutes and team seats.

Formats

MP4 with burned-in subtitles primarily. SRT export available.

Free tier

Limited trial — typically a few exports before requiring subscription.

Honest limitations

Not designed for long-form content (lectures, podcasts, interviews). Accuracy is good but not best-in-class — focus is styling, not WER. No multi-language workflow comparable to Happy Scribe or VexaScribe.

CapCut

Free mobile and desktop editor with auto-captions and TikTok-native styling.

Best for

Mobile-first creators on a budget. TikTok creators who want platform-native styling. Anyone testing video creation before committing to paid tools.

Standout

Genuinely free with no watermark and no time limits — the best free option for finished short-form video. Strong AI features (auto-caption, auto-cut, voice changer) typically only in paid tools.

Pricing

Free / $7.99 Pro for advanced AI features.

Formats

MP4 with burn-in primarily. SRT export.

Free tier

Unlimited free use of core editor + auto-captions.

Honest limitations

Owned by ByteDance — data privacy concerns relevant for corporate, government, healthcare, and EU users. Styling presets heavily TikTok-shaped (less polish for LinkedIn/corporate). No GDPR/LGPD signaling.

Descript

Edit video by editing transcript — delete a sentence in text, the video updates.

Best for

Podcasters and course creators who already think of their content as “text with audio attached.” Teams doing audiogram production for social.

Standout

Text-based editing paradigm is genuinely productivity-changing for talking-head content. AI features (Studio Sound, Overdub voice cloning, filler-word removal) are mature. Subtitle export is clean.

Pricing

$16-50/month. 1 hour/month free tier.

Formats

SRT, VTT, TXT, MP4 with burn-in.

Free tier

1 hour/month transcription on free tier.

Honest limitations

Learning curve — the transcript-edits-video paradigm takes practice. Voice cloning (Overdub) raises ethical concerns for some use cases. Subscription model adds up for occasional users.

VexaScribe

Whisper Large-v3 accuracy across 99 languages with EU storage and clean SRT/VTT export.

Best for

Creators and teams who prioritize transcription accuracy and multilingual workflows. Brazilian, European, and LATAM markets needing affordable BRL/EUR pricing. Anyone who wants a clean SRT/VTT file rather than burned-in styled captions.

Standout

Whisper Large-v3 accuracy at $2/month entry pricing — among the cheaper subscription options in this list. 99 languages with strong PT-BR Tier 1 support (rare among English-first competitors). URL paste — generate an SRT straight from a YouTube/TikTok/Instagram link or Google Drive share, no download-reupload cycle. Audio and transcripts stored in AWS eu-west-2 (London).

Pricing

$2-20/month. 30-minute free trial on signup, no credit card.

Formats

SRT, VTT, TXT, DOCX, JSON, PDF. Single-file or bulk batch (up to 50 files).

Free tier

30-minute trial (single-use).

Honest limitations

No burn-in workflow — exports SRT/VTT for use in your video editor. No animated text styling or templates (use Submagic or CapCut for short-form social). No real-time live captioning (use Otter.ai for live meetings). Transcription inference runs via Replicate (US-based) — storage is EU but compute touches US infrastructure. For strict EU-only processing, Happy Scribe is the closer fit.

VEED.io

Browser-based all-in-one video editor with auto-subtitles, burn-in, and translation.

Best for

Solo creators and small teams who want one tool for the full editing workflow — trim, cut, caption, translate, export. No software install.

Standout

Subtitle styling templates, multi-language translation in 100+ languages, brand kit for consistent caption styling across a creator's library. Strong free tier for testing.

Pricing

$12-59/month. Free tier with 10 min/month and watermark.

Formats

SRT, VTT, TXT, MP4 with burn-in.

Free tier

10 minutes/month with VEED watermark on output.

Honest limitations

Subscription gets pricey at scale ($59/mo Pro vs comparable Descript at $50/mo). Browser editor performance limits long-form (1h+) projects compared to desktop tools.

Kapwing

Browser-based collaborative editor with caption styling presets and team workflows.

Best for

Marketing and social teams needing collaboration features. Educators producing classroom content.

Standout

Real-time collaboration is genuinely useful for team-produced content. Caption styling presets and brand kits. Strong template library for repeatable formats.

Pricing

$16-50/month. Free tier with watermark.

Formats

SRT, VTT, MP4 with burn-in.

Free tier

Watermarked free output.

Honest limitations

Performance ceiling on long-form projects (browser-based). Watermark on free tier removes much of the free-tier value. AI features less aggressive than VEED or Descript.

Rev.com

Industry-standard human + AI caption service with broadcast-grade accuracy guarantee.

Best for

Broadcast TV under FCC regulation. Legal proceedings. Medical content. WCAG 2.1 AAA compliance projects. Any context where ~99% accuracy and human verification are required.

Standout

Human-verified captions at 99% accuracy with FCC-compliant SCC and broadcast format support. The default choice for regulated industries. Mature SLA + dedicated account management at enterprise tier.

Pricing

$0.25/min AI, $1.99/min human. No subscription required (PAYG).

Formats

SRT, VTT, SCC, MCC, TTML, MP4 burn-in.

Free tier

Pay-per-use only.

Honest limitations

Per-minute pricing makes it expensive at scale. $1.99/min human × 1h = $119 — fine for one deposition, brutal for a podcast archive. No flat-rate option. US-hosted (Cloud Act considerations for non-US clients).

Happy Scribe

EU-based subtitle service with 120+ languages and broadcast format support.

Best for

European media operations needing EU residency. Documentary and broadcast workflows. Multilingual content with professional translation review.

Standout

STL and EBU format support (European broadcast standards) that few competitors offer. Optional human review tier for compliance projects. Barcelona hosting — EU-GDPR native.

Pricing

$17-29/month subscription or pay-per-minute. Add-on human review extra.

Formats

SRT, VTT, STL, EBU, TTML.

Free tier

10-minute trial — single use, not recurring.

Honest limitations

Trial is small (10 min) — hard to evaluate properly. Pay-per-minute at scale matches Rev's pricing concern. Subscription cap on minutes limits high-volume use.

Aegisub

Free open-source desktop editor — the broadcast and anime fansub standard.

Best for

Anime fansub communities. Broadcast TV professionals. Any project requiring frame-accurate timing and full styling control (font, color, position, fade, rotation).

Standout

Full ASS/SSA styling control with frame-accurate karaoke timing. Audio waveform editor. Free forever, no subscription, no limits, no watermark. The de facto standard for professional subtitle craft.

Pricing

Free (open source under BSD license).

Formats

ASS, SSA, SRT, MicroDVD, SubViewer, and more.

Free tier

Free forever.

Honest limitations

No AI auto-caption — designed for manual entry with audio cue reference. Steep learning curve. UI is dated (last major release 2022). No translation features. Not the right tool for casual creators — built for craftsmen.

Subtitle Edit

Free open-source desktop editor supporting 300+ subtitle formats with waveform editing.

Best for

Format conversion (SRT to TTML for Netflix, SRT to SCC for broadcast, etc). QA workflow for AI-generated subtitles. Any project needing format flexibility.

Standout

Reads and writes 300+ subtitle formats — by far the broadest format library in the industry. Spell check, OCR for image-based subtitles (DVD, Blu-ray), waveform visualization, batch convert.

Pricing

Free (open source under GPL).

Formats

300+ formats supported.

Free tier

Free forever.

Honest limitations

Windows-first (runs on macOS via Mono with rough edges). UI is functional but dated. No AI auto-caption — built around manual or imported subtitle files for editing/conversion.

Otter.ai

Live meeting captions with speaker labels — built for Zoom, Teams, Google Meet.

Best for

Live meeting transcription and captioning (Zoom, Teams, Meet). Real-time accessibility for live events.

Standout

Live caption quality is consistently among the best for English meetings. Voice profiles for recurring meeting participants. Integration with major meeting platforms.

Pricing

$16.99-30/month. 300 min/month free tier.

Formats

SRT, VTT, TXT.

Free tier

300 min/month — but ENGLISH ONLY (also French and Spanish on paid tiers).

Honest limitations

Does NOT support Portuguese in 2026 — English, French, and Spanish only. For Brazilian, Portuguese, or LATAM use, Otter is functionally useless. US-hosted (Cloud Act). Primarily transcription, not subtitle styling — exports basic SRT without timing tuning for visual presentation.

YouTube Studio

Free native auto-captions for YouTube uploads — the universal baseline.

Best for

YouTube creators who want zero-friction captions as a starting point. Any video destined for YouTube only.

Standout

Free, automatic, zero setup. Integrated directly into the YouTube upload workflow. Supports community-contributed translations.

Pricing

Free with YouTube account.

Formats

SRT, VTT export (within YouTube ecosystem).

Free tier

Unlimited free use.

Honest limitations

Quality 75-85% on English, lower on other languages. No speaker identification. No sound effect cues. No closed-caption metadata (insufficient for WCAG 2.1 AAA, FCC, or EAA compliance). Export limited to YouTube ecosystem — can't easily push to other platforms.

Honorable mentions

Tools we considered but didn't rank in the main 12, with the reason:

●Maestra (multilingual subtitle + dubbing — voice cloning ethically controversial)
●Riverside (podcast studio with caption export)
●Opus Clip (AI shorts with auto-generated styled captions)
●Adobe Premiere Pro (NLE built-in auto-captions for pro editors)
●DaVinci Resolve (free NLE with strong auto-caption in Studio version)
●Final Cut Pro (Apple NLE auto-captions)
●AssemblyAI (developer API for building subtitle features into apps)
●Deepgram (developer API alternative to AssemblyAI)

By use case: who picks what

Use case shapes the right tool more than abstract rankings. Match yours to the row below.

Persona	Primary	Alternative	Why
YouTube long-form creator	Descript	VexaScribe	Transcript-based editing for talking-head content; VexaScribe for SRT export without the editor overhead
TikTok / Reels / Shorts creator	Submagic	CapCut	Animated styled captions optimized for short-form engagement; CapCut free if budget-constrained
Podcaster making audiograms	Descript	Riverside or Opus Clip	Descript's transcript-editing paradigm is built for podcast workflows; Riverside if you record remote interviews; Opus Clip for AI-generated shorts with captions
Corporate L&D / training	Rev (human)	Happy Scribe	Human-verified accuracy for WCAG/EAA compliance audits in regulated industries
Film / TV professional	Aegisub + Subtitle Edit	Premiere Pro	Frame-accurate ASS styling, broadcast format conversion (TTML, SCC, STL, EBU)
University educator	Otter.ai (live)	VexaScribe	Otter for live lecture captions; VexaScribe for post-class SRT export in multiple languages
Live streamer	Otter.ai (live)	YouTube live auto-captions	Real-time captioning during stream
Documentary / foreign-language	Happy Scribe	VexaScribe	Happy Scribe's 120+ languages + EU residency + optional human translation review; VexaScribe is the cheaper alternative for in-house translation workflows
Accessibility-first (WCAG/EAA)	Rev (human verified)	Happy Scribe + manual review	Human verification is the safest path for AA/AAA compliance audit
Brazil / LATAM creator (PT-BR)	VexaScribe	CapCut (free)	Whisper Tier 1 PT-BR accuracy at BRL-friendly pricing — Otter is non-viable here (English/FR/ES only); CapCut if free is the priority

Subtitle compliance in 2026: EAA, WCAG, FCC, ABNT

The regulatory environment shifted significantly in 2025-2026. If you produce video content for any regulated market, this section matters.

European Accessibility Act (Directive 2019/882)

Enforcement began 28 June 2025 with phased compliance through 2030. Fines vary by Member State — some up to €50,000 per violation. Affects: e-commerce, banking video customer service, video-on-demand (Netflix-style), e-books, e-readers, e-learning platforms. Practical impact: any company selling video products or services to EU customers must caption their video content to WCAG 2.1 AA-equivalent quality. AI-generated subtitles are permitted; quality must be sufficient to convey audio meaning equivalently.

WCAG 2.1 (Universal)

Success Criterion 1.2.2 (Captions, Prerecorded) — Level A: captions must be provided for all prerecorded audio in synchronized media. SC 1.2.4 (Captions, Live) — Level AA: captions for live audio. SC 1.4.5 (Images of Text) — implicates burn-in subtitles which can't be turned off. Tools that help: Rev (human verified), VexaScribe (high AI accuracy + manual review workflow), Subtitle Edit (QA verification), Aegisub (broadcast precision).

FCC 47 CFR § 79.1 (US)

Applies to broadcast TV, cable, satellite, and OTT services in the US. Requires near-perfect closed captions including speaker identification and sound effects. AI captions alone do NOT satisfy FCC quality standards — human verification is the de facto requirement. Format: SCC for traditional broadcast, MCC for enhanced content. Rev.com is the industry default.

ABNT NBR 15290:2016 (Brazil)

Brazilian standard for accessibility in audiovisual communication. Defines closed-caption requirements for TV broadcast and increasingly enforced for video-on-demand and government communication. Aligned with LBI (Lei 13.146/2015 — Brazilian Inclusion Law). Practical compliance: SRT with proper speaker identification, ABNT-compliant line length (32-42 chars typical), proper reading speed (~17 CPS). We cover Brazilian compliance in depth in our PT-BR subtitle guide.

Format cheat-sheet

Which format you need depends on destination platform and use case. The vast majority of creators only need SRT.

Format	What it is	When to use	Limitations
SRT	Universal text format with timestamps. No styling.	Default choice. YouTube, social, NLEs, most use cases.	No bold/italic, no positioning, no color.
VTT	Web standard, supports basic styling (bold, italic, positioning).	HTML5 video players, web platforms.	Less universal than SRT outside web.
ASS / SSA	Advanced SubStation Alpha — full styling control.	Anime fansub, karaoke, broadcast precision, Aegisub workflow.	Not supported by simple players or most social platforms.
TTML / DFXP	XML-based, Netflix and broadcast standard. Complex styling.	Netflix delivery, professional broadcast.	Requires specialized tools to produce/edit correctly.
SCC	Scenarist Closed Captions — FCC-compliant US broadcast.	US TV broadcast under FCC 47 CFR § 79.1.	US-specific. Not for general web/video.
MCC	MacCaption Closed Captions — enhanced SCC variant.	US broadcast TV with extended character set.	Even more US-specific than SCC.
STL / EBU	European Broadcasting Union subtitle standards.	European broadcast TV delivery.	Region-specific to European broadcast.
MP4 burn-in	Subtitles burned permanently into video pixels.	Social platforms without subtitle support (Reels, TikTok native). Final deliverables.	Can't be turned off; can't be retranslated; doesn't satisfy WCAG 1.4.5 (user control).

Need to convert between formats? Subtitle Edit reads and writes 300+ formats — it's the free utility every subtitle workflow eventually needs. Need to generate SRT from audio quickly? Try our SRT generator.

When we'd send you elsewhere

We make VexaScribe — we placed ourselves at position 4 in this list, which we think is honest. Here's where we'd explicitly send you to a different tool:

●Use Submagic if you produce TikTok, Reels, or YouTube Shorts and need viral-style animated word-by-word captions. We don't do animated text styling.
●Use CapCut if you're on a tight budget, work primarily on mobile, and produce short-form content for TikTok or Instagram. CapCut is genuinely free with no watermark.
●Use Descript if you're a podcaster or course creator who wants the transcript-edits-video paradigm. We don't have an integrated video editor.
●Use Rev.com (human) if you're producing broadcast TV under FCC regulation, legal evidence transcripts, or anything requiring 99% verified accuracy. AI tools including us are not appropriate for FCC-compliant captions.
●Use Happy Scribe if you need strict EU-only processing (we currently use Replicate for Whisper inference, which means audio touches US infrastructure during transcription even though storage is in London) or if you need broadcast formats like STL and EBU.
●Use Aegisub + Subtitle Edit if you work in anime fansub, professional broadcast, or any project requiring frame-accurate styling with full ASS/SSA control. We export clean SRT/VTT — not designed for fansub craft.
●Use Otter.ai if you need real-time live captions during English-language Zoom or Teams meetings. We're file-upload based, not live.
●Use AssemblyAI or Deepgram if you're a developer building subtitle features into your own product via API. They have mature batch APIs; we're UI-first.

If your use case isn't above — long-form YouTube without the editor overhead, educators producing multilingual lecture captions, or PT-BR work where Otter doesn't reach — we're a reasonable fit. Try the 30-minute free trial.

Frequently asked questions

What is the best free subtitle generator in 2026?

Depends on the source. For YouTube creators: YouTube Studio's built-in auto-captions are free and adequate as a baseline (~75-85% accuracy). For TikTok/Reels: CapCut is free with no watermark and styled caption templates. For desktop power users: Subtitle Edit (open-source, Windows-first) and Aegisub (open-source, cross-platform) are completely free with no upload limits. For accuracy + multilingual + privacy: VexaScribe's 30-minute free trial uses Whisper Large-v3 — the same model OpenAI charges $0.006/min for via API. Honest caveat: free tiers from VEED, Submagic, and Happy Scribe are usually capped at 10-30 minutes total trial — not a free tier you can keep using monthly.

What's the difference between subtitles and closed captions?

Subtitles assume the viewer can hear the audio and just need translation or text reinforcement — they typically transcribe dialogue only. Closed captions (CC) assume the viewer cannot hear and include speaker identification ([JOHN]), sound effects ([door slams]), and music cues ([upbeat music]). In the US, FCC rules 47 CFR § 79.1 require closed captioning on broadcast TV and many OTT services. In the EU, the European Accessibility Act (Directive 2019/882) phases in similar requirements through 2030. We cover this in depth at /captions-vs-subtitles.

Are AI-generated subtitles accurate enough for broadcast?

For broadcast TV under FCC regulation: no, not without human review. FCC quality standards require near-perfect captions, and Whisper Large-v3 at ~95% accuracy on clean English audio still produces ~50 errors per 1,000 words — unacceptable for compliance. The industry standard is AI-first-pass + human-review-second-pass: services like Rev.com or 3Play Media. For non-broadcast use (YouTube, social media, internal corporate training, courses), AI is widely acceptable and is what most creators ship. For WCAG 2.1 accessibility compliance on websites, AI captions with a brief manual review of speaker labels and proper nouns typically passes audit.

What subtitle format should I use — SRT, VTT, or ASS?

SRT for almost everything — universal compatibility, supported by YouTube, Premiere, DaVinci, Final Cut, CapCut, VLC, and every social platform. Use VTT specifically for HTML5 web video players and when you need basic styling (bold, italic, positioning). Use ASS/SSA only for anime fansub, karaoke, or when you need precise styling control (font, color, position, fade, rotation) — Aegisub is the standard editor. TTML is for Netflix and broadcast TV (XML-based, complex styling). SCC/MCC for US broadcast FCC compliance. STL/EBU for European broadcast. As a creator: start with SRT, branch out only when a destination platform requires something else.

How long should each subtitle line be?

Industry standard: 32-42 characters per line, maximum 2 lines per subtitle block. Netflix specifies 42 characters per line for English; BBC specifies 37-39. The 2-line maximum is universal — 3+ lines block too much visual content. Why this matters: line length × reading speed determines whether viewers can finish reading before the subtitle disappears. If you exceed 42 chars on 2 lines (84 chars) at the typical 2-3 second display duration, viewers are forced to choose between watching the video and reading the subtitle. Most AI subtitle generators don't enforce these limits automatically — manual review or post-processing in Subtitle Edit/Aegisub is often needed for professional output.

What is the Netflix reading speed standard?

Netflix specifies 17 characters per second (CPS) for adult content, 20 CPS for children's content. BBC specifies 15-17 CPS as the accessibility floor for general audiences. The math: a 6-second shot allows roughly 100 characters of subtitle text at 17 CPS — about 2.5 standard 42-char lines. If your subtitle exceeds this, viewers can't read it in time. CPS is the single most overlooked subtitle quality metric. Aegisub and Subtitle Edit both calculate CPS per block and flag violations. Most AI tools (Submagic, VEED, CapCut) don't expose CPS metrics — they prioritize visual styling over reading speed. For accessibility-first projects (educational, healthcare, government), enforce CPS limits manually.

Do YouTube auto-captions meet WCAG 2.1 accessibility standards?

Not reliably. WCAG 2.1 Success Criterion 1.2.2 (Captions, Prerecorded) is Level A — captions must be provided. YouTube auto-captions technically satisfy “captions exist” but typically fail on accuracy (75-85%), speaker identification, sound effects, and music cues. WCAG implicitly requires captions to be accurate enough to convey meaning equivalent to the audio. For most academic, government, and corporate sites under WCAG 2.1 AA audit, YouTube auto-captions alone are insufficient — manual review or a higher-accuracy AI tool (VexaScribe Whisper Large-v3, Rev human review) is the safer choice. For personal vlogs or non-regulated content, YouTube auto-captions are commonly accepted.

What does the European Accessibility Act require for video subtitles?

The European Accessibility Act (Directive 2019/882) requires accessible captioning on video content provided by audiovisual media services, video-on-demand, and e-commerce platforms operating in the EU. Enforcement began 28 June 2025 with phased compliance through 2030. Member state fines vary: some impose up to €50,000 per violation. Affected services include Netflix-style VOD, e-learning platforms, e-commerce product videos, banking video customer service, and e-book audio narration. Practical impact: any company selling video products or services to EU customers must caption their video content to WCAG 2.1 AA-equivalent quality. AI-generated subtitles are permitted; quality must be sufficient to convey audio meaning equivalently.

Can I burn subtitles directly into the video file?

Yes — that's called hardcoded or burned-in subtitles. They become permanent pixels in the video and can't be turned off. Use burn-in when: the destination platform doesn't support separate subtitle tracks (Instagram Reels, TikTok native uploads), you want guaranteed styling consistency across all viewers, or you're producing a final deliverable for clients. Use separate SRT/VTT files when: viewers may want to toggle captions on/off (YouTube, web video), accessibility regulation requires user control over caption display (WCAG 2.1 SC 1.4.5), or you'll later need to translate to additional languages. Tools for burning: VEED, Kapwing, CapCut, Submagic (UI-based); FFmpeg (command line, free); Adobe Premiere, DaVinci Resolve, Final Cut Pro (NLE built-ins).

What's the best subtitle tool for translating videos into multiple languages?

Three tiers. For volume and language coverage: Happy Scribe (120+ languages, EU-based, professional translation review available). For free starting point: VexaScribe (99 languages with Whisper Large-v3, Tier 1 includes major European, Asian, and Arabic markets) + manual review. For full dub + subtitle workflow: Maestra (combines subtitle generation with optional voice cloning — though we recommend skipping voice cloning for ethical reasons documented in our Brazilian Portuguese guide on the Dublagem Viva movement). Honest read: translation quality from any AI tool is roughly equivalent to mid-tier machine translation — Tier 1 languages (English ↔ Spanish/French/German/Portuguese) are good; lower-resource pairs (English ↔ Swahili/Tamil) need manual review. Always have a native speaker verify final subtitles for paid distribution.

Methodology and sources

● Pricing and feature details verified June 2026 on each vendor's site. Plans change frequently — confirm before subscribing.
● Whisper Large-v3 accuracy benchmarks: OpenAI Whisper paper (Radford et al. 2022); Distil-Whisper PT-BR Common Voice Brasil 8.22% WER (~92% accuracy).
● Netflix Timed Text Style Guide — Partner Help Center (17 CPS adult, 20 CPS children, 42 chars/line English).
● BBC Subtitle Guidelines — bbc.github.io/subtitle-guidelines (15-17 CPS, 37-39 chars/line, accessibility floor).
● W3C WCAG 2.1 — Success Criteria 1.2.2 (Captions Prerecorded, Level A), 1.2.4 (Captions Live, Level AA), 1.4.5 (Images of Text).
● European Accessibility Act — Directive (EU) 2019/882. Enforcement 28 June 2025, phased compliance through 2030.
● FCC Closed Captioning — 47 CFR § 79.1.
● ABNT NBR 15290:2016 — Brazilian audiovisual accessibility standard. Aligned with LBI (Lei 13.146/2015).
● WebVTT specification — W3C.
● SRT format reference — Matroska SubRip de facto standard.
● SSA/ASS specification — Aegisub documentation.
● DCMP Captioning Key — Described and Captioned Media Program (US educational standards).
● Otter.ai supported languages: verified on otter.ai/languages June 2026 — English, French, Spanish only.

The essentials

How we evaluated

Honesty statement

What makes a subtitle tool different from transcription

Timing precision

Line length standards

Reading speed (CPS / characters per second)

Styling

Burn-in vs separate file (hardcoded vs softcoded)

Multi-language tracks

Translation workflow

The 12 best subtitle generation tools in 2026

Submagic

CapCut

Descript

VexaScribe

VEED.io

Kapwing

Rev.com

Happy Scribe

Aegisub

Subtitle Edit

Otter.ai

YouTube Studio

Honorable mentions

By use case: who picks what

Subtitle compliance in 2026: EAA, WCAG, FCC, ABNT

European Accessibility Act (Directive 2019/882)

WCAG 2.1 (Universal)

FCC 47 CFR § 79.1 (US)

ABNT NBR 15290:2016 (Brazil)

Format cheat-sheet

When we'd send you elsewhere

Frequently asked questions

Methodology and sources

Related guides

SRT generator

YouTube transcript downloader

TikTok transcript generator

Instagram transcript generator

How to add subtitles to video

Video to SRT

Captions vs subtitles

Best podcast transcription tools

Best diarization tools

AI transcription

Bulk transcription

Transcribe audio to text

Pricing

All comparisons

Best transcription software 2026