Key takeaways
- →Zoom's native transcription is paid-plan + cloud-recording only. Free plan users and local recordings get nothing — the setting simply doesn't exist for them.
- →Any recording can be transcribed externally. Upload the audio_only.m4a from Documents/Zoom (Windows) or ~/Documents/Zoom (Mac) — works on every plan.
- →Upload the M4A, not the MP4. Same transcript, ~10× smaller file, much faster upload.
- →Recording already in Google Drive? Paste the share link directly — the download-reupload cycle is skipped, even for large files.
- →Accuracy gap is real. Independent testing puts Zoom native at 48-85%; Whisper Large-v3-class tools reach 91-95% on the same compressed meeting audio.
- →Want future meetings auto-transcribed? VexaScribe's Meeting Bot joins Zoom/Meet/Teams at 3× minute rate — or use a dedicated note-taker (Otter, Fathom) if live capture is your primary daily workflow.
Method 1 — Zoom's built-in transcription (paid plans only)
Zoom can transcribe cloud recordings automatically, but the requirements filter out most users. All four must be true:
Paid plan
Business, Education, or Enterprise license. Free and Pro plans don't include audio transcription.
Cloud recording
Transcripts are only generated for cloud recordings. Local recordings saved to your computer never produce a Zoom transcript — on any plan.
Setting enabled
Settings → Recording → Cloud recording → check "Audio transcript". On managed accounts, an admin may need to enable it account-wide first.
Patience
Zoom processes the transcript after the recording finishes. Processing often takes longer than the meeting itself, and at peak times can take up to 24 hours.
To enable it (per Zoom's official documentation): sign in at zoom.us → Settings → Recording tab → enable Cloud recording → check Audio transcript → Save. Record your next meeting to the cloud, and the transcript appears alongside the recording in the Recordings tab once processing finishes — as a .vtt file you can view, lightly edit, or download.
Honest limits: processing can lag up to 24 hours at peak; accuracy on independent testing ranges 48-85% — usable for search, rough for sharing; export is VTT-only (no DOCX minutes, no plain text without conversion); and it never applies retroactively to recordings made before you enabled the setting.
Method 2 — Upload the recording to an AI tool (works on any plan)
This is the method that works for everyone: free-plan users, local recordings, old recordings from before you had transcription enabled, and paid users whose native transcript quality disappointed.
Step 1 — Find your Zoom recording file
- Local recordings (Windows):
C:\Users\<you>\Documents\Zoom— each meeting has a dated folder. - Local recordings (Mac):
~/Documents/Zoom - Can't find it? Zoom desktop app → Meetings → Recorded — each entry links to its folder.
- Cloud recordings: zoom.us → sign in → Recordings → open the meeting → download the audio file. Cloud storage has caps and expiry on many plans — download anything you want to keep.
Step 2 — Pick the M4A, not the MP4
Every Zoom recording folder contains an audio_only.m4a alongside the video MP4. The transcript comes entirely from the audio track — the video pixels contribute nothing. A 1-hour meeting is typically 30-60 MB as M4A vs 400 MB-1.5 GB as MP4, so the M4A uploads in a fraction of the time with an identical result. (Exception: if you want an SRT subtitle file to pair with the video version, uploading the MP4 works too — the SRT timing syncs to the video either way.)
Step 3 — Upload, diarization on, download
Upload to VexaScribe (files up to 5 GB — covers any Zoom recording), toggle speaker diarization on so each participant gets a label you can rename once, and wait 5-15 minutes for a 1-hour meeting. Export as TXT, DOCX (shareable minutes with speakers + timestamps), SRT/VTT, or JSON. The first 30 minutes are free with no card. Need meeting minutes rather than a verbatim transcript? The Meeting summary template generates decisions and action items from the same upload.
Shortcut: recording already in Google Drive?
Many teams auto-archive Zoom cloud recordings to Google Drive. Instead of downloading a 1 GB MP4 and re-uploading it, paste the Drive "anyone with the link" share URL straight into VexaScribe — the file is fetched server-side, including large files where Drive shows the can't-scan-for-viruses interstitial (that >25MB confirmation-token flow is handled automatically). Also works with direct HTTPS links to audio/video files (S3, Dropbox shares). Private, non-shared Drive files aren't supported — flip sharing on first, or download-then-upload.
Method 3 — Send a meeting bot (for future meetings)
If what you want is every future meeting transcribed automatically — a bot joins the call, records, and the transcript with a structured summary appears in your dashboard — that's a different workflow from uploading recordings, and it's available too. VexaScribe's Meeting Bot joins Zoom, Google Meet, or Teams meetings, records and transcribes them, and generates summaries with action items and decisions. Honest pricing note: the bot consumes 3× your normal minute rate because it holds a live connection for the whole meeting — a 30-minute meeting uses 90 minutes from your plan. On the Pro plan ($10/mo, 2,500 min) that's roughly 13 hours of bot-recorded meetings per month. See features and pricing for details.
When a dedicated meeting-bot product fits better: if live meeting capture is your primary daily workflow — every calendar meeting, team-wide rollout, CRM integrations — purpose-built note-takers go deeper: Otter.ai (mature Zoom bot, $8.33/mo annual), Fathom (generous free tier for individuals), Fellow (team meeting-notes workflows). See our meeting-bot comparison and AI note taker guide for the honest breakdown.
The trade-offs of any bot approach: a visible participant joins your calls (some clients and interviewees find this intrusive), it only helps future meetings (your existing recording archive stays untranscribed), and consent rules in two-party-consent jurisdictions apply. Batch upload — Method 2 — remains the right tool for recordings that already exist, and it costs 1× minutes instead of 3×.
Zoom native vs AI-tool accuracy
| System | Typical accuracy | Notes |
|---|---|---|
| Zoom built-in transcription | ~48-85% (independent testing) | Varies widely with audio conditions. VTT output only. No editing UI beyond basic correction. |
| Whisper Large-v3-class AI (VexaScribe) | ~91-95% on Zoom-quality compressed audio | Speaker diarization included, 5 export formats, in-browser editor. Proper nouns still need a quick proofread. |
| Human transcription (GoTranscript, Rev) | 99%+ | $0.90-$1.99/min and 12-36h turnaround — justified for legal/compliance, overkill for routine meeting notes. |
All three degrade on heavy crosstalk, strong accents, and bad microphones — Zoom audio is compressed, which is why even Whisper-class accuracy sits at 91-95% here rather than the 95-97% it reaches on studio audio. For the deeper accuracy discussion, see how accurate is Whisper.
Common problems and fixes
Transcript stuck at "Processing" for hours
Normal at peak times — Zoom's transcript queue can run up to 24 hours behind. If you need the transcript sooner, download the audio file from the Recordings page and upload it to an AI tool; you'll have a transcript in 5-15 minutes.
No "Audio transcript" option in settings
Your plan or admin settings don't allow it. Audio transcription requires Business/Education/Enterprise, and on managed accounts the admin must enable it at the account level before it appears in your personal settings.
Local recording has no transcript
By design — Zoom never transcribes local recordings. Grab the audio_only.m4a from your Documents/Zoom meeting folder and upload it to a transcription tool instead.
Zoom transcript export is VTT-only
Zoom exports the native transcript as a .vtt file. If you need DOCX meeting minutes or plain text, either convert the VTT manually or re-transcribe the audio in a tool that exports TXT/DOCX/SRT/JSON directly.
Speaker labels are wrong for a shared conference-room laptop
Diarization separates voices by acoustic signature — two people on one microphone in the same room are genuinely hard to split for every tool. Where possible, have participants join from their own devices, or enable Zoom's separate-audio-file-per-participant option before recording.
Frequently asked questions
Does Zoom transcribe meetings for free?
No. Zoom's built-in audio transcription requires a paid plan (Business, Education, or Enterprise), cloud recording enabled, and the audio transcript setting turned on by your admin. Free and Pro plan users don't get native transcripts. It also only works on cloud recordings — a local recording saved to your computer never generates a Zoom transcript, regardless of plan. If you're on the free plan or you have a local recording file, the practical path is uploading the recording to an AI transcription tool: VexaScribe transcribes the first 30 minutes free with no credit card, with speaker labels included.
How do I get a transcript from an already-recorded Zoom meeting?
Find the recording file and upload it to a transcription tool. Local recordings live in Documents\Zoom on Windows or ~/Documents/Zoom on Mac — each meeting folder contains an audio-only M4A and a video MP4. Upload the M4A (it's roughly 10× smaller than the MP4 and produces an identical transcript). Cloud recordings: sign in at zoom.us → Recordings → download the audio file. If the recording is already in Google Drive (common for teams that archive Zoom exports), paste the public share link into VexaScribe directly — no download-reupload cycle. Processing takes 5-15 minutes for a 1-hour meeting, with speakers labeled automatically.
Where does Zoom save my recordings?
Local recordings: Windows → C:\Users\<you>\Documents\Zoom, Mac → ~/Documents/Zoom, each meeting in a dated folder containing audio_only.m4a, video files (MP4), and chat logs. Cloud recordings: zoom.us → sign in → Recordings tab → each meeting lists its files for streaming or download. Note: cloud recordings expire or hit storage caps on many plans, so download anything you want to keep. If you can't find a local recording, check Zoom's desktop app under Meetings → Recorded, which maps recordings to their folders.
Can I transcribe a local Zoom recording?
Yes — but not with Zoom itself. Zoom's native transcription only processes cloud recordings; local recordings never get a Zoom transcript on any plan. Upload the local file to an AI transcription tool instead: grab audio_only.m4a from your Documents/Zoom meeting folder, upload it, and you'll get a timestamped, speaker-labeled transcript in minutes. This works on any Zoom plan including free, because the transcription happens outside Zoom entirely.
Should I upload the M4A or the MP4 from my Zoom folder?
The M4A. Zoom saves both an audio-only M4A and a full video MP4 for every recording. The transcript comes entirely from the audio track, so the video pixels add nothing — but they make the file roughly 10× larger and slower to upload. A 1-hour Zoom meeting is typically 30-60 MB as M4A vs 400 MB-1.5 GB as MP4. Upload the M4A unless you specifically need the video for something else (like generating subtitles to burn back into the video — in that case, MP4 upload also works and produces an SRT synced to the video).
How accurate is Zoom's built-in transcription?
Independent testing has measured Zoom's native transcription at roughly 48-85% accuracy depending on audio conditions — noticeably below dedicated ASR models. Whisper Large-v3-class transcription (what VexaScribe runs) typically reaches 91-95% on Zoom-quality compressed meeting audio. Both degrade on heavy crosstalk, strong accents, or bad microphones. The practical difference: Zoom's native transcript usually needs heavy correction before you can share it; a Whisper-class transcript usually needs a 5-minute proofread for proper nouns.
Can I get speaker names in my Zoom transcript?
Partially automatic. AI speaker diarization detects when the speaker changes and labels them Speaker 1, Speaker 2, and so on — you then rename them once in the editor (e.g., "Maria — PM", "Client"), and the names apply through the whole transcript and all exports. One caveat that applies to every transcription tool: if two people share one laptop and microphone in a conference room, diarization can't reliably separate them — separate mics or Zoom's per-participant audio option produce much better speaker labels.
Can I transcribe a Zoom recording straight from Google Drive without downloading it?
Yes. If your Zoom recording is stored in Google Drive with "anyone with the link" sharing enabled, paste the share link into VexaScribe and the file is fetched server-side — including large files where Drive normally shows a can't-scan-for-viruses confirmation (that >25MB confirmation-token flow is handled automatically). This is the fastest path for teams that auto-archive Zoom cloud recordings to Drive: no download, no re-upload, straight to a speaker-labeled transcript. Private (non-shared) Drive files aren't supported — share the file first or download-then-upload.
Can VexaScribe transcribe my Zoom meetings live, without me recording anything?
Yes — the VexaScribe Meeting Bot joins your Zoom, Google Meet, or Teams meeting as a participant, records it, transcribes it, and generates a structured summary with action items and decisions. Honest pricing note: because the bot holds a live connection for the entire meeting, it consumes 3× your normal minute rate — a 30-minute meeting uses 90 minutes from your plan. For recordings you already have, uploading the file (Method 2) costs regular 1× minutes and is the more economical path. If live meeting capture is your primary daily workflow across every calendar meeting, also compare dedicated note-takers like Otter or Fathom, which are built around the bot-first model.
Related VexaScribe resources
Transcribe audio to text
The general workflow — every format, 99 languages
MP4 to text
If you're uploading the Zoom video file instead of the M4A
M4A to text
The format Zoom saves audio-only recordings in
Transcript to summary
Meeting minutes, decisions, and action items from the transcript
Sales call transcription
Zoom sales calls — quotes, objections, CRM notes
AI note taker guide
The live meeting-bot landscape, honestly compared