Captions, Subtitles & Accessibility
Captions are not optional. They’re not a “nice to have” or a “someday when I have time.” They are a standard feature of professional course content.
Here’s why — and how to add them without spending hours on each video.
Why Captions Matter
Accessibility: Approximately 15% of the global population has some degree of hearing loss. Without captions, your course is partially or completely inaccessible to those learners. Captions aren’t charity — they’re the baseline for inclusive education.
Comprehension: Even viewers with perfect hearing benefit from captions. Reading along while listening improves retention and comprehension, especially for technical content, foreign accents, or complex terminology. Research on multimedia learning consistently shows that dual-channel input (seeing text while hearing audio) reinforces learning.
Viewing situations: Your students don’t always watch in ideal conditions. They might be:
- Watching on mute in a shared office or coffee shop
- Learning in a noisy environment (commuting, gym, kids in the next room)
- Viewing on a device with poor speakers
- Studying late at night while a partner sleeps nearby
Without captions, these viewers miss content. With captions, they learn anywhere.
SEO and discoverability: If your course videos are hosted on a platform that indexes captions (YouTube, Vimeo), the caption text becomes searchable. Your videos show up in more search results.

Auto-Generated Captions
The good news: you don’t have to type out captions manually. Several tools generate captions automatically using speech recognition.
YouTube (Free)
Upload your video to YouTube (even as unlisted). YouTube automatically generates captions within minutes. Download the caption file (SRT format) and use it anywhere.
Quality: Good for clear audio. Struggles with accents, technical jargon, and fast speech.
How to use it:
- Upload video as unlisted
- Wait for auto-captions to generate (usually 5–15 minutes)
- Go to Subtitles in YouTube Studio
- Download the SRT file
- Edit for accuracy (see below)
- Upload the corrected SRT to your course platform
Descript ($0–33/mo)
Descript is a transcription-first video editor. You import your video, it generates a transcript, and you edit the video by editing the text (deleting a sentence in the transcript deletes it from the video).
Quality: Excellent. One of the most accurate speech-to-text tools available.
Why it’s useful beyond captions: Descript can also generate your captions as burned-in text or as an SRT file. The free plan includes 1 hour of transcription per month.
CapCut (Free)
If you’re editing in CapCut, auto-captions are built in. One click generates captions that you can style and position.
Quality: Good. The text editing interface makes it easy to correct mistakes.
Your Editing Software
DaVinci Resolve, ScreenFlow, and Camtasia all have built-in caption tools. In most cases, you’ll import an SRT file or use a built-in speech-to-text feature.
- DaVinci Resolve Studio: Built-in auto-captioning (not available in the free version)
- ScreenFlow: Auto-captioning via integration with third-party services
- Camtasia: Speech-to-text captioning built in
Editing Auto-Captions for Accuracy
Auto-generated captions are usually 90–95% accurate. That sounds good until you realize the 5–10% errors are concentrated in the most important content — technical terms, proper nouns, numbers, and domain-specific vocabulary.
Always review and edit auto-captions. Here’s the efficient workflow:
- Generate captions using whatever tool you prefer
- Export as SRT (SubRip subtitle format — the universal standard)
- Open the SRT in a text editor (it’s plain text with timestamps)
- Play the video and read along with the captions
- Fix errors as you find them — focus on:
- Technical terms and jargon
- Proper nouns (names, brands, products)
- Numbers and prices
- Homophones (their/there, your/you’re, course/coarse)
- Save the corrected SRT
- Upload to your course platform or burn into the video
Time estimate: 2–3 minutes per minute of video. A 10-minute lesson takes 20–30 minutes to caption properly.
Burned-In vs. Separate Captions
Two ways to deliver captions:
Burned-In Captions (Hardcoded)
The text is rendered directly into the video image. It’s always visible, can’t be turned off, and works on every player and platform.
Pros: Universal compatibility, always visible, no extra files to manage.
Cons: Can’t be turned off (annoying for viewers who don’t want them), can obscure visual content, can’t be translated into other languages.
When to use: When your course platform doesn’t support caption files. When you want captions guaranteed to display. For social media clips where autoplay is common (viewers often watch without sound).
Separate Caption Files (SRT/VTT)
A text file that the video player reads and overlays on top of the video. Viewers can toggle captions on/off.
Pros: Can be turned on/off, don’t obscure visuals when off, can be swapped for different languages, can be restyled by the viewer.
Cons: Requires a player that supports caption files, extra file to upload and manage.
When to use: When your course platform supports SRT/VTT upload (most modern platforms do). This is the preferred method for course content.
Check your platform: GoHighLevel, Teachable, Kajabi, Thinkific, and most modern course platforms support separate caption files. If yours doesn’t, use burned-in captions as the fallback.
Making Your Course Accessible Beyond Captions
Captions are the most impactful accessibility feature, but they’re not the only one:
Video player controls: Choose a platform that offers playback speed controls, volume adjustment, and fullscreen toggle. These aren’t just convenience features — they’re accessibility tools.
Downloadable transcripts: Provide a text transcript for each lesson. Some learners process written information more effectively than audio. Transcripts also help students search for specific content later.
Clear visual contrast: If you add text overlays, ensure high contrast between text and background. Light gray text on a white background is unreadable for visually impaired viewers.
Consistent formatting: Use the same slide template, caption style, and video format throughout the course. Predictable formatting reduces cognitive load for all learners, especially those with learning disabilities.
Your Action Step
After you export your next lesson, run it through auto-captioning. Download the SRT file. Spend 20 minutes correcting errors. Upload the corrected SRT alongside your video.
This adds 20–30 minutes per lesson. It’s the highest-return time investment in your entire production workflow.
Next up: building your repeatable production workflow.
Keep going — you're making progress through Produce Your Course Videos.
Need help? Book a free call ↗