← Back to Blog
Scriptwriting

How to Write Voice-Over Scripts for E-Learning

A weak e-learning script wastes recording time and kills learner engagement. Here is exactly how to write voice-over scripts that perform — in the studio and in the course.

Voicfy··Updated
How to Write Voice-Over Scripts for E-Learning

Writing a voice-over script for e-learning is not the same as writing a slide deck or a training manual. It is writing for the ear — and that requires a completely different set of decisions. A script that reads fine on paper can fall apart the moment a voice actor tries to perform it. The result is retakes, delays, and a learner experience that feels flat no matter how good the production is.

This guide covers every stage of the process: structure, pacing, formatting, common mistakes, and how to hand off a script that a voice actor can record cleanly on the first pass.

Write for the Ear, Not the Page

The single most common mistake in e-learning scripts is writing in a register that looks professional but sounds robotic when read aloud. Long sentences with embedded clauses, passive constructions, and formal vocabulary all work against you. The voice has to breathe, pause, and land emphasis — and those things require short, muscular sentences.

Read every line aloud before it leaves your desk. If you stumble, rewrite it. If you have to take a breath mid-sentence, break the sentence in two. If a phrase sounds awkward spoken, it will sound awkward recorded — there is no amount of editing that fixes a badly written line.

Use contractions freely: you'll, it's, we're. They are how people actually speak, and they signal to the learner that this is a conversation, not a policy document. Avoid nominalizations — turning verbs into nouns. “Make a decision” reads professionally; “decide” performs better.

Match Pace to Cognitive Load

E-learning voice-overs typically run at 130 to 150 words per minute — slower than a podcast, faster than a legal deposition. For content-heavy modules covering technical processes or dense procedures, aim for the low end. For onboarding content or soft skills training, the upper end of that range is fine.

A practical test: a 60-second module should have roughly 130–145 words of spoken script, excluding any timed pauses you have marked. Count your words. If you are over, cut — don't ask the voice actor to rush. If you are under, check whether the visuals are carrying enough of the content load, or whether the script is too thin.

Build pauses into the script deliberately. Use a line break or a stage direction like [pause] after any instruction the learner needs to act on, after a key definition, or at the end of each section before a knowledge check. These breathing points are not dead air — they are part of the instructional design.

Structure the Script Around the Visuals

Voice-over in e-learning works best when it and the visuals carry different parts of the message. If the slide shows a three-step process, the voice should not read the three steps aloud — it should provide the context, the reason, and the implication that the visual does not show.

A useful test: if you removed the audio and a learner could still understand the core content from the screen alone, the script is redundant. If you removed the visuals and the audio alone conveyed the message without the screen, the visuals are decorative. Both extremes waste the medium. The script and the screen should depend on each other.

Write your script in parallel with the storyboard, not after it. When the two are developed together, the voice-over naturally fills the gaps in the visual rather than describing what is already visible. If you are handed a finished storyboard and asked to script it afterward, your first job is to identify what the visuals are already saying clearly — then write around that.

Format the Script for the Voice Actor

How you format a script determines how fast the recording session goes. A clean, well-formatted script is a professional courtesy — and it has a direct effect on cost and quality, especially if the voice actor is working from a home studio without a director on the line.

Use a two-column format when possible: left column for screen description or slide reference, right column for the spoken script. This lets the voice actor understand what is happening on screen and adjust tone accordingly, even if they never see the actual course.

Mark any words that need special emphasis using bold. For technical terms with non-obvious pronunciation — brand names, acronyms, proper nouns in other languages — include a phonetic guide in brackets immediately after: Voicfy [VOYS-fee], SCORM [SKORM]. Do this even if you think the pronunciation is obvious. Voice actors handle dozens of clients and do not always have background context on your product.

Avoid ellipses (...) to indicate pause. Use a hard line break or write [0.5s pause] explicitly. Ellipses are ambiguous — they might indicate hesitation, trailing off, or a list continuation. Stage directions are unambiguous.

Common Mistakes That Kill Recordings

Acronym overload is the most common issue. A script full of initialisms — LMS, SME, KPI, SOP — is exhausting to perform and hard to follow as a learner. Spell out any acronym on first use, even if your internal audience knows it. Voice actors are not insiders.

Tongue-twisters are more common in professional scripts than you would expect. Alliteration, repeated consonant clusters, and sequences of similarly structured sentences create real performance difficulties. “Successful stakeholder skill-sets” looks fine in a Word document. Try saying it at pace, five times, in a recording booth.

Overly long sentences are the other consistent problem. A 40-word sentence that requires a nested clause to make its point will always be re-read in the final audio, even by an experienced voice actor. The fix is a rewrite, not a retake.

How to Brief the Voice Actor

A script alone is not a complete brief. A good voice direction note should fit in one short paragraph and cover three things: the audience, the tone, and one or two concrete reference points.

For example: “This is onboarding content for new sales hires at a financial services company. Tone is warm and encouraging — like a knowledgeable colleague explaining the ropes, not a compliance officer. Think clear and supportive, not formal. Similar energy to [reference recording or example].”

If you are posting the project on a platform like Voicfy, include this direction note in the project brief alongside the script. Voice actors who understand the audience and the context deliver a more accurate first take — which means fewer revisions and faster turnaround.

Reviewing Before You Send

Before the script leaves your hands, run through this checklist:

  • Read the entire script aloud at recording pace — fix anything that trips you
  • Check the word count against your module timing targets
  • Confirm every acronym is spelled out on first use
  • Verify all brand names and technical terms have phonetic guides where needed
  • Check that the script and storyboard are in sync — no redundancy, no gaps
  • Remove all filler phrases: “as we mentioned earlier”, “moving on now”, “let's take a look at”

This review takes ten minutes and consistently catches the kinds of errors that would otherwise cost you a revision round. A voice actor who receives a clean, well-structured script delivers a cleaner recording, faster — and the difference shows in the finished course.

Voicfy

Ready to hire a voice actor?

Post a brief, receive quotes from curated native talent, and get broadcast-ready audio within 48 hours.

Post a Project