Use spoken audio for explanations paired with visual diagrams
Split visual and auditory working memory resources to process more information than either channel alone can handle.
Why it works
Working memory has separate, partially independent processing pathways for visual-spatial and phonological information (Baddeley’s model). When a diagram is explained via spoken narration rather than on-screen text, the two streams are processed in parallel rather than competing for the same visual channel — effectively doubling the throughput. The modality effect means that a diagram plus narration is reliably better for learning than the same diagram plus a text caption, especially under high-load conditions.
How to do it
- When explaining a complex diagram or visual, use spoken narration rather than on-screen text.
- Ensure the narration describes the visual directly (not background information), so both channels work on the same content.
- For self-study of visual material, try reading explanatory text aloud while viewing the diagram.
- Avoid presenting the same words as both text on-screen and spoken — that is redundancy, not modality use.
Evidence
The modality effect is well replicated in CLT research, with controlled studies showing audio narration + visuals outperforms written text + visuals for complex material. Baddeley’s multi-component working memory model provides the theoretical grounding. (rct)
The modality effect is most pronounced for novices and for high-interactivity material; for simple content, the difference is negligible.
Sources
- Sweller, Ayres & Kalyuga (2011), Cognitive Load Theory, Springer
- Baddeley (2000), The episodic buffer: a new component of working memory, Trends in Cognitive Sciences
Common mistake
Reading on-screen text aloud verbatim — which activates the redundancy effect — rather than narrating the visual and trusting the learner to read supplementary text independently.
Practice this with IX Coach
IX Coach combines text and adaptive spoken prompts when introducing complex concepts, routing information across both channels to keep total cognitive load within your capacity.
7 days free, then $40/month (~$1.30/day).