For this I chose to look at the iconic training montage from Rocky IV. The sequence where Rocky Balboa prepares for his big fight against Ivan Drago. It’s longer than many typical scenes, which gives rich material for visual, audio and combined analysis. I followed the three-step method inspired by Roger Ebert: look only, listen only, then both together.
First Viewing – Visuals Only
Watching the montage with the sound off revealed how much storytelling happens purely through visuals. We see Rocky practicing in harsh winter conditions — running through snow, chopping wood, punching meat carcasses, climbing icy stairs. The camera alternates between wide shots of the environment (vast and cold) and tight close-ups on Rocky’s face and muscles. The framing often places Rocky in the center or slightly left of center, as though he’s battling both around him and internally. The cold, muted color palette conveys isolation and intensity. Using Ebert’s framing ideas, Rocky’s prominence in the frame (often on the right when he’s victorious, on the left when struggling) visually maps his journey from underdog to contender. The repeating motion of climbing, punching, sweating without words shows struggle, progress, and transformation purely through image. The lack of sound actually magnifies the effort in the visuals – I found myself rooting for him even without hearing a single note of music.
Second Viewing – Audio Only
Now focusing only on the soundtrack and ambient audio: the pounding of feet on snow, heavy breaths, the thud of meat hitting the floor, the creaking of logs, the rhythmic shots of punches and sled pulls. And of course the driving 1980s rock soundtrack (for example, “Hearts On Fire”) which builds energy and momentum. Without visuals, the audio alone builds a narrative of exertion and ascent. The beat of the music syncs with the action, every push, every punch, every climb is punctuated by the score. From Ebert’s perspective, sound becomes story: we don’t see the steps, but we hear the muscle burn, the cold bite, the acceleration of purpose. The music does more than entertain, it becomes Rocky’s inner voice urging him on.
Third Viewing – Both Together
Watching the sequence together, which I have seen many times, felt different. I felt like I actually analyzed the film better and noticed how the camera work establishes scale and struggle, the editing speeds up and slows down to match Rocky’s rhythm; the music amplifies the emotional stakes. Rocky’s placement in the frame shifts from small and dominated by the environment to large and integral in the frame, this mirrors his narrative arc of growth. The cold setting, the harsh training, the isolation: visually they show what he’s enduring. Audibly, the soundtrack and sound effects show how he’s enduring it. I felt like Ebert’s idea of “reading” the movie really came alive here, the camera angle, the editing pace, the placement of Rocky in the frame, the shift in tone, all of these speak meaning without needing explicit dialogue. For instance, early on, when he’s alone and struggling, the shots are wider, he’s smaller, the music slower. Later, close-ups, faster cuts, louder music indicate transformation.
Breaking down a sequence like this helped me understand that film storytelling isn’t just about dialogue or plot, but it’s about how we are made to feel the journey. The bore of the cold morningsm the aches of muscles, the surge of feeling ready. The montage doesn’t explicitly tell us that Rocky is ready, but it shows us and we feel it. Elbert’s reading method gave me tools to see the craft behind what we usually just “watch”. I never thought that just a training montage could be filled with so much. I feel like the next time I watch a movie, especially one with little words, I will actually be able to understand what the purpose was with how the people filmed it.
The clip I watched:

Leave a Reply