LosAgency Reference Framework · v1

The Anatomy
of a Short-Form
Video Ad.

A reference framework for how 6-, 15-, and 30-second video ads are structured — a shared language for talking about the elements, durations, and decisions that shape short-form work.

Subject Short-form video advertising
Edition May 2026
Total Eight chapters
  The Anatomy of a Short-Form Video Ad01 · The Five Jobs
CHAPTER ONE

Every spot performs the same five jobs, in roughly the same order.

Whether it's six seconds or thirty, every short-form ad runs through the same five-step anatomy. The difference between durations isn't which jobs get done — it's how much room each job has to land.

01

The Hook

First 1–3 seconds

Stops the scroll. Earns the rest of the seconds. A visual surprise, a question, motion that can't be ignored, a face you recognize, a stat that punches.

02

The Context

Seconds 2–5

Tells the viewer who this is for and what it is. Brand cue, product reveal, category line. The "this is X, for Y" beat.

03

The Value Beats

The middle

Stacks the reasons-to-care. One per beat. Not a list — a build. Punchy callouts, ingredient stats, social proof, quick-cut variety. Each beat earns the next.

04

The Outro

Last 2–4 seconds

Resolves what the spot has been building toward. Names the offer or the feeling. One bold line that lands clean.

05

The End-Card

Final 1–2 seconds

Tells the viewer exactly what to do next. Static. Logo lock-up, CTA, market-specific info. Holds long enough to be screenshotted; readable without sound.

  The Anatomy of a Short-Form Video Ad02 · The Six-Second Formula
CHAPTER TWO

Six seconds. One truth.

Six seconds is not enough room for an arc. Pick the single most important thing the viewer needs to feel — and lead with it. Everything else is supporting cast.

The "One Truth" Structure

0:06
0–1s · Hook Visual stop.
1–3s · Context + Truth The single claim.
3–5s · Outro Resolve the moment.
5–6s End-card + CTA.
0s1.5s3s4.5s6s

Pacing. 4–6 cuts maximum. Each cut advances the truth — no decorative footage. The opening hook is visual only: action carries it; text waits for the category line and truth. End-card holds at least one full second.

  The Anatomy of a Short-Form Video Ad03 · The Fifteen-Second Formula
CHAPTER THREE

Fifteen seconds. Hook + three.

The default short-form unit. Long enough to build a small case, short enough that every beat must earn its place. Three middle beats is the sweet spot — fewer feels light, more feels rushed.

The "Hook + Three" Structure

0:15
0–2s · HookStop the scroll.
2–4s · ContextBrand cue.
4–7s · Beat 1Reason #1.
7–10s · Beat 2Reason #2.
10–13s · Beat 3Reason #3.
13–14sOutro.
14–15sEnd-card.
0s3s6s9s12s15s

Pacing. A new visual idea every 1.5–2 seconds. The three middle beats should feel different from each other — different shot scale, different motion, different angle — so the eye stays alert.

  The Anatomy of a Short-Form Video Ad04 · The Thirty-Second Formula
CHAPTER FOUR

Thirty seconds. A full arc.

Now there's room for a story shape: setup → escalation → climax → resolution. Five middle beats. The risk doubles — more time means more places to lose the viewer. Pacing has to build, not stay flat.

The "Full Arc" Structure

0:30
0–3s · HookOpen big.
3–6s · ContextSet the world.
6–10s · Beat 1Setup.
10–14s · Beat 2Escalate.
14–18s · Beat 3Variety break.
18–22s · Beat 4Climax.
22–28s · OutroLand message.
28–30sEnd-card.
0s6s12s18s24s30s

Pacing. Cut frequency accelerates from middle to outro — slower setup beats, faster climax beats, then a deliberate breath into the end-card. Energy curves up; final card holds 1.5–2 seconds so the brand mark is unmissable.

  The Anatomy of a Short-Form Video Ad05 · The Science
CHAPTER FIVE

The numbers that shape every choice.

1.7s
Average dwell on a feed video before the viewer scrolls past.
Mobile feed · 2024–25
50%
Drop-off in the first three seconds of mobile feed video.
Cross-platform median
1.5s
Time a viewer needs to read a six-word headline at standard size.
Reading-rate research
01 · THE HOOK

Why the first 1–3 seconds carry everything.

On a phone in a vertical feed, the viewer decides whether to keep watching in roughly the time it takes to read a single sentence. If the first 1.5–3 seconds don't create a reason to stay, the viewer scrolls — and the rest of the ad never plays. The hook isn't an opening; it's the entire ad's permission to exist.

02 · THE CUTS

Why the picture changes every 1.5–2 seconds.

Visual attention rebuilds itself with each new frame. Holding a single shot too long lets the viewer's gaze drift off-screen. A cut, a camera move, or a fresh element resets attention. Too much, and the viewer can't track meaning; too little, and they tune out. The 1.5–2s rhythm is the working compromise.

03 · THE CARD

Why the end-card is static and held.

After the kinetic middle, the brain needs a beat of stillness to encode the brand and the action. A flashy or moving end-card reduces brand recall. The card should be readable without sound, hold long enough to be screenshotted, and answer one question: what next?

  The Anatomy of a Short-Form Video Ad06 · Visual Choices
CHAPTER SIX

Visual choices that carry meaning.

Beyond structure, four categories of choice shape how the ad feels. Each carries its own performance signal independent of the writing or the visuals themselves.

Aspect Ratio

9:16 vertical — Reels, TikTok, Shorts, Stories. Mobile-native, full-screen. Where 80% of short-form lives in 2026.

1:1 square — feed, in-line. Lower commitment from the viewer; shorter messages work better here.

4:5 portrait — feed-optimized portrait. More vertical real estate than square; good for in-line stops.

16:9 horizontal — YouTube pre-roll, connected TV. Longer dwell, sound-on, often family co-viewing.

Cuts & Motion

Fast cuts = energy, urgency, modernity. Use for hooks and climax beats.

Held shots = trust, calm, quality cues. Use for ingredient or product hero moments — the viewer needs to see.

Camera movement (push-in, pan, parallax) substitutes for cuts when continuity matters but the picture still needs life.

Pattern interrupts — a sudden style shift every 4–6 seconds keeps a 30 from feeling slow.

On-Screen Text

On phones, sound-off is the default — roughly 85% on Meta, 50% on TikTok. Text is the script for half the audience.

Six-word rule. Any line of text must be readable in 1.5 seconds or less. If it can't be read aloud in that window, cut it down.

Place text away from the bottom 15% of vertical frames — that's where platform UI sits (captions, profile pic, CTA button).

Sound & Music

Sound design has to work without being heard. Match-cut on beat (snap, hit, drop) so even on mute, the rhythm reads visually.

Voiceover should never carry information that's not also on screen. Treat VO as a bonus layer, not a load-bearing one.

Music genre is a persona signal — energy/club for 18–24, indie/folk for 25–34 lifestyle, hip-hop instrumental for fashion-forward, lo-fi for wellness. Choose deliberately.

  The Anatomy of a Short-Form Video Ad07 · A Worked Example
CHAPTER SEVEN

One idea, three durations.

The same idea — "Real Energy. Real People. Feel-Good. Do Good." — sized across all three formulas. Notice how the hook, brand cue, and end-card stay constant across cuts; only the middle stretches.

The 6-Second Cut

TimeJobWhat's on screen
0–1sHookSkater explodes through frame, fruit splash freeze.
1–3sContext + TruthProduct hero shot. Caption: "Real rides."
3–5sOutroHeadline: "Real Energy. Real People."
5–6sEnd-CardLogo + CTA: "Find a location nearest you."

The 15-Second Cut

TimeJobWhat's on screen
0–2sHookSkater + fruit explosion. Caption: "Pick your ride."
2–4sContextBrand cue, product reveal.
4–7sBeat 1 — PeachBright color flash + can.
7–10sBeat 2 — Strawberry LemonadeBubble cascade.
10–13sBeat 3 — Tropical MangoGolden splash.
13–14sOutro"Four Flavors. One Smooth Ride."
14–15sEnd-CardLogo + CTA.
  The Anatomy of a Short-Form Video Ad07 · A Worked Example, cont.
CONTINUED

The 30-Second Cut.

TimeJobWhat's on screen
0–3sHookSkater roll-in, neon-rink reveal. Caption: "Refreshed. Turned up."
3–6sContextSkaters laughing, brand cue, can entrance.
6–10sBeat 1 — Real EnergyIngredient overlay: 80mg natural caffeine.
10–14sBeat 2 — Real PeopleReal-skater montage, no models.
14–18sBeat 3 — VarietyPattern-break flavor reel: Peach / Lemonade / Mango / Punch.
18–22sBeat 4 — Feel-GoodCommunity moment: cheers, group skate.
22–28sOutroHeadline + sub: "Real Energy. Real People. / Feel-Good. Do Good."
28–30sEnd-CardLogo + market-specific CTA.
What stayed constant: the hook image, the brand cue, the headline, and the end-card. What stretched: how many proof beats fit. That consistency is what makes a campaign feel like a campaign.
  The Anatomy of a Short-Form Video Ad08 · Glossary
CHAPTER EIGHT

A shared vocabulary.

The words the work uses — handy reference for craft conversations across writing, edit, and post.

Hook
The first 1–3 seconds whose only job is to stop the scroll. Every other decision in the ad assumes the hook worked.
CTA
Call to action. The line that tells the viewer what to do next. Every spot has exactly one primary CTA, usually placed on the end-card.
Mid-CTA
Internal beats that tee up the final CTA. Not action prompts — mini-claims that make the final CTA feel earned.
End-Card
The final 1–2 seconds: a static frame with logo, CTA, and any market-specific info. The viewer's takeaway frame.
Outro
The 2–4 seconds before the end-card where the spot resolves emotionally. Different from the end-card: outro is feeling, end-card is action.
Cutaway
A short shot inserted between two main shots — a detail, a texture, a motion grab — to vary the picture.
Beat
A self-contained 2–4 second unit of meaning inside the middle of an ad. One claim per beat.
Pattern Interrupt
A deliberate visual or rhythmic shift — color flip, speed change, format change — that resets viewer attention.
Pacing
How frequently the picture changes. Cuts-per-second. Faster pacing = higher energy, lower clarity.
Aspect Ratio
The shape of the frame. 9:16 (vertical) for Reels, TikTok, Shorts; 1:1 and 4:5 for Feed; 16:9 for YouTube pre-roll and CTV.
Cut
The instant transition between two shots. The default in short-form. Anything fancier (dissolve, wipe) is a deliberate slowdown signal.
Match-Cut
A cut where the next shot mirrors the geometry, motion, or sound of the previous shot. Used to build energy invisibly.
VO (Voiceover)
Spoken narration. On mobile, treat as bonus content — never load-bearing.
Sound Design
Non-music audio: whooshes, impacts, risers, designed-to-cut hits. What makes a fast-cut middle feel intentional.
Variant / Cut
A different version of the same spot — shorter, different market, different language. Same idea; the structure adapts.