Workflows
How to Make an AI Podcast Trailer in 2026 (60-90s)
A step-by-step workflow for producing a 60-90 second AI podcast trailer that sells the episode in the first three seconds and converts cold scrollers into listeners.
A podcast trailer is not a teaser. A teaser hints. A trailer sells. In 2026, the difference between an episode that gets 400 plays and one that gets 40,000 is almost always the 60-90 second cutdown you post the morning it goes live. Most podcasters still treat the trailer like an afterthought — a static waveform with a quote slapped on top. That format died in 2023 and the algorithm has been punishing it ever since.
The right AI stack lets a solo host produce a cinematic trailer in about 25 minutes, end to end. Here is the exact playbook we use.
1. Pick the moment before you pick the visuals
Open the episode transcript. Read it once. Highlight every line that makes you stop scrolling — the contradictions, the confessions, the numbers that sound made up but aren't. You are looking for one anchor quote of 8-15 seconds. Not a summary. A moment.
Three trailer-grade moments to hunt for:
- The reveal: the guest admits something they have never said publicly.
- The receipt: a specific number, date or name that anchors the claim in reality.
- The reversal: the host asks one thing, the guest answers the opposite, and the room goes quiet.
If the episode does not contain at least one of these, the trailer will not convert. Re-record the cold open before you bother editing.
2. Write the trailer script as a three-act structure
The 60-90 second window is short enough to be tight and long enough to need shape. Treat it like a film trailer:
- Seconds 0-5: the hook. A single line of voiceover plus one striking visual. No logo, no intro music swell, no host name yet.
- Seconds 5-45: the guest tease. The anchor quote, intercut with two or three supporting beats from the conversation.
- Seconds 45-90: the CTA. Episode title, where to listen, and one final line that re-justifies the click.
Write the whole thing in a doc before you open any tool. If you cannot read it aloud in 75 seconds with breath room, it is too long.
3. Generate the visual world of the episode
This is where AI replaces the stock footage trap. Most podcast trailers reuse the same five b-roll clips — neon city at night, a generic studio, hands typing on a keyboard — and every viewer has seen them a thousand times. Build a custom visual world for each episode instead.
Open Versely's AI b-roll generator and write one prompt per beat in the script. If the guest is talking about scaling a SaaS company, you need a shot of a server room at golden hour, a founder pacing in an empty office, a Slack notification on a phone screen. If the guest is a chef, you need a sizzling pan in slow motion, a knife on a wooden board, steam rising from a stockpot.
Use VEO 3.1 or SORA 2 for the hero shots that need camera movement. Drop down to Kling 3.0 or Hailuo for tighter background inserts where the cost per second matters more than the cinematic feel. Generate three to five seconds per beat — never more. Trailers live or die on cut frequency.
4. Lock a voice that sounds like the show
If the host has a recognisable voice, clone it once with AI voice cloning and reuse the clone for every trailer going forward. This solves three problems at once: you can record the trailer voiceover at 2am without setting up the mic, you can produce multilingual versions of the same trailer for international distribution, and you keep the brand voice consistent even when the host is travelling.
If you are launching a new show and have no voice yet, pick an ElevenLabs v3 voice and stick with it for the first 20 episodes. Voice consistency is a retention lever — listeners build a parasocial relationship with the sound, not the script.
5. The 25-minute build workflow
Here is the exact order of operations. Do not improvise — the order matters because it stops you from re-rendering expensive video clips when the script changes.
- Lock the script. Paste the three-act draft into a doc and time it with a stopwatch. Trim until it lands at 75 seconds spoken.
- Generate the voiceover. Run the script through your cloned voice or chosen ElevenLabs v3 voice. Export as a single WAV. This is your timing spine.
- Generate the b-roll prompts in batch. Open the AI b-roll generator and queue every clip in one session. Sample prompts:
- "Cinematic wide shot of a founder pacing in an empty glass-walled office at sunrise, golden hour, shallow depth of field, slight handheld motion, anamorphic lens, VEO 3.1"
- "Macro close-up of a vinyl record dropping onto a turntable, dust particles in a single beam of light, cinematic, 5 seconds"
- "Slow push-in on a vintage radio microphone in a dimly lit studio, warm tungsten light, shallow focus, cinematic 35mm film grain"
- Score the trailer. Generate a 90-second instrumental in Suno v5.5 or Lyria with a clear tempo and a single drop around the 45-second mark. Prompt it specifically: "Cinematic trailer score, slow build, single drop at 45 seconds, sparse piano into driving 808s, 90 seconds total."
- Assemble in the AI video generator. Drop the voiceover on track one, the music on track two, and align every b-roll clip to land on a downbeat. Cut every 1.2-1.8 seconds in the first 30 seconds. Slow the cuts to every 2.5 seconds in the back half.
- Add captions. Burn in word-by-word captions on the bottom third. Around 70% of listeners will discover the trailer with sound off.
- Generate the thumbnail. Use the AI thumbnail generator for the YouTube and Spotify Video versions. One face, one big number, one contrasting colour.
Total active time once you have done it twice: 22-28 minutes.
6. The five mistakes that kill trailer performance
Avoid these and you will out-perform 90% of podcast trailers shipped this quarter.
- Opening with the show logo. No one watches a logo. Open with the guest mid-sentence and put the logo on second 50.
- Running the intro music under the hook. The hook needs silence around it. Music starts at the second beat, not the first.
- Letting the trailer breathe. Breathing room is a luxury for 30-minute documentaries. A 75-second trailer should feel slightly too fast.
- Static waveform plus quote card. Already covered. Still happening. Stop it.
- Posting the trailer once. Post it as a Reel, a Short, a TikTok, a LinkedIn native video, and an X video — five separate uploads, each with platform-native captions and aspect ratios. Same trailer, five times the surface area.
FAQ
How long should an AI podcast trailer be in 2026?
Sixty to ninety seconds is the sweet spot for cross-platform distribution. Anything under 45 seconds does not give the guest tease enough room to land. Anything over 100 seconds gets cut by the platform algorithms when it crosses to Reels and Shorts. Build to 75 seconds and trim if needed.
Can I use AI voice cloning if my podcast already has a real host?
Yes — that is exactly the use case. Clone the host once with a clean three-minute sample, then use the cloned voice for all trailer voiceovers, multilingual versions, and pickup lines. The cloned voice is for marketing assets, not for the episode itself.
What is the right music style for a podcast trailer?
Match the show's energy and contradict the listener's expectation. Business shows over-index on cinematic strings — try something with more rhythm. True crime over-indexes on tension drones — try something melodic. Generate three options in Suno v5.5 and A/B test on your first two trailers.
Should every episode get a trailer?
Yes, but not the same kind. Cornerstone interview episodes get the full 75-second treatment. Mid-week solo episodes get a 30-second cutdown built from one anchor quote. Bonus episodes get a 15-second teaser. The format scales with the asset.
Where should I post the trailer first?
Post the vertical 9:16 cut to TikTok and Reels at the same minute the episode goes live, then upload the 1:1 version to LinkedIn and Instagram feed two hours later, and the 16:9 version to YouTube as a community post. Stagger the drops so the algorithm reads each one as native, not as a re-share.
Ship the trailer before the episode goes live
The trailer is the highest-leverage asset in the entire podcast workflow. A great episode with no trailer dies in the back catalogue. A mediocre episode with a great trailer pulls 10x the listens of the average release. The AI stack means you have no excuse to skip it.
Build the workflow once, save the prompts, and treat trailer production as the non-negotiable last step before any episode ships. Want a deeper system for turning every episode into multi-platform content? Read the AI content creation 2026 playbook next.