These are tested Veo prompts for dialogue, lip-sync, sound effects, and ambient audio — the part most AI video models still get wrong. To make a character speak, write the line in quotation marks after a speech verb and a comma: A woman says, "We have to leave now." That comma-and-quotes pattern is what triggers Veo’s lip-sync; without it, the words often appear as on-screen text instead. Copy any prompt below, paste it into Veo, and change the bold details. This cluster is one part of the full Veo Prompt Library, covering product, UGC, and image-to-video prompts as well.
Why Veo’s audio is different
Veo generates picture and sound together in one pass, not as a soundtrack bolted on afterward. That joint process is why lip movements line up with the words and why ambient sounds match the scene. It also means you should prompt audio as its own layer of the scene — name the dialogue, the sound effects, and the ambience explicitly — rather than tacking “with sound” onto the end of a visual prompt and hoping.
How to write Veo audio prompts
- Dialogue:
[Character] says, "[line]" — keep the comma and the quotation marks. Add a voice cue before the quote (says in a calm, low voice) to direct the performance.
- Sound effects: prefix with
SFX: and name each sound precisely — SFX: the sharp clang of hammer on steel, not “blacksmith sounds”.
- Ambient: prefix with
Ambient noise: for the environmental bed — rain, room tone, distant traffic.
- Protect the lip-sync: add
no background music when dialogue matters; Veo can add music by default and it competes with speech.
- Stop caption bleed: add
no on-screen text, no subtitles if Veo keeps printing your words on screen.
- Mind the clock: clips are 4, 6, or 8 seconds. Keep one short line per character so it is not rushed or truncated.
Ready to script a scene? Build it in the Veo Prompt Builder — the dialogue preset starts from these same says-quote and no-subtitles defaults. If the delivery is more casual than scripted, the UGC-style prompts show the handheld, selfie-vlog version of a spoken line, and the product video prompts show how to add a short spoken hook to an ad.
Got your prompt? Run it on a model with native audio
These prompts need a Veo-capable runner. If you do not have direct Veo access, Pollo AI lets you run Veo and other video models in one place, so you can test a dialogue prompt and re-roll without juggling accounts. Disclosure: this is an affiliate link — we may earn a commission if you subscribe, at no extra cost to you. We only suggest tools we would use to run these prompts.
FAQ
How do I make a character actually speak in Veo?
Write the line in quotation marks after a speech verb and a comma — for example, A woman says, "We have to leave now." The comma and quotation marks are the cues that trigger lip-sync. Without them Veo may render the words as on-screen text or skip the dialogue entirely.
Why does Veo show my dialogue as subtitles on screen?
That usually happens when the line is not clearly framed as speech, or when no negative is set. Use the says, "..." pattern and add "no on-screen text, no subtitles" to the prompt.
How long can a spoken line be?
Veo clips are 4, 6, or 8 seconds, so keep a single spoken line to roughly 10–15 words. Longer lines get rushed or cut off. For a conversation, give each character one short line.
How do I control the voice or emotion?
Add a delivery cue before the quote: says in a weary voice, shouts excitedly, whispers nervously. Veo shapes the vocal performance from these descriptors.
Does native audio work on Veo 3.1 Lite and the free tier?
Veo 3.1 and 3.1 Lite generate native synchronized audio. Note that the Veo image-editing "add/remove object" path runs on Veo 2 and does not generate audio — verify your access tier before relying on dialogue.