Veo Image-to-Video Prompts

These are tested Veo prompts for image-to-video — animating a still photo, portrait, product shot, or artwork. The single most important rule: the image already defines the subject, setting, and style, so your prompt should describe only the motion, the camera, and the audio — never re-describe the scene. Copy a prompt, attach your image in Veo, and change the bold details. This cluster is part of the wider Veo Prompt Library, which also covers product, UGC, and dialogue prompts.

The image-to-video mindset

In text-to-video you build the whole world with words. In image-to-video, the world already exists in the picture — your job is to add life to it. Spend the prompt on three things: what moves (and how forcefully), how the camera behaves, and what the scene sounds like. If you start re-describing what is already visible, you give Veo room to reinterpret and drift away from your image.

How to write Veo I2V prompts

Want to structure the motion and camera cues without starting from a blank prompt? Open the Veo Prompt Builder — the cinematic preset’s camera-move and lighting fields work well as a starting point for I2V motion, even though there is no dedicated I2V preset yet. For a photo of a person who needs to talk, pair it with the dialogue and audio prompts; for a product photo, see the product video prompts for the hero-shot language to reuse.

Got your image? Run it on a Veo-capable tool

Image-to-video needs a model that accepts an input image plus a prompt. If you do not have direct Veo access, Pollo AI supports image-to-video across Veo and other models in one place, so you can attach your image, run this prompt, and re-roll the motion. Disclosure: affiliate link — we may earn a commission if you subscribe, at no extra cost to you. We only recommend tools we would use ourselves.

Prompt deck

Copy a format, check the evidence, then customize it.

18 prompts 8 evidenced 8 community 0 owner-tested

Veo / Image-to-video (I2V)

Animate a product photo (subtle hero motion)

Prompt
Slow 90-degree orbit around the product, shallow depth of field. The surface catches a soft moving highlight as the camera moves. Gentle drifting studio haze in the background. Keep the product still and sharp; only the camera and the light move. Photoreal, smooth gimbal motion, 8 seconds. Ambient noise: faint studio room tone. No background music.

TweakFor a busier shot, change "slow 90-degree orbit" to "slow push-in"; the image already defines the product, so never re-describe it here.

Veo / Image-to-video (I2V)

Bring a portrait to life (person photo)

Prompt
The person breathes softly, blinks naturally, and turns their head slowly toward the camera with a faint, warm smile. Hair and clothing shift gently as if a light breeze passes. Subtle handheld camera, no scene change. Photoreal, natural motion. Ambient noise: soft outdoor air. No background music.

TweakKeep motion small for realism. Adding too many actions makes faces drift — one head turn plus a blink reads more natural than a full performance.

Veo / Image-to-video (I2V)

Make a portrait speak (photo + dialogue)

Prompt
The person looks into the camera and says, in a calm, friendly voice, "Thanks for stopping by — let me show you around." Natural blinking and subtle head movement, lips synced to the line. Static framing, photoreal. No background music, no on-screen text.

TweakKeep the line under ~12 words for an 8-second clip. Use the says, "..." pattern so Veo lip-syncs instead of printing captions.

Veo / Image-to-video (I2V)

Animate a landscape photo (atmospheric motion)

Prompt
Slow cinematic drift forward across the scene. Clouds move gently across the sky, grass and trees sway in a light wind, water ripples and catches the light. The light shifts subtly as if time is passing. Photoreal, smooth motion, 8 seconds. Ambient noise: wind through grass, distant birdsong. No background music.

TweakUse force verbs — "sway", "ripple", "drift" — so motion feels weighted. Vague verbs make the scene float unnaturally.

Veo / Image-to-video (I2V)

Animate artwork / illustration (stylized motion)

Prompt
Bring the illustration to life with subtle parallax: the foreground drifts slightly faster than the background as the camera pushes in slowly. Small living details animate — flickering light, drifting particles, a gentle sway. Keep the original art style, colors, and linework intact. Smooth motion, 6 seconds. Ambient noise: soft atmospheric tone. No background music.

TweakAdd "keep the original art style intact" so Veo animates the image rather than re-rendering it into photoreal.

Veo / Image-to-video (I2V)

First-frame → last-frame transition

Prompt
Smoothly transition from the first frame to the final frame over 8 seconds. The camera moves steadily and the subject shifts naturally between the two compositions, with no hard cut. Maintain consistent lighting and character appearance throughout. Photoreal, fluid motion. Ambient noise: matching room tone. No background music.

TweakSupply two images (start and end). Describe the path between them, not the frames themselves — Veo already has both endpoints.

Veo / Image-to-video (I2V)

Cinematic reveal from a still

Prompt
Begin tight on a detail in the image, then slowly pull back and crane up to reveal the full scene and its scale. Steady continuous camera move, dramatic and smooth. Light blooms gently as the frame widens. Photoreal, cinematic, 8 seconds. Ambient noise: rising ambient swell matched to the scene. No music.

TweakChange "crane up to reveal scale" to "push in to a single detail" to invert the move from reveal to focus.

Veo / Image-to-video (I2V)

Looping ambient motion (background / wallpaper)

Prompt
Add slow, continuous, seamless motion suitable for a loop: gentle drifting fog, slowly falling particles, soft shifting light. The camera holds nearly still. Nothing enters or leaves the frame so the clip can loop cleanly. Photoreal, calm, 8 seconds. Ambient noise: soft steady atmosphere. No background music.

TweakKeep all motion subtle and centered so the end can blend back to the start — anything that crosses the frame breaks the loop.

Veo / Image-to-video (I2V)

Add action to a static scene (UGC / lifestyle photo)

Prompt
The **person in the photo** picks up the **coffee cup on the table**, takes a sip, and sets it back down, glancing at the camera with a relaxed smile. Natural, weighted hand motion. Subtle handheld camera. Photoreal, authentic look, 8 seconds. Ambient noise: quiet cafe room tone, a faint cup clink. No background music.

TweakName objects already in the image ("the coffee cup on the table") so Veo animates them instead of inventing new ones.

Veo / Image-to-video (I2V)

Multi-reference fashion scene (dress + person + accessory pattern)

Builder
Prompt
The video opens with a medium, eye-level shot of a beautiful woman with dark hair and warm brown eyes. She wears a magnificent, high-fashion flamingo dress with layers of pink and fuchsia feathers, complemented by whimsical pink, heart-shaped sunglasses. She walks with serene confidence through the crystal-clear, shallow turquoise water of a sun-drenched lagoon. The camera slowly pulls back to a medium-wide shot, revealing the breathtaking scene as the dress's long train glides and floats gracefully on the water's surface behind her. The cinematic, dreamlike atmosphere is enhanced by the vibrant colors of the dress against the serene, minimalist landscape, capturing a moment of pure elegance and high-fashion fantasy.

TweakThe official pattern for combining three separate reference images (dress, person, accessory) into one scene. Supply your own three reference images and describe how they combine and move together; the text carries the camera move and setting, not the garment/accessory details already in the images.

Veo / Image-to-video (I2V)

Extend an existing clip with a new action (Google example)

Why it works Because the source is a video (not a still), Veo already has the motion, camera style, and pacing established — the prompt only needs to specify the next beat ("slowly descending"), which is why a four-word instruction is enough to produce a matched continuation instead of a jarring cut.

Prompt
Extend this video with the paraglider slowly descending.

TweakThe video-extension workflow: feed Veo an existing clip instead of a still image, and describe only the next action. Swap the continuing action for whatever should happen next in your source clip.

CreditGoogle AI for Developers — Veo documentation

Veo / Image-to-video (I2V)

Origami subject animated then tracked into a new beat (Google example)

Why it works Naming the material and style explicitly in the action ("origami butterfly... flaps") rather than describing a photoreal butterfly is what keeps the animated result consistent with the source image's craft-paper look instead of drifting toward realism.

Prompt
An origami butterfly flaps its wings and flies out of the french doors into the garden.

TweakA stylized-subject animation: the prompt only adds motion (flapping, flying) and a destination, letting the source image keep the origami look. Use this pattern for any stylized/illustrated subject you want to bring to life without losing its art style.

CreditGoogle AI for Developers — Veo documentation

Veo / Image-to-video (I2V)

Continue an animated sequence with a new subject entering (Google example)

Why it works Starting the extension prompt with the existing camera instruction ("track the butterfly") anchors continuity with the prior clip before introducing the new subject, which is why the puppy's entrance reads as the next beat of the same scene rather than a jump cut.

Prompt
Track the butterfly into the garden as it lands on an orange origami flower. A fluffy white puppy runs up and gently pats the flower.

TweakThe follow-up extension to the origami butterfly clip above — it continues the camera move ("track the butterfly") and introduces one new subject with one clear action. Use this two-step pattern (animate, then extend with a new beat) to build a longer sequence from a single starting image.

CreditGoogle AI for Developers — Veo documentation

Veo / Image-to-video (I2V)

Animate a landscape photo into atmospheric drone motion (Google example)

Why it works Isolating exactly one moving element (the water) while naming the camera move explicitly gives Veo a focused animation target instead of trying to animate the entire frame, which is why the rest of the rainforest stays stable and photoreal.

Prompt
Tracking drone view of majestic Hawaiian waterfall within lush rainforest with realistic water flow.

TweakA dependable pattern for any landscape/nature photo: name the camera move ("tracking drone view") and the one physical detail that should animate ("realistic water flow"). Swap the location and the moving element — mist, wind in leaves, waves.

CreditGoogle AI for Developers — Veo documentation

Veo / Image-to-video (I2V)

Reference-image surreal macro world (Google example)

Prompt
A surreal, cinematic macro video. Tiny surfers ride perpetual, rolling waves inside a stone bathroom sink. A running vintage brass faucet generates the endless surf. The camera slowly pans across the whimsical, sunlit scene as the miniature figures expertly carve the turquoise water.

TweakOfficial Google I2V pattern: the input image supplies the sink world, while the prompt spends words on water motion, miniature action, camera, and atmosphere.

Veo / Image-to-video (I2V)

Animate a stylized creature render (3D/cartoon subject)

Why it works Repeating the style descriptor ("3D animated") alongside the action keeps Veo anchored to the source image's rendering style through the motion, which is the main failure mode for I2V on non-photoreal images — style drift is more common than subject drift on stylized inputs.

Prompt
3D animated scene: cute snow leopard-like creature prances through whimsical winter forest with falling snowflakes.

TweakUse for any 3D-render or cartoon-style source image. Naming the render style ("3D animated scene") in the motion prompt, not just the subject, is what keeps stylized images from drifting toward photoreal — swap the creature and the environment.

CreditGoogle AI for Developers — Veo documentation

Veo / Image-to-video (I2V)

Ingredients-to-video character costume (Google example)

Prompt
Create a silly cartoon version of the fish wearing the costume, swimming and waving the wand around.

TweakUse when you have separate reference images for a character and an asset. Keep the prompt short: name the transformation and the motion, then let the references carry identity and costume details.

Veo / Image-to-video (I2V)

First/last-frame cliff transition (Google example)

Prompt
Show what happens when the car takes off from a cliff.

TweakFor first-frame/last-frame workflows, the images define the beginning and target composition. The text only describes the transition beat between them.

FAQ

How is an image-to-video prompt different from a text-to-video prompt?

With image-to-video, the picture already defines the subject, setting, composition, and style. Your prompt only needs to describe what moves, how the camera behaves, and what audio plays. Re-describing the scene wastes the prompt and can fight the image.

Why does my image drift or morph into something else?

Usually too much motion or a prompt that re-describes the scene. Keep actions minimal and weighted, do not restate what the image shows, and add "keep the subject consistent" for faces and products.

Can I use a first and last frame in Veo?

Yes — Veo 3.1 supports start-and-end-frame interpolation. Provide both images and describe the transition between them rather than each frame. It gives the most control over how a clip begins and ends.

Does image-to-video generate audio?

Veo 3.1 image-to-video supports native audio. Note that the separate add/remove-object image-edit path runs on Veo 2 and does not generate audio, so check which feature you are using.

How do I keep a character consistent across shots?

Use the reference-image ("ingredients") feature: supply images of the character, object, or style and Veo carries that look across multiple clips, now with audio.