Veo Image-to-Video Prompts

These are tested Veo prompts for image-to-video — animating a still photo, portrait, product shot, or artwork. The single most important rule: the image already defines the subject, setting, and style, so your prompt should describe only the motion, the camera, and the audio — never re-describe the scene. Copy a prompt, attach your image in Veo, and change the bold details. This cluster is part of the wider Veo Prompt Library, which also covers product, UGC, and dialogue prompts.

The image-to-video mindset

In text-to-video you build the whole world with words. In image-to-video, the world already exists in the picture — your job is to add life to it. Spend the prompt on three things: what moves (and how forcefully), how the camera behaves, and what the scene sounds like. If you start re-describing what is already visible, you give Veo room to reinterpret and drift away from your image.

How to write Veo I2V prompts

Describe motion, not the scene. “Slow push-in, the hair sways in a light breeze” — not a fresh description of the person.
Use weighted verbs. Sway, ripple, drift, push, pull. Vague actions make the clip float; force verbs give motion weight.
Name the camera move. Veo defaults to nearly static. State the dolly, orbit, crane, or push-in explicitly.
Keep it small for realism. One or two clear actions beat a busy performance, especially for faces and products.
Layer audio. Ambient noise: for the bed, SFX: for specific sounds, says, "..." to make a pictured person speak. Add no background music to protect dialogue.
For start/end control, supply two frames and describe the transition; for character consistency across clips, use the reference-image (“ingredients”) feature.

Want to structure the motion and camera cues without starting from a blank prompt? Open the Veo Prompt Builder — the cinematic preset’s camera-move and lighting fields work well as a starting point for I2V motion, even though there is no dedicated I2V preset yet. For a photo of a person who needs to talk, pair it with the dialogue and audio prompts; for a product photo, see the product video prompts for the hero-shot language to reuse.

Got your image? Run it on a Veo-capable tool

Image-to-video needs a model that accepts an input image plus a prompt. If you do not have direct Veo access, Pollo AI supports image-to-video across Veo and other models in one place, so you can attach your image, run this prompt, and re-roll the motion. Disclosure: affiliate link — we may earn a commission if you subscribe, at no extra cost to you. We only recommend tools we would use ourselves.

Pillar: Veo Prompt Library — all use cases in one place.
Veo Dialogue & Audio Prompts — make a pictured person speak.
Veo UGC-Style Prompts — animate selfie and lifestyle photos.
Veo Product Video Prompts — hero shots and ads from product images.

Prompt deck

Copy a format, check the evidence, then customize it.

18 prompts 8 evidenced 8 community 0 owner-tested

Veo / Image-to-video (I2V)

Animate a product photo (subtle hero motion)

Prompt

Slow 90-degree orbit around the product, shallow depth of field. The surface catches a soft moving highlight as the camera moves. Gentle drifting studio haze in the background. Keep the product still and sharp; only the camera and the light move. Photoreal, smooth gimbal motion, 8 seconds. Ambient noise: faint studio room tone. No background music.

Veo / Image-to-video (I2V)

Bring a portrait to life (person photo)

Prompt

The person breathes softly, blinks naturally, and turns their head slowly toward the camera with a faint, warm smile. Hair and clothing shift gently as if a light breeze passes. Subtle handheld camera, no scene change. Photoreal, natural motion. Ambient noise: soft outdoor air. No background music.

Veo / Image-to-video (I2V)

Make a portrait speak (photo + dialogue)

Prompt

The person looks into the camera and says, in a calm, friendly voice, "Thanks for stopping by — let me show you around." Natural blinking and subtle head movement, lips synced to the line. Static framing, photoreal. No background music, no on-screen text.

Veo / Image-to-video (I2V)

Animate a landscape photo (atmospheric motion)

Prompt

Slow cinematic drift forward across the scene. Clouds move gently across the sky, grass and trees sway in a light wind, water ripples and catches the light. The light shifts subtly as if time is passing. Photoreal, smooth motion, 8 seconds. Ambient noise: wind through grass, distant birdsong. No background music.

Veo / Image-to-video (I2V)

Animate artwork / illustration (stylized motion)

Prompt

Bring the illustration to life with subtle parallax: the foreground drifts slightly faster than the background as the camera pushes in slowly. Small living details animate — flickering light, drifting particles, a gentle sway. Keep the original art style, colors, and linework intact. Smooth motion, 6 seconds. Ambient noise: soft atmospheric tone. No background music.

Veo / Image-to-video (I2V)

First-frame → last-frame transition

Prompt

Smoothly transition from the first frame to the final frame over 8 seconds. The camera moves steadily and the subject shifts naturally between the two compositions, with no hard cut. Maintain consistent lighting and character appearance throughout. Photoreal, fluid motion. Ambient noise: matching room tone. No background music.

Veo / Image-to-video (I2V)

Cinematic reveal from a still

Prompt

Begin tight on a detail in the image, then slowly pull back and crane up to reveal the full scene and its scale. Steady continuous camera move, dramatic and smooth. Light blooms gently as the frame widens. Photoreal, cinematic, 8 seconds. Ambient noise: rising ambient swell matched to the scene. No music.

Veo / Image-to-video (I2V)

Looping ambient motion (background / wallpaper)

Prompt

Add slow, continuous, seamless motion suitable for a loop: gentle drifting fog, slowly falling particles, soft shifting light. The camera holds nearly still. Nothing enters or leaves the frame so the clip can loop cleanly. Photoreal, calm, 8 seconds. Ambient noise: soft steady atmosphere. No background music.

Veo / Image-to-video (I2V)

Add action to a static scene (UGC / lifestyle photo)

Prompt

The **person in the photo** picks up the **coffee cup on the table**, takes a sip, and sets it back down, glancing at the camera with a relaxed smile. Natural, weighted hand motion. Subtle handheld camera. Photoreal, authentic look, 8 seconds. Ambient noise: quiet cafe room tone, a faint cup clink. No background music.

Veo / Image-to-video (I2V)

Multi-reference fashion scene (dress + person + accessory pattern)

Builder

Prompt

The video opens with a medium, eye-level shot of a beautiful woman with dark hair and warm brown eyes. She wears a magnificent, high-fashion flamingo dress with layers of pink and fuchsia feathers, complemented by whimsical pink, heart-shaped sunglasses. She walks with serene confidence through the crystal-clear, shallow turquoise water of a sun-drenched lagoon. The camera slowly pulls back to a medium-wide shot, revealing the breathtaking scene as the dress's long train glides and floats gracefully on the water's surface behind her. The cinematic, dreamlike atmosphere is enhanced by the vibrant colors of the dress against the serene, minimalist landscape, capturing a moment of pure elegance and high-fashion fantasy.

Veo / Image-to-video (I2V)

Extend an existing clip with a new action (Google example)

Why it works Because the source is a video (not a still), Veo already has the motion, camera style, and pacing established — the prompt only needs to specify the next beat ("slowly descending"), which is why a four-word instruction is enough to produce a matched continuation instead of a jarring cut.

Prompt

Extend this video with the paraglider slowly descending.

Veo / Image-to-video (I2V)

Origami subject animated then tracked into a new beat (Google example)

Why it works Naming the material and style explicitly in the action ("origami butterfly... flaps") rather than describing a photoreal butterfly is what keeps the animated result consistent with the source image's craft-paper look instead of drifting toward realism.

Prompt

An origami butterfly flaps its wings and flies out of the french doors into the garden.

Veo / Image-to-video (I2V)

Continue an animated sequence with a new subject entering (Google example)

Why it works Starting the extension prompt with the existing camera instruction ("track the butterfly") anchors continuity with the prior clip before introducing the new subject, which is why the puppy's entrance reads as the next beat of the same scene rather than a jump cut.

Prompt

Track the butterfly into the garden as it lands on an orange origami flower. A fluffy white puppy runs up and gently pats the flower.

Veo / Image-to-video (I2V)

Animate a landscape photo into atmospheric drone motion (Google example)

Why it works Isolating exactly one moving element (the water) while naming the camera move explicitly gives Veo a focused animation target instead of trying to animate the entire frame, which is why the rest of the rainforest stays stable and photoreal.

Prompt

Tracking drone view of majestic Hawaiian waterfall within lush rainforest with realistic water flow.

Veo / Image-to-video (I2V)

Reference-image surreal macro world (Google example)

Prompt

A surreal, cinematic macro video. Tiny surfers ride perpetual, rolling waves inside a stone bathroom sink. A running vintage brass faucet generates the endless surf. The camera slowly pans across the whimsical, sunlit scene as the miniature figures expertly carve the turquoise water.

Veo / Image-to-video (I2V)

Animate a stylized creature render (3D/cartoon subject)

Why it works Repeating the style descriptor ("3D animated") alongside the action keeps Veo anchored to the source image's rendering style through the motion, which is the main failure mode for I2V on non-photoreal images — style drift is more common than subject drift on stylized inputs.

Prompt

3D animated scene: cute snow leopard-like creature prances through whimsical winter forest with falling snowflakes.

Veo / Image-to-video (I2V)

Ingredients-to-video character costume (Google example)

Prompt

Create a silly cartoon version of the fish wearing the costume, swimming and waving the wand around.

Veo / Image-to-video (I2V)

First/last-frame cliff transition (Google example)

Prompt

Show what happens when the car takes off from a cliff.

FAQ

How is an image-to-video prompt different from a text-to-video prompt?

With image-to-video, the picture already defines the subject, setting, composition, and style. Your prompt only needs to describe what moves, how the camera behaves, and what audio plays. Re-describing the scene wastes the prompt and can fight the image.

Why does my image drift or morph into something else?

Usually too much motion or a prompt that re-describes the scene. Keep actions minimal and weighted, do not restate what the image shows, and add "keep the subject consistent" for faces and products.

Can I use a first and last frame in Veo?

Yes — Veo 3.1 supports start-and-end-frame interpolation. Provide both images and describe the transition between them rather than each frame. It gives the most control over how a clip begins and ends.

Does image-to-video generate audio?

Veo 3.1 image-to-video supports native audio. Note that the separate add/remove-object image-edit path runs on Veo 2 and does not generate audio, so check which feature you are using.

How do I keep a character consistent across shots?

Use the reference-image ("ingredients") feature: supply images of the character, object, or style and Veo carries that look across multiple clips, now with audio.

Veo Image-to-Video Prompts

The image-to-video mindset

How to write Veo I2V prompts

Got your image? Run it on a Veo-capable tool

Related Veo prompt guides

Copy a format, check the evidence, then customize it.

FAQ