Explore/muapi.ai/grok-imagine-video-1-5-preview

muapi/grok-imagine-video-1-5-preview

Image to Video

Generate videos from images using the Grok Imagine Video 1.5 Preview model with support for multiple aspect ratios, resolutions, and durations up to 15 seconds.

Result

Price varies by resolution and duration

Resolution	Duration	Cost
480p	5s	$0.40
480p	8s	$0.64
480p	10s	$0.80
720p	5s	$0.70
720p	8s	$1.12
720p	10s	$1.40

🚀Related Models

View all

wan2.2-image-to-video

Wan 2.2’s I2V mode brings static visuals to life with vivid, expressive animations. It interprets motion, emotion, and background dynamics from a single image to generate smooth and cinematic short videos.

Image to Video

veo3.1-fast-image-to-video

Veo 3.1 Fast is an optimized version of Google’s Veo 3.1 AI that transforms static images into dynamic 8-second videos at higher speed. It preserves visual fidelity while enabling rapid generation, making it ideal for social media clips, storyboards, and quick creative previews.

Image to Video

gemini-omni-image-to-video

Gemini Omni Image to Video — animate one or more reference images with a text prompt. Unified reasoning across modalities preserves subject identity and generates synchronized audio natively.

Image to Video

kling-v2.5-turbo-std-i2v

Kling 2.5 Turbo Std: Top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.

Image to Video

minimax-hailuo-2.3-pro-i2v

Hailuo 2.3 Pro I2V breathes life into still images with stunning motion synthesis and cinematic camera control. Using deep motion understanding, it predicts realistic subject movement, depth, and environmental motion from a single input frame — delivering smooth, film-grade clips.

Image to Video

kling-v3.0-standard-image-to-video

Kling 3.0 Standard Image-to-Video animates a single input image into a short, realistic video with smooth, stable motion. It prioritizes temporal consistency, natural physics, and subtle camera movement, making it ideal for everyday scenes, travel moments, people, vehicles, and calm cinematic shots.

Image to Video

kling-v2.1-standard-i2v

Kling 2.1 Standard (developed by Kuaishou) brings static images to life by generating smooth, realistic video clips from a single frame. It captures subtle motion, background dynamics, and camera movement to produce professional-looking animations — ideal for portraits, digital art, and cinematic illustrations.

Image to Video

seedance-2-image-to-video-fast

SD 2 Image-to-Video (Fast) by ByteDance. Quickly animates a start-frame image into video with 4–15 second duration at reduced cost.

Image to Video

seedance-2-omni-reference-no-video

SD 2 Omni Reference by ByteDance. Generate videos using up to 9 image references and up to 3 audio references. Reference images in your prompt with @image1, @image2, etc. and audio with @audio1, @audio2, etc.

Image to Video

seedance-2-i2v-480p

SD 2.0 480p image-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Image to Video

seedance-2-vip-image-to-video-fast

SD 2 Image-to-Video VIP Fast by ByteDance. Faster animation of a start-frame image with priority routing, 4–15 second duration, and 2K resolution.

Image to Video

veo3-image-to-video

VEO3 I2V animates static images into expressive video sequences, adding lifelike movement while preserving the original composition.

Image to Video

📝

Overview

About this model

Grok Imagine Video 1.5 Preview is an advanced image-to-video generation model that transforms your static images into fluid, high-quality videos. Supporting durations from 1 to 15 seconds with resolutions up to 720p, it excels at producing cinematic animations across a wide range of aspect ratios. Whether you need portrait, landscape, or square formats, this model delivers smooth motion and consistent visual quality from a single reference image.

1Social Media: Create engaging short video clips from product or lifestyle photos for Instagram, TikTok, and YouTube.

2Marketing: Animate brand imagery and promotional visuals into dynamic video content.

3Creative Projects: Transform artwork, illustrations, and photos into animated video sequences.

4Content Production: Generate B-roll footage from still images for video productions.

💰

Pricing & Value

Cost analysis

Provider	Cost	Notes
muapiapp	$0.08/sec at 480p, $0.14/sec at 720p (default 8s = $0.64)	Pay-as-you-go with no subscription required. Credits deducted per generation.
Fal.ai	Not available	This model is not available on Fal.ai.
Replicate	Not available	This model is not available on Replicate.

muapiapp$0.08/sec at 480p, $0.14/sec at 720p (default 8s = $0.64)

Pay-as-you-go with no subscription required. Credits deducted per generation.

Fal.aiNot available

This model is not available on Fal.ai.

ReplicateNot available

This model is not available on Replicate.

* Competitor pricing is estimated based on similar model architectures and usage tiers.

⚙️

Technical Details

Configuration schema

Parameter	Type	Description	Default
Prompt	string	Text description for video generation.	`The whale suddenly begins swimming through the apartment as if the room is underwater. Furniture crashes into walls, water bursts outward, and the whale breaks through multiple rooms while the camera follows beside it.`
Image URLs	array	Upload or provide image URLs to use as input for video generation.	`https://cdn.muapi.ai/assets/grok-imagine-video-1-5-preview.jpg`
Aspect Ratio	Enum (8 options)	Aspect ratio for the generated video. Use 'auto' to match the input image.	`auto`
Resolution	Enum (2 options)	Output video resolution.	`480p`
Duration (seconds)	int	Video duration in seconds.	`8`

Promptstring

Text description for video generation.

Default Value

The whale suddenly begins swimming through the apartment as if the room is underwater. Furniture crashes into walls, water bursts outward, and the whale breaks through multiple rooms while the camera follows beside it.

Image URLsarray

Upload or provide image URLs to use as input for video generation.

Default Valuehttps://cdn.muapi.ai/assets/grok-imagine-video-1-5-preview.jpg

Aspect RatioEnum (8 options)

Aspect ratio for the generated video. Use 'auto' to match the input image.

Default Valueauto

ResolutionEnum (2 options)

Output video resolution.

Default Value480p

Duration (seconds)int

Video duration in seconds.

Default Value8

📖

Implementation Guide

Developer documentation

How to Use Grok Imagine Video 1.5 Preview

Upload your image: Provide one or more image URLs via the images_list field. Supported formats: JPEG, PNG, WebP (max 20MB each).
Write a prompt (optional): Describe the motion or scene you want — e.g., "A person walking through a neon-lit city, camera slowly panning right."
Set duration: Choose between 1 and 15 seconds. Default is 8 seconds.
Choose aspect ratio: Select auto to match your input image dimensions, or specify 16:9, 9:16, 1:1, 4:3, 3:4, 3:2, or 2:3.
Select resolution: Choose 480p for faster generation or 720p for higher quality output.
Submit and poll: The API returns a request_id immediately. Poll GET /api/v1/predictions/{request_id}/result until status is completed.

❓

Common Questions

Frequently asked

Is a prompt required?

No, the prompt is optional. You can submit just an image and the model will animate it based on the visual content. Adding a descriptive prompt gives you more control over the motion and scene.

How many images can I provide?

You can provide multiple image URLs in the `images_list` field. The model uses them as reference frames for the video generation.

What is the difference between 480p and 720p?

480p generates faster and costs less per second. 720p produces sharper, higher-resolution output and is recommended for final deliverables or content where visual quality is critical.

What aspect ratios are supported?

The model supports auto, 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, and 2:3. Use 'auto' to preserve your input image's original proportions.

ai-product-photography

wan2.2-image-to-video

facebook-publish

hunyuan-text-to-video

runway-aleph-v2v

flux-dev-lora

happy-horse-1.1-text-to-video-1080p

pixverse-v4.5-t2v

hidream-i1-full

creatify-lipsync

flux-kontext-pro-i2i

kling-v1-avatar-standard

heygen-video-translate

wan2.2-animate

ai-image-extension

openai-sora-2-text-to-video

ai-video-upscaler-pro

ai-object-eraser

veed-lipsync

veo3.1-fast-image-to-video

veo3.1-fast-text-to-video

ai-dance-effects

image-effects

gemini-omni-image-to-video

veo3-fast-text-to-video

ltx-2-fast-text-to-video

kling-v2.5-turbo-std-i2v

minimax-hailuo-2.3-pro-i2v

minimax-hailuo-2.3-pro-t2v

wan2.1-text-to-image

reve-image-edit

grok-imagine-text-to-video

nano-banana-pro-edit

qwen-image-edit-plus-lora

ai-image-face-swap

google-imagen4-fast

sdxl-lora

infinitetalk-image-to-video

wan2.2-edit-video

ltx-2-pro-text-to-video

mmaudio-v2-text-to-audio

kling-v2-avatar-pro

flux-2-flex

flux-2-pro-edit

ai-product-shot

seedance-v1.5-pro-t2v

bytedance-seededit-v3

add-video-watermark

ai-skin-enhancer

seedance-v1.5-pro-t2v-fast

qwen-image-edit-2511

qwen-text-to-image-2512

kling-v2.1-standard-i2v

kling-v3.0-standard-image-to-video

kling-v3.0-std-motion-control

suno-add-vocals

seedance-2-video-watermark-remover-pro

ai-background-remover

latent-sync

claude-opus-4-6

flux-kontext-dev-i2i

seedance-2-image-to-video-fast

pixverse-v5.5-t2v

wan2.7-video-edit

seedance-2-omni-reference-no-video

seedance-2-i2v-480p

suno-remix-music

seedance-2-vip-image-to-video-fast

happy-horse-1-text-to-video-1080p

veo3-image-to-video

flux-schnell

happy-horse-1-text-to-video-720p

kling-v2.1-pro-i2v

seedance-2-vip-image-to-video-1080p

seedance-2-vip-first-last-frame-1080p

kling-v3.0-4k-image-to-video

gemini-2-5-pro

wan2.2-text-to-video

vidu-v2.0-i2v

vidu-q3-turbo-text-to-video