Explore/muapi.ai/grok-imagine-video-1-5-preview

muapi/grok-imagine-video-1-5-preview

Image to Video

Generate videos from images using the Grok Imagine Video 1.5 Preview model with support for multiple aspect ratios, resolutions, and durations up to 15 seconds.

Input

Configure the model parameters below.

0/1 items
Drag & drop images here or paste file/image

Result

Price varies by resolution and duration

ResolutionDurationCost
480p5s$0.40
480p8s$0.64
480p10s$0.80
720p5s$0.70
720p8s$1.12
720p10s$1.40

🚀Related Models

View all
wan2.2-image-to-video

wan2.2-image-to-video

Wan 2.2’s I2V mode brings static visuals to life with vivid, expressive animations. It interprets motion, emotion, and background dynamics from a single image to generate smooth and cinematic short videos.

Image to Video
veo3.1-fast-image-to-video

veo3.1-fast-image-to-video

Veo 3.1 Fast is an optimized version of Google’s Veo 3.1 AI that transforms static images into dynamic 8-second videos at higher speed. It preserves visual fidelity while enabling rapid generation, making it ideal for social media clips, storyboards, and quick creative previews.

Image to Video
gemini-omni-image-to-video

gemini-omni-image-to-video

Gemini Omni Image to Video — animate one or more reference images with a text prompt. Unified reasoning across modalities preserves subject identity and generates synchronized audio natively.

Image to Video
kling-v2.5-turbo-std-i2v

kling-v2.5-turbo-std-i2v

Kling 2.5 Turbo Std: Top-tier image-to-video generation with unparalleled motion fluidity, cinematic visuals, and exceptional prompt precision.

Image to Video
minimax-hailuo-2.3-pro-i2v

minimax-hailuo-2.3-pro-i2v

Hailuo 2.3 Pro I2V breathes life into still images with stunning motion synthesis and cinematic camera control. Using deep motion understanding, it predicts realistic subject movement, depth, and environmental motion from a single input frame — delivering smooth, film-grade clips.

Image to Video
kling-v3.0-standard-image-to-video

kling-v3.0-standard-image-to-video

Kling 3.0 Standard Image-to-Video animates a single input image into a short, realistic video with smooth, stable motion. It prioritizes temporal consistency, natural physics, and subtle camera movement, making it ideal for everyday scenes, travel moments, people, vehicles, and calm cinematic shots.

Image to Video
kling-v2.1-standard-i2v

kling-v2.1-standard-i2v

Kling 2.1 Standard (developed by Kuaishou) brings static images to life by generating smooth, realistic video clips from a single frame. It captures subtle motion, background dynamics, and camera movement to produce professional-looking animations — ideal for portraits, digital art, and cinematic illustrations.

Image to Video
seedance-2-image-to-video-fast

seedance-2-image-to-video-fast

SD 2 Image-to-Video (Fast) by ByteDance. Quickly animates a start-frame image into video with 4–15 second duration at reduced cost.

Image to Video
seedance-2-omni-reference-no-video

seedance-2-omni-reference-no-video

SD 2 Omni Reference by ByteDance. Generate videos using up to 9 image references and up to 3 audio references. Reference images in your prompt with @image1, @image2, etc. and audio with @audio1, @audio2, etc.

Image to Video
seedance-2-i2v-480p

seedance-2-i2v-480p

SD 2.0 480p image-to-video generation. Faster and more cost-effective than the 720p variant, ideal for previews and drafts.

Image to Video
seedance-2-vip-image-to-video-fast

seedance-2-vip-image-to-video-fast

SD 2 Image-to-Video VIP Fast by ByteDance. Faster animation of a start-frame image with priority routing, 4–15 second duration, and 2K resolution.

Image to Video
veo3-image-to-video

veo3-image-to-video

VEO3 I2V animates static images into expressive video sequences, adding lifelike movement while preserving the original composition.

Image to Video
📝

Overview

About this model

Grok Imagine Video 1.5 Preview is an advanced image-to-video generation model that transforms your static images into fluid, high-quality videos. Supporting durations from 1 to 15 seconds with resolutions up to 720p, it excels at producing cinematic animations across a wide range of aspect ratios. Whether you need portrait, landscape, or square formats, this model delivers smooth motion and consistent visual quality from a single reference image.

1Social Media: Create engaging short video clips from product or lifestyle photos for Instagram, TikTok, and YouTube.
2Marketing: Animate brand imagery and promotional visuals into dynamic video content.
3Creative Projects: Transform artwork, illustrations, and photos into animated video sequences.
4Content Production: Generate B-roll footage from still images for video productions.
💰

Pricing & Value

Cost analysis

muapiapp$0.08/sec at 480p, $0.14/sec at 720p (default 8s = $0.64)

Pay-as-you-go with no subscription required. Credits deducted per generation.

Fal.aiNot available

This model is not available on Fal.ai.

ReplicateNot available

This model is not available on Replicate.

* Competitor pricing is estimated based on similar model architectures and usage tiers.

⚙️

Technical Details

Configuration schema

Promptstring

Text description for video generation.

Default ValueThe whale suddenly begins swimming through the apartment as if the room is underwater. Furniture crashes into walls, water bursts outward, and the whale breaks through multiple rooms while the camera follows beside it.
Image URLsarray

Upload or provide image URLs to use as input for video generation.

Default Valuehttps://cdn.muapi.ai/assets/grok-imagine-video-1-5-preview.jpg
Aspect RatioEnum (8 options)

Aspect ratio for the generated video. Use 'auto' to match the input image.

Default Valueauto
ResolutionEnum (2 options)

Output video resolution.

Default Value480p
Duration (seconds)int

Video duration in seconds.

Default Value8
📖

Implementation Guide

Developer documentation

How to Use Grok Imagine Video 1.5 Preview

  1. Upload your image: Provide one or more image URLs via the images_list field. Supported formats: JPEG, PNG, WebP (max 20MB each).

  2. Write a prompt (optional): Describe the motion or scene you want — e.g., "A person walking through a neon-lit city, camera slowly panning right."

  3. Set duration: Choose between 1 and 15 seconds. Default is 8 seconds.

  4. Choose aspect ratio: Select auto to match your input image dimensions, or specify 16:9, 9:16, 1:1, 4:3, 3:4, 3:2, or 2:3.

  5. Select resolution: Choose 480p for faster generation or 720p for higher quality output.

  6. Submit and poll: The API returns a request_id immediately. Poll GET /api/v1/predictions/{request_id}/result until status is completed.

Common Questions

Frequently asked

Is a prompt required?

No, the prompt is optional. You can submit just an image and the model will animate it based on the visual content. Adding a descriptive prompt gives you more control over the motion and scene.

How many images can I provide?

You can provide multiple image URLs in the `images_list` field. The model uses them as reference frames for the video generation.

What is the difference between 480p and 720p?

480p generates faster and costs less per second. 720p produces sharper, higher-resolution output and is recommended for final deliverables or content where visual quality is critical.

What aspect ratios are supported?

The model supports auto, 1:1, 16:9, 9:16, 4:3, 3:4, 3:2, and 2:3. Use 'auto' to preserve your input image's original proportions.