muapi/sd-2-first-last-frame-fast

Image to Video

SD 2 First & Last Frame (Fast) by ByteDance. Quickly generate video that transitions between reference images at reduced cost. Provide 1 or 2 images.

Related Models

sd-2-omni-reference-no-video

SD 2 Omni Reference by ByteDance. Generate videos using up to 9 image references and up to 3 audio references. Reference images in your prompt with @image1, @image2, etc. and audio with @audio1, @audio2, etc.

Image to Video

sd-2-vip-text-to-video

SD 2 Text-to-Video VIP (Pro) by ByteDance. Generates high-quality cinematic video from a text prompt with priority routing, native audio-visual sync, up to 2K resolution, and 4–15 second duration.

Text to Video

sd-2-image-to-video-fast

SD 2 Image-to-Video (Fast) by ByteDance. Quickly animates a start-frame image into video with 4–15 second duration at reduced cost.

Image to Video

sd-2-first-last-frame

SD 2 First & Last Frame (Pro) by ByteDance. Generate video that transitions between two reference images. Provide 1 image for start-frame-only, or 2 images for both start and end frames.

Image to Video

sd-2-vip-image-to-video-fast

SD 2 Image-to-Video VIP Fast by ByteDance. Faster animation of a start-frame image with priority routing, 4–15 second duration, and 2K resolution.

Image to Video

sd-2-vip-first-last-frame-1080p

SD 2 First & Last Frame VIP 1080p by ByteDance. Generate 1080p video that transitions between two reference images with priority routing. Provide 1 image for start-frame-only, or 2 images for both start and end frames.

Image to Video

sd-2-vip-image-to-video-1080p

SD 2 Image-to-Video VIP 1080p by ByteDance. Animates a still image into a cinematic 1080p video with priority routing and 4–15 second duration.

Image to Video

sd-2-omni-reference-no-video-fast

SD 2 Omni Reference (Fast) by ByteDance. Quickly generate videos using up to 9 image references and up to 3 audio references at reduced cost. Reference images in your prompt with @image1, @image2, etc. and audio with @audio1, @audio2, etc.

Image to Video

sd-2-vip-omni-reference-fast

SD 2 Omni Reference VIP Fast by ByteDance. Faster video generation using up to 9 image references, up to 3 video clips, and up to 3 audio references with priority routing. Reference materials in your prompt with @image1…@image9, @video1…@video3, and @audio1…@audio3.

Image to Video

sd-2-vip-text-to-video-1080p

SD 2 Text-to-Video VIP 1080p by ByteDance. Generates cinematic 1080p video from a text prompt with priority routing, native audio-visual sync, and 4–15 second duration.

Text to Video

sd-2-text-to-video

SD 2 Text-to-Video (Pro) by ByteDance. Generates high-quality cinematic video from a text prompt with native audio-visual sync, up to 2K resolution, and 4–15 second duration.

Text to Video

sd-2-image-to-video

SD 2 Image-to-Video (Pro) by ByteDance. Animates a start-frame image into a high-quality video with native audio, 4–15 second duration, and 2K resolution.

Image to Video

sd-2-vip-image-to-video

SD 2 Image-to-Video VIP (Pro) by ByteDance. Animates a start-frame image into a high-quality video with priority routing, native audio, 4–15 second duration, and 2K resolution.

Image to Video

sd-2-text-to-video-fast

SD 2 Text-to-Video (Fast) by ByteDance. Generates video from text at faster speeds with 4–15 second duration and 2K resolution.

Text to Video

sd-2-vip-first-last-frame

SD 2 First & Last Frame VIP (Pro) by ByteDance. Generate video that transitions between two reference images with priority routing. Provide 1 image for start-frame-only, or 2 images for both start and end frames.

Image to Video

sd-2-vip-first-last-frame-fast

SD 2 First & Last Frame VIP Fast by ByteDance. Faster generation of video transitions between two reference images with priority routing.

Image to Video

sd-2-vip-text-to-video-fast

SD 2 Text-to-Video VIP Fast by ByteDance. Faster generation from a text prompt with priority routing, 4–15 second duration, and 2K resolution.

Text to Video

sd-2-vip-omni-reference

SD 2 Omni Reference VIP (Pro) by ByteDance. Generate videos using up to 9 image references, up to 3 video clips, and up to 3 audio references with priority routing. Reference materials in your prompt with @image1…@image9, @video1…@video3, and @audio1…@audio3. Also supports @omni-character:<char_id> for trained characters.

Image to Video

sd-2-vip-omni-reference-1080p

SD 2 Omni Reference VIP 1080p by ByteDance. Generate full HD videos using up to 9 image references, up to 3 video clips, and up to 3 audio references with priority routing. Reference materials in your prompt with @image1…@image9, @video1…@video3, and @audio1…@audio3.

Image to Video

Overview

About this model

SD 2 First & Last Frame (Fast) quickly generates video that transitions between reference images at lower cost. Provide one image to anchor the opening frame, or two images to anchor both the opening and closing frames. Output duration is 4–15 seconds.

1. Quick Transitions: Rapidly preview frame-to-frame transitions before committing to Pro quality.
2. High Volume: Generate many first-last-frame videos at scale.
3. Concept Testing: Test visual transitions inexpensively.

Pricing & Value

Cost analysis

muapiapp: $0.15 per second (e.g. $0.75 for 5 seconds)

Pay per second of video generated.

Fal.ai: $0.3024/sec (high) / $0.2419/sec (basic)

Fal.ai charges $0.3024/sec for high quality and $0.2419/sec for basic. muapiapp Fast at $0.15/sec is about 38% cheaper than Fal.ai's basic rate and about 50% cheaper than its high rate.

Replicate: $0.3024/sec (high) / $0.2419/sec (basic)

Replicate charges the same as Fal.ai: $0.3024/sec (high), $0.2419/sec (basic). muapiapp Fast at $0.15/sec is therefore about 38% cheaper than Replicate's basic rate and about 50% cheaper than its high rate.

* Competitor pricing is estimated based on similar model architectures and usage tiers.
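
The per-second pricing and the percentage comparisons can be checked with a few lines of arithmetic. The sketch below uses only the rates listed in this section; the function names are illustrative and are not part of any muapi client library.

```python
# Rates in USD per second of generated video, as listed in this section.
MUAPI_FAST_RATE = 0.15
COMPETITOR_BASIC_RATE = 0.2419
COMPETITOR_HIGH_RATE = 0.3024


def estimate_cost(duration_s: int, rate_per_s: float = MUAPI_FAST_RATE) -> float:
    """Return the estimated cost in USD for a clip of the given length."""
    return round(rate_per_s * duration_s, 2)


def savings_percent(our_rate: float, their_rate: float) -> float:
    """Return how much cheaper our_rate is than their_rate, in percent."""
    return round((their_rate - our_rate) / their_rate * 100, 1)


if __name__ == "__main__":
    print(estimate_cost(5))  # 0.75, matching the example above
    print(savings_percent(MUAPI_FAST_RATE, COMPETITOR_BASIC_RATE))
    print(savings_percent(MUAPI_FAST_RATE, COMPETITOR_HIGH_RATE))
```

Because the competitor figures are estimates, the resulting percentages are estimates as well.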

Technical Details

Configuration schema

Prompt (string)

Text description guiding the transition between frames.

Default Value: Two people having a street interview, the interviewer holds a microphone.

Frame Images (array)

1 image = first frame only; 2 images = first and last frame. Use 'adaptive' aspect ratio to match the reference image geometry.

Default Value: https://d3adwkbyhxyrtq.cloudfront.net/ai-images/186/712345784292/4a8c5c70-abcc-4920-873e-b0e219986453.jpg

Aspect Ratio (Enum, 7 options)

Output video aspect ratio. 'adaptive' matches the reference image (recommended); concrete ratios may crop or pad.

Default Value: adaptive

Duration (seconds) (int)

Video duration in seconds.

Default Value: 5
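
A request body for this schema might be assembled and validated as follows. This is a minimal sketch, not the official client: `images_list` is the field name used in the implementation guide, while the keys `prompt`, `aspect_ratio`, and `duration` are assumed spellings of the Prompt, Aspect Ratio, and Duration parameters and should be checked against the API reference. The 4 to 15 second bounds come from the model description.

```python
from typing import Any

# Duration limits stated for this model (seconds).
MIN_DURATION = 4
MAX_DURATION = 15


def build_payload(
    prompt: str,
    images_list: list[str],
    aspect_ratio: str = "adaptive",
    duration: int = 5,
) -> dict[str, Any]:
    """Validate inputs and return a request body as a plain dict.

    Key names other than `images_list` are assumptions.
    """
    if not 1 <= len(images_list) <= 2:
        raise ValueError("images_list must contain 1 or 2 images")
    if not MIN_DURATION <= duration <= MAX_DURATION:
        raise ValueError(
            f"duration must be between {MIN_DURATION} and {MAX_DURATION} seconds"
        )
    return {
        "prompt": prompt,
        "images_list": list(images_list),  # first = opening frame, second = closing frame
        "aspect_ratio": aspect_ratio,
        "duration": duration,
    }
```

Validating locally avoids paying for a round trip on requests that would be rejected anyway.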

Implementation Guide

Developer documentation

How to Use SD 2 First & Last Frame (Fast)

  1. Upload 1 or 2 images: Provide images via images_list. First image = opening frame, second image (optional) = closing frame.

  2. Write your prompt: Describe the transition or motion between frames.

  3. Set duration: Between 4 and 15 seconds.

  4. Submit and poll: Use the returned request_id to poll for results.
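
The submit-and-poll step can be sketched as a small loop around the returned request_id. The example below deliberately leaves out the HTTP layer: `fetch_status` is a hypothetical callable that wraps whatever status request the API exposes, and the `"status"` field with the values `"completed"` and `"failed"` is an assumed response shape, not a documented one.

```python
import time
from typing import Any, Callable


def poll_for_result(
    fetch_status: Callable[[str], dict[str, Any]],
    request_id: str,
    interval_s: float = 2.0,
    timeout_s: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict[str, Any]:
    """Call fetch_status(request_id) until the job finishes or the timeout passes.

    The "completed" and "failed" status strings are assumptions about the
    response format; adjust them to the values the API actually returns.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        result = fetch_status(request_id)
        status = result.get("status")
        if status == "completed":
            return result
        if status == "failed":
            raise RuntimeError(f"request {request_id} failed: {result}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"request {request_id} not finished after {timeout_s}s")
        sleep(interval_s)
```

Passing the status call in as a function keeps the loop independent of any particular HTTP library and makes it easy to exercise with a stub.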

Common Questions

Frequently asked

Do I need two images?

No. One image anchors just the first frame. Two images constrain both the start and end of the video.

How does Fast compare to Pro?

The Fast model is quicker and cheaper with a slight reduction in output quality compared to the Pro variant.

What image formats are supported?

JPEG, PNG, WebP, and BMP are supported.
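
A quick pre-flight check on file extensions can catch unsupported formats before upload. This is a heuristic sketch only: it inspects the file name or URL path, not the file contents, and the mapping of extensions to the four listed formats is the conventional one rather than something the service documents.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Conventional extensions for JPEG, PNG, WebP, and BMP.
SUPPORTED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".webp", ".bmp"}


def has_supported_extension(path_or_url: str) -> bool:
    """Return True if the file name or URL path ends in a supported extension."""
    path = urlparse(path_or_url).path  # drops any query string or fragment
    return PurePosixPath(path).suffix.lower() in SUPPORTED_EXTENSIONS
```

For example, `has_supported_extension("frame.PNG")` passes the check, while a `.gif` file does not.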