MuAPI AI Models API

Custom workflows and utility models optimized for performance, cost, and high scalability across image and video tasks.

MuAPI's custom utility and workflow models provide optimized performance and routing. MuAPI presents custom models next to other providers so teams can compare capability, pricing and model fit before choosing an integration path.

MuAPI AI models on MuAPI
Back to Providers
Explore/MuAPI Models
MuAPI

MuAPI AI API models

MuAPI AI models on MuAPI

Explore MuAPI models for chat, code, image and video generation, including Gemini, Nano Banana and Veo-style workflows available through MuAPI.

All models

75 Models

Video Generation Models

Video

$0.040 / second

creatify-lipsync

Realistic lipsync video - optimized for speed, quality, and consistency.

Video

$0.250 / second

heygen-video-translate

Convert any video into 175+ languages with synchronized voice translation, AI-voice cloning, and accurate lip sync. Just upload your video (or provide a link), select a target language, and HeyGen recreates the speech in that language. 0.05$ per second.

Video

$0.240 / second

ai-video-upscaler-pro

The AI Video Upscaler is a powerful tool designed to enhance the resolution and quality of videos. Whether you're working with low-resolution videos that need a boost or aiming to improve the clarity of existing footage, this upscaler leverages advanced machine learning models to deliver high-quality, upscaled videos.

Video

$0.040 / second

veed-lipsync

Generate realistic lipsync from any audio using VEED's latest model

Video

$0.300 / second

ai-dance-effects

Bring your characters and worlds to life with AI Dance Effects — a creative video effect that adds playful, dynamic, and cinematic motion to your generations. AI Dance Effects lets you guide how characters move, react, and express themselves.

Video

$0.200 / second

infinitetalk-image-to-video

InfiniteTalk Image-to-Video brings still portraits and character photos to life by generating natural, realistic talking videos. You provide a single face image and a dialogue script, and the model animates lip movement, facial expressions, and subtle head gestures to match the speech.

Video

$0.000 / second

add-video-watermark

Add custom watermark to videos with adjustable position, opacity, and size. Free local processing using FFmpeg.

Video

$0.040 / second

latent-sync

LatentSync is a video-to-video model that generates lip sync animations from audio using advanced algorithms for high-quality synchronization.

Video

$0.560 / second

kling-v3-turbo-standard-image-to-video

Generate fast, high-quality videos from a single image using Kling v3 Turbo Standard (720p). Supports durations from 3 to 15 seconds.

Video

$0.560 / second

kling-v3-turbo-standard-text-to-video

Generate fast, high-quality videos from text prompts using Kling v3 Turbo Standard (720p). Supports durations from 3 to 15 seconds and multiple aspect ratios.

Video

$0.700 / second

kling-v3-turbo-pro-image-to-video

Generate fast, high-quality videos from a single image using Kling v3 Turbo Pro (1080p). Supports durations from 3 to 15 seconds.

Video

$0.200 / second

ovi-text-to-video

Ovi is a unified model that generates synchronized video and audio from textual input. You write a scene description, including dialogue and ambient sounds, and Ovi produces a short video clip (typically ~5 seconds) where visuals and sound align naturally. Videos are generated in 540p resolution.

Video

$0.200 / second

infinitetalk-video-to-video

InfiniteTalk Video-to-Video enhances or transforms existing videos by syncing the subject’s lip movements and facial expressions with new dialogue or speech. Instead of starting from a still image, you provide a video clip, and the model seamlessly reanimates the speaker’s mouth and expressions to match the script.

Video

$0.200 / second

ovi-image-to-video

Ovi is a unified audio–video generation model that can transform a static image plus a descriptive prompt into a short video with synchronized audio. It supports both text-to-video and image-conditioned video inputs. With built-in lip sync, background audio / sound effects, and dialogue support, Ovi brings still visuals to life in cinematic fashion. Videos are generated in 540p resolution.

Video

$0.000 / second

ai-captions

Add AI-generated animated captions to any video using Vadoo's caption engine. Supports multiple languages and viral caption themes like Hormozi style. Perfect for social media creators, marketers, and content producers.

Video

$0.700 / second

kling-v3-turbo-pro-text-to-video

Generate fast, high-quality videos from text prompts using Kling v3 Turbo Pro (1080p). Supports durations from 3 to 15 seconds and multiple aspect ratios.

motion-graphics
Video

$0.630 / second

motion-graphics

Generate animated motion graphics videos from a text prompt using AI-generated React/Remotion code rendered on Modal.

mmaudio-v2-video-to-video
Video

$0.010 / second

mmaudio-v2-video-to-video

MMAudio-v2 generates high-quality, synchronized audio from video or text inputs. Seamlessly integrate it with AI video models to create fully-voiced, expressive video content.

Video

$0.040 / second

sync-lipsync

Generate realistic lipsync animations from audio using advanced algorithms for high-quality synchronization.

ai-video-face-swap
Video

$0.100 / second

ai-video-face-swap

Replace faces in videos with stunning realism. Our AI ensures accurate expression transfer, lighting consistency, and smooth frame-by-frame blending.

Video

$0.300 / second

video-effects

AI Video Effects applies advanced visual transformations, color grading, and cinematic filters to create stunning videos from images.

Video

$0.030 / second

ai-video-upscaler

The AI Video Upscaler is a powerful tool designed to enhance the resolution and quality of videos. Whether you're working with low-resolution videos that need a boost or aiming to improve the clarity of existing footage, this upscaler leverages advanced machine learning models to deliver high-quality, upscaled videos.

Video

$0.065 / second

video-watermark-remover

The AI Video Watermark Remover is our flagship model designed to remove Sora 2 watermarks, logos, captions, and unwanted text from videos without compromising quality. Supporting a wide range of formats, it's fast, efficient, and processes with the highest quality.

remix-video
Video

$0.025 / second

remix-video

Transform and resize your videos effortlessly with remix video tool.

Video

$0.500 / second

ai-clipping

Convert long-form videos into engaging short clips using AI clipping.

Video

$0.050 / second

video-combiner

Combine multiple short video clips (5s, 10s, etc.) into a single seamless full-length video. Upload your clips in order and choose the final output aspect ratio. 'Auto' preserves the aspect ratio of your first clip.

Video

$0.050 / second

autocrop

Automatically crop and reframe a specific video segment to your chosen aspect ratio using AI subject tracking.

Video

$0.630 / second

motion-graphics-edit

Edit and modify a previously generated motion graphics animation using a text instruction.

Video

$0.300 / second

ai-video-effects

AI Video Effects applies advanced visual transformations, color grading, and cinematic filters to create stunning videos from images.

Video

$0.300 / second

motion-controls

Motion Controls adds dynamic camera movements, speed ramps, and zoom effects to bring your images to life as smooth, engaging videos.

Video

$0.300 / second

vfx

VFX delivers high-impact visual effects like explosions, particles, and cinematic overlays to transform static images into action-packed videos.

Image Generation Models

ai-product-photography
Image

$0.050 / generation

ai-product-photography

Create professional-grade product photos using AI. Upload your item image and describe it with a prompt, and get studio-style, lifestyle, or creative backgrounds in seconds

ai-image-extension
Image

$0.030 / generation

ai-image-extension

Expand the edges of any image with AI. This model continues your original photo or artwork beyond its borders while matching style, lighting, and content.

ai-object-eraser
Image

$0.050 / generation

ai-object-eraser

Easily remove unwanted objects, people, or text from any image using AI. Just select the area you want to erase, and the model will intelligently fill the space with realistic background matching the surrounding environment. No Photoshop skills needed.

image-effects
Image

$0.030 / generation

image-effects

AI Image Effects applies advanced visual transformations, color grading, and cinematic filters to create stunning images from a image.

ai-image-face-swap
Image

$0.020 / generation

ai-image-face-swap

Advanced facial recognition and blending algorithms enable precise face swaps while preserving skin tone, lighting, and facial geometry.

sdxl-lora
Image

$0.002 / generation

sdxl-lora

The SDXL LoRA image model enhances Stable Diffusion XL with specialized fine-tuning, letting you generate images in unique styles, characters, or themes. By applying LoRA weights, you can create visuals that match a specific aesthetic, celebrity look, anime style, or custom-trained subject.

ai-product-shot
Image

$0.060 / generation

ai-product-shot

Instantly generate studio-quality product images with AI. Upload your item photo and get clean, stylized shots perfect for e-commerce, ads, and catalogs.

ai-skin-enhancer
Image

$0.010 / generation

ai-skin-enhancer

Smooth skin, reduce blemishes, and enhance complexion with natural-looking results. Perfect for portraits, selfies, and professional photo retouching.

ai-background-remover
Image

$0.010 / generation

ai-background-remover

Instantly remove image backgrounds with pixel-perfect precision. Ideal for product photos, profile pictures, and creative projects.

ai-anime-generator
Image

$0.030 / 1K tokens

ai-anime-generator

Create stunning anime-style artwork instantly with our AI Anime Generator. Customize characters, scenes, and styles effortlessly in seconds!

perfect-pony-xl
Image

$0.020 / 1K tokens

perfect-pony-xl

Pony XL is a high-quality image generation model based on Stable Diffusion XL architecture. It specializes in character art, hybrid styles, and producing detailed, polished visuals even with simpler prompts.

add-image-watermark
Image

$0.000 / generation

add-image-watermark

Add custom watermark to images with adjustable position, opacity, and size. Free local processing using PIL.

tiktok-carousel
Image

$0.028 / 1K tokens

tiktok-carousel

AI TikTok Carousel Generator — create viral TikTok carousel posts from a single text prompt. Choose a proven storytelling format (Problem-Solution, Listicle, Tutorial, Before & After), set your slide count (3-10), and get stunning AI-generated images at 1080x1920 portrait resolution, ready to upload to TikTok.

portrait-stylist
Image

$0.010 / generation

portrait-stylist

Professional AI portrait styles including hair, makeup, style, and fashion transformations.

ai-dress-change
Image

$0.100 / generation

ai-dress-change

Instantly change outfits in images using AI. Visualize different clothing styles without the need for physical trials—perfect for fashion, e-commerce, and virtual try-ons.

ai-color-photo
Image

$0.010 / generation

ai-color-photo

Automatically add lifelike colors to black-and-white images. Our AI brings history to life with natural tones, accurate shading, and context-aware colorization.

ai-ghibli-style
Image

$0.050 / generation

ai-ghibli-style

Bring your imagination to life with art inspired by the enchanting world of Studio Ghibli. This AI model generates dreamy, hand-drawn visuals with soft colors, whimsical characters, and painterly backgrounds

ai-image-upscaler
Image

$0.020 / generation

ai-image-upscaler

Transform blurry or pixelated images into high-definition visuals. Our AI Image Upscaler uses deep learning to reconstruct details and bring your visuals to life.

sdxl-image
Image

$0.004 / 1K tokens

sdxl-image

SDXL is a high-quality, large Stable Diffusion model for creating photorealistic and stylized images from text. It excels at fine detail, realistic lighting, and complex scenes.

neta-lumina
Image

$0.020 / 1K tokens

neta-lumina

Neta Lumina is a powerful anime-style text-to-image model developed by Neta.art Lab. It’s built on Lumina-Image-2.0, fine-tuned with over 13 million high-quality anime images. It offers strong understanding of multilingual prompts, excellent detail fidelity, support for Danbooru tags, and leaning into niche styles like furry, Guofeng, pets, scenic backgrounds, etc.

chroma-image
Image

$0.020 / 1K tokens

chroma-image

Croma Image is an advanced text-to-image generation model designed for high-quality, creative, and versatile visuals. It can produce anything from photorealistic portraits and products to imaginative concept art, fantasy illustrations, and cinematic scenes.

seedvr2-image-upscale
Image

$0.020 / generation

seedvr2-image-upscale

SeedVR2 is a one-step diffusion-transformer model designed for image restoration, super-resolution, deblurring, and artifact removal. It enhances low-quality or compressed images into clean, sharp, high-resolution results while preserving natural colors and fine details.

Other Utility Models

facebook-publish
Tools

$0.020 / generation

facebook-publish

Publish a video or image to a connected Facebook Page.

moderate-image
Tools

$0.010 / generation

moderate-image

Detect unsafe or policy-violating content in any image. Supply an image URL (and optional context text) and receive structured flags for adult, violent, hateful, or otherwise restricted content — ideal for pre-screening user uploads before they reach generation pipelines.

youtube-publish
Tools

$0.010 / generation

youtube-publish

Upload and publish a video to a connected YouTube account.

instagram-publish
Tools

$0.020 / generation

instagram-publish

Publish a video or image to a connected Instagram Business account.

twitter-fetch-posts
Tools

$0.010 / generation

twitter-fetch-posts

Fetch the latest posts and performance metrics for a Twitter/X user.

facebook-fetch-reels
Tools

$0.010 / generation

facebook-fetch-reels

Fetch the latest Reels metadata and metrics for a Facebook page.

linkedin-publish
Tools

$0.020 / generation

linkedin-publish

Publish a video or image to a connected LinkedIn profile or page.

pinterest-publish
Tools

$0.020 / generation

pinterest-publish

Publish a video or image Pin to a connected Pinterest board.

threads-publish
Tools

$0.020 / generation

threads-publish

Publish a video or image to a connected Threads account.

x-publish
Tools

$0.020 / generation

x-publish

Post a video or image to a connected X (Twitter) account.

tiktok-fetch-profile
Tools

$0.010 / generation

tiktok-fetch-profile

Retrieve profile details, stats, and metadata for a TikTok user.

tiktok-publish
Tools

$0.020 / generation

tiktok-publish

Upload and publish a video to a connected TikTok account.

tiktok-fetch-videos
Tools

$0.010 / generation

tiktok-fetch-videos

Fetch the latest video posts and view metrics for a TikTok creator.

instagram-fetch-reels
Tools

$0.010 / generation

instagram-fetch-reels

Fetch the latest Reels metadata and metrics for an Instagram creator.

youtube-fetch-shorts
Tools

$0.010 / generation

youtube-fetch-shorts

Fetch the latest Shorts from a YouTube channel by ID.

Tools

$0.010 / generation

youtube-download

Download videos from YouTube in your chosen resolution or audio format.

moderate-video
Tools

$0.020 / generation

moderate-video

Scan a video for unsafe or policy-violating content. Supports MP4, MOV, and WebM URLs and returns structured safety classifications across harassment, hate, sexual, sexual-minors, and violence categories — useful for moderating user-generated or AI-generated video before publishing.