muapi/ai-captions

Video to Video | Powered by Vadoo AI

Add AI-generated animated captions to any video using Vadoo's caption engine. Supports multiple languages and viral caption themes like Hormozi style. Perfect for social media creators, marketers, and content producers.

📝

Overview

About this model

AI Captions by Vadoo automatically transcribes and burns animated captions into your videos — no editing software needed. It's the API-first alternative to Submagic, Captions.ai, and CapCut's auto-caption feature, letting you add viral-style captions programmatically at scale. Supports multiple languages and proven caption themes used by top content creators.

1. Social Media: Add Hormozi-style captions to short-form videos for TikTok, Instagram Reels, and YouTube Shorts to boost watch time and engagement.
2. Marketing: Automatically caption product demo videos and ads without manual editing, saving hours of post-production work.
3. Accessibility: Make video content accessible to deaf and hard-of-hearing audiences by burning in accurate, readable captions.
4. Multilingual Content: Caption videos in Spanish, French, German, Portuguese, and more to reach global audiences.
5. Batch Processing: Caption hundreds of videos programmatically via API, which is impractical with manual tools like Submagic or CapCut.
💰

Pricing & Value

Cost analysis

muapiapp: $0.005 per second (~$0.30/minute), minimum $0.20

Pay per use with no subscription. Minimum charge of $0.20 per video regardless of length.
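
The per-second rate and minimum charge above can be turned into a quick estimate. The following is a minimal Python sketch, not an official billing formula: the function name estimate_cost is hypothetical, and rounding to the nearest cent is an assumption, since this page does not say how fractional amounts are billed.

```python
from decimal import Decimal, ROUND_HALF_UP

RATE_PER_SECOND = Decimal("0.005")  # rate quoted on this page
MINIMUM_CHARGE = Decimal("0.20")    # per-video minimum quoted on this page


def estimate_cost(duration_seconds: float) -> Decimal:
    """Estimate the charge in USD for one video.

    Rounding to the cent is an assumption; actual billing
    granularity is not documented here.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    metered = RATE_PER_SECOND * Decimal(str(duration_seconds))
    charge = max(metered, MINIMUM_CHARGE)
    return charge.quantize(Decimal("0.01"), rounding=ROUND_HALF_UP)


if __name__ == "__main__":
    for seconds in (10, 40, 60, 600):
        print(seconds, "seconds:", estimate_cost(seconds))
```

Under these numbers, the minimum charge applies to any video of 40 seconds or less, because 40 seconds at $0.005 per second is exactly $0.20.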

Submagic: $20–$60/month subscription

Web UI only — no API access. Manual upload per video. Not suitable for automation.

Captions.ai: $13–$29/month subscription

Mobile/web app only. No programmatic API for batch captioning.

Fal.ai: Not available

Fal.ai does not offer an AI captions/subtitle burning endpoint.

Replicate: Not available

No dedicated auto-caption model on Replicate.

* Competitor pricing is estimated based on similar model architectures and usage tiers.

⚙️

Technical Details

Configuration schema

Video URL (string)

Public URL of the video to add captions to. Maximum 600MB or 10 minutes.

Default Value: https://d3adwkbyhxyrtq.cloudfront.net/webassets/videomodels/sample-video.mp4

Language (Enum, 45 options)

Language of the video audio for transcription.

Default Value: English

Caption Theme (Enum, 18 options)

Visual style/theme for the captions. E.g. Hormozi_1, Hormozi_2.

Default Value: Hormozi_1
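
As a rough illustration of the schema above, the Python sketch below assembles and validates a request body before it is sent. The JSON field names video_url, language, and caption_theme are assumptions, because this page lists only the display labels. It is also an assumption that the enum values are sent as the display strings used in this document. The theme and language lists are copied from this page.

```python
from urllib.parse import urlparse

# The 18 theme names listed on this page.
THEMES = (
    "Hormozi_1", "Hormozi_2", "Hormozi_3", "Beast", "Ali", "Noah",
    "Karl", "Luke", "Devin", "Celine", "Maya", "Ella", "Dan", "David",
    "Tracy", "Umi", "Iman", "William",
)

# The 45 language names listed on this page.
LANGUAGES = tuple(
    name.strip()
    for name in """
    English, English (USA), English (UK), English (Australia), English (Canada),
    Japanese, Chinese, German, Hindi, French, French (France), French (Canada),
    Korean, Portuguese (Brazil), Portuguese (Portugal), Portuguese,
    Spanish (Spain), Spanish (Mexico), Italian, Spanish, Indonesian, Dutch,
    Turkish, Filipino, Polish, Swedish, Bulgarian, Romanian,
    Arabic (Saudi Arabia), Arabic (UAE), Arabic, Czech, Greek, Finnish,
    Croatian, Malay, Slovak, Danish, Tamil, Telugu, Ukrainian, Russian,
    Hungarian, Norwegian, Vietnamese
    """.split(",")
)


def build_payload(video_url: str,
                  language: str = "English",
                  caption_theme: str = "Hormozi_1") -> dict:
    """Return a request body; the key names are hypothetical."""
    parsed = urlparse(video_url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError("video_url must be a public http(s) URL")
    if language not in LANGUAGES:
        raise ValueError(f"unsupported language: {language!r}")
    if caption_theme not in THEMES:
        raise ValueError(f"unsupported caption theme: {caption_theme!r}")
    return {
        "video_url": video_url,          # assumed field name
        "language": language,            # assumed field name
        "caption_theme": caption_theme,  # assumed field name
    }
```

Validating locally catches a misspelled theme or language before a request is submitted and billed.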
📖

Implementation Guide

Developer documentation

How to Add AI Captions to Your Video

  1. Upload or provide a video URL: Your video must be publicly accessible (max 600MB or 10 minutes). Use the /upload_file endpoint to get a hosted URL if needed.

  2. Choose a language: Set the language parameter to match the spoken language in the video. Defaults to English. Supported languages: English, English (USA), English (UK), English (Australia), English (Canada), Japanese, Chinese, German, Hindi, French, French (France), French (Canada), Korean, Portuguese (Brazil), Portuguese (Portugal), Portuguese, Spanish (Spain), Spanish (Mexico), Italian, Spanish, Indonesian, Dutch, Turkish, Filipino, Polish, Swedish, Bulgarian, Romanian, Arabic (Saudi Arabia), Arabic (UAE), Arabic, Czech, Greek, Finnish, Croatian, Malay, Slovak, Danish, Tamil, Telugu, Ukrainian, Russian, Hungarian, Norwegian, Vietnamese.

  3. Pick a caption theme: Select a theme to style your captions. Defaults to Hormozi_1. Available themes:

    • Hormozi_1 / Hormozi_2 / Hormozi_3 — Bold word-by-word highlight style popularised by Alex Hormozi
    • Beast — MrBeast-inspired dynamic captions
    • Ali — Sleek and minimal
    • Noah — Italics with heavy drop shadow
    • Karl — Sharp contrast with border outlines
    • Luke — Shaking/vibrating effect for energy
    • Devin — Rotating and scaling animations
    • Celine — Soft shadow, clean typography
    • Maya — Serif font with glowing effects
    • Ella — Scale and translate animations with blur
    • Dan — Large, impactful uppercase text
    • David — Bold uppercase with scaling highlights
    • Tracy — Minimalist with white glow shadow
    • Umi — Italic thin font, letter-by-letter reveal
    • Iman — Minimalist white text with black border
    • William — Left-right-center alternating animations
  4. Submit and receive a request ID: POST to /ai-captions — you'll get a request_id immediately.

  5. Poll for results: GET /predictions/{request_id}/result until status is completed. Videos under 1 minute typically finish in 2–3 minutes.

  6. Download the output: The result contains a video URL with captions burned in — ready to publish.
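
Steps 4 to 6 can be sketched in Python roughly as follows, using only the standard library. Several details are assumptions rather than documented behaviour: the base URL and authentication headers are not given on this page, so the caller must supply them; the payload keys are whatever the API actually expects; and the failure status names "failed" and "error" are guesses. Only the /ai-captions and /predictions/{request_id}/result paths, the request_id field, and the completed status come from the steps above.

```python
import json
import time
import urllib.request
from typing import Callable, Optional


def http_json(method: str, url: str, headers: dict,
              body: Optional[dict] = None) -> dict:
    """Send a JSON request and decode the JSON response."""
    data = json.dumps(body).encode("utf-8") if body is not None else None
    request = urllib.request.Request(
        url, data=data, method=method,
        headers={"Content-Type": "application/json", **headers},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))


def caption_video(base_url: str, headers: dict, payload: dict, *,
                  send: Callable[..., dict] = http_json,
                  sleep: Callable[[float], None] = time.sleep,
                  poll_interval: float = 10.0,
                  timeout: float = 900.0) -> dict:
    """Submit a captioning job and poll until it completes.

    base_url and headers are caller-supplied because this page does not
    state them. The 900-second default mirrors the upper processing
    estimate of 15 minutes given in the FAQ.
    """
    submitted = send("POST", f"{base_url}/ai-captions", headers, payload)
    request_id = submitted["request_id"]
    result_url = f"{base_url}/predictions/{request_id}/result"

    waited = 0.0
    while True:
        result = send("GET", result_url, headers)
        status = result.get("status")
        if status == "completed":
            return result
        if status in ("failed", "error"):  # assumed failure statuses
            raise RuntimeError(f"captioning failed: {result}")
        if waited >= timeout:
            raise TimeoutError(f"no result after {waited:.0f} seconds")
        sleep(poll_interval)
        waited += poll_interval
```

The send and sleep parameters exist so the polling logic can be exercised with a stub instead of real network calls. The shape of the completed result, including where the output video URL lives, is not specified here, so the function returns the whole response for the caller to inspect.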

Common Questions

Frequently asked

How is this different from Submagic or Captions.ai?

Submagic and Captions.ai are manual SaaS tools — you upload one video at a time through a web UI. muapiapp's AI Captions endpoint lets you caption videos via API, making it ideal for automating workflows, batch processing, and embedding into your own products.

What caption themes are available?

18 themes are supported: Hormozi_1, Hormozi_2, Hormozi_3 (bold word-highlight styles), Beast (MrBeast-inspired), Ali (sleek/minimal), Noah (italics + drop shadow), Karl (border outlines), Luke (shaking/vibrating), Devin (rotating + scaling), Celine (soft shadow), Maya (serif + glow), Ella (scale + blur animations), Dan (large uppercase), David (bold uppercase highlights), Tracy (white glow), Umi (letter-by-letter reveal), Iman (white text + black border), William (alternating animations). Defaults to Hormozi_1.

What languages are supported?

45 languages are supported: English (and regional variants for USA, UK, Australia, Canada), Japanese, Chinese, German, Hindi, French (France, Canada), Korean, Portuguese (Brazil, Portugal), Spanish (Spain, Mexico), Italian, Indonesian, Dutch, Turkish, Filipino, Polish, Swedish, Bulgarian, Romanian, Arabic (Saudi Arabia, UAE), Czech, Greek, Finnish, Croatian, Malay, Slovak, Danish, Tamil, Telugu, Ukrainian, Russian, Hungarian, Norwegian, and Vietnamese. Defaults to English.

What is the maximum video size?

Videos must be under 600MB and no longer than 10 minutes. For longer content, split the video before processing.
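
For content longer than the limit, one way to plan the split is to divide the video into the fewest equal-length segments that each fit within 10 minutes. The Python sketch below only computes the time ranges; the function name plan_segments is hypothetical, the actual cutting is left to a video tool of your choice, and the 600MB size limit still has to be checked separately for each segment.

```python
import math


def plan_segments(total_seconds: float,
                  max_seconds: float = 600.0) -> list[tuple[float, float]]:
    """Return (start, end) times in seconds for equal-length segments,
    each no longer than max_seconds (10 minutes by default)."""
    if total_seconds <= 0:
        raise ValueError("total_seconds must be positive")
    if max_seconds <= 0:
        raise ValueError("max_seconds must be positive")
    count = math.ceil(total_seconds / max_seconds)
    length = total_seconds / count
    return [
        (index * length, min((index + 1) * length, total_seconds))
        for index in range(count)
    ]


if __name__ == "__main__":
    # A 25-minute video becomes three segments of 500 seconds each.
    print(plan_segments(1500))
```

Equal-length segments avoid a very short final piece, which would otherwise still incur the $0.20 minimum charge. Cutting at fixed times can split a sentence in two, so moving each boundary to a nearby pause may give cleaner captions.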

How long does processing take?

Videos under 1 minute typically complete within 2–3 minutes. Longer videos may take up to 10–15 minutes depending on duration and server load.