Explore/muapi.ai/gemini-omni-character

muapi/gemini-omni-character

Image to Image

Generate a reusable character from a single reference image and a text description. Optionally attach a voice profile created with Gemini Omni Audio to give the character a consistent voice in future video generations.

Input

Configure the model parameters below.

0/1 items
Drag & drop images here or paste file/image
0/20

Result

Your generated results
will appear here

📝

Overview

About this model

Turn a single reference photo and a text description into a reusable character — complete with a unique ID and a generated character image. Optionally attach a voice profile to give the character a consistent voice across future video generations.

1Storytelling: Create named characters for recurring use in AI-generated video narratives.
2Brand Mascots: Build a visual character asset from a concept image and description for marketing videos.
3Game & Animation Prototyping: Rapidly generate character visuals from reference art before full production.
💰

Pricing & Value

Cost analysis

muapiappFree

No charge for character creation.

Fal.aiNot available

Gemini Omni Character creation is not offered.

ReplicateNot available

Gemini Omni Character creation is not offered.

* Competitor pricing is estimated based on similar model architectures and usage tiers.

⚙️

Technical Details

Configuration schema

Character Descriptionstring

Describe the character's appearance, identity, style, and personality.

Default ValueA young woman with short silver hair, wearing a dark trench coat, confident and composed demeanor.
Reference Imagearray

Provide exactly 1 reference image of the character. Maximum 20 MB. Must be a publicly accessible URL.

Default Valuehttps://d3adwkbyhxyrtq.cloudfront.net/ai-images/186/712345784292/4a8c5c70-abcc-4920-873e-b0e219986453.jpg
Character Namestring

Optional display name for the character.

Default ValueAria
Voice Profile IDsarray

Optional list of voice profile IDs from Gemini Omni Audio to associate with this character.

Default Valuea8f1c2d3e4f5b6a7
📖

Implementation Guide

Developer documentation

How to Use Gemini Omni Character

  1. Provide a reference image: Upload one image via images_list (exactly 1 image, max 20 MB, publicly accessible URL). This is the visual anchor for the character.

  2. Describe the character: Fill in descriptions with appearance, identity, style, and personality details.

  3. Name the character (optional): Set character_name for a human-readable label.

  4. Attach a voice (optional): Pass one or more audio_id values in audio_ids — these come from the Gemini Omni Audio endpoint and give the character a consistent voice.

  5. Use the result: The response includes a character_id, character_name, and an image URL in outputs. Save the character_id for use in future video pipelines that support character-driven generation.

Common Questions

Frequently asked

How many reference images can I provide?

Exactly one image is required and supported. The image must be a publicly accessible URL and no larger than 20 MB.

Where do the audio_ids come from?

Voice profile IDs are created using the Gemini Omni Audio endpoint. Call that endpoint first to register a voice, then pass the returned `audio_id` here.

What does the output image show?

The image URL returned in `outputs` is the generated character image — a visual representation of the character derived from your reference photo and description.