muapi/gpt-5-5

Text to Text

GPT 5.5 is OpenAI's state-of-the-art flagship reasoning model for high-complexity problems. Supports image and file uploads, system prompts, web search capabilities, and reasoning effort control. Pricing: $2.40/M input tokens, $16.00/M output tokens.

Token-based pricing

Type	Rate
Input tokens	$2.40/M
Output tokens	$16.00/M
Minimum per run	$0.00045

🚀Related Models

claude-opus-4-6

Claude Opus 4.6 is Anthropic's most capable model for complex coding, long-context reasoning, and agentic workflows. Supports text and image inputs. Token-based pricing: $3.00/M input tokens, $15.00/M output tokens. Two endpoints: standard async (/claude-opus-4-6) and live streaming (/claude-opus-4-6/stream) via SSE.

Text to Text

gemini-2-5-pro

Gemini 2.5 Pro is Google's advanced multimodal reasoning model, optimized for complex coding, logical tasks, and deep analysis. Supports text and image inputs. Token-based pricing: $1.25/M input tokens, $10.00/M output tokens. Two endpoints: standard async (/gemini-2-5-pro) and live streaming (/gemini-2-5-pro/stream) via SSE.

Text to Text

gemini-3-5-flash-openai

Gemini 3.5 Flash (OpenAI-compatible) is a high-speed, multimodal language model built for real-time text generation, supporting text and image inputs natively. Token-based pricing: $0.60/M input tokens and $3.60/M output tokens. Two endpoints: standard async (/gemini-3-5-flash-openai) and live streaming (/gemini-3-5-flash-openai/stream) via SSE.

Text to Text

claude-opus-4-7

Claude Opus 4.7 is Anthropic's highly capable model for complex coding, long-context reasoning, and agentic workflows. Supports text and image inputs. Token-based pricing: $3.00/M input tokens, $15.00/M output tokens. Two endpoints: standard async (/claude-opus-4-7) and live streaming (/claude-opus-4-7/stream) via SSE.

Text to Text

gemini-3-1-pro

Gemini 3.1 Pro is Google's next-generation multimodal model, optimized for complex reasoning, planning, coding, and multi-turn conversation. Supports text and image inputs. Token-based pricing: $4.00/M input tokens, $24.00/M output tokens. Two endpoints: standard async (/gemini-3-1-pro) and live streaming (/gemini-3-1-pro/stream) via SSE.

Text to Text

claude-sonnet-4-5

Claude Sonnet 4.5 is Anthropic's state-of-the-art model offering high intelligence, speed, and efficiency for code generation, writing, and logical analysis. Supports text and image inputs. Token-based pricing: $1.80/M input tokens, $9.00/M output tokens. Two endpoints: standard async (/claude-sonnet-4-5) and live streaming (/claude-sonnet-4-5/stream) via SSE.

Text to Text

claude-haiku-4-5

Claude Haiku 4.5 is Anthropic's fastest and most cost-effective model, designed for high-frequency queries, simple tasks, and near-instant response times. Supports text and image inputs. Token-based pricing: $0.60/M input tokens, $3.00/M output tokens. Two endpoints: standard async (/claude-haiku-4-5) and live streaming (/claude-haiku-4-5/stream) via SSE.

Text to Text

gemini-2-5-flash

Gemini 2.5 Flash is Google's high-speed multimodal language model, optimized for rapid text generation, real-time image understanding, and high-frequency tasks. Supports text and image inputs. Token-based pricing: $0.30/M input tokens, $2.50/M output tokens. Two endpoints: standard async (/gemini-2-5-flash) and live streaming (/gemini-2-5-flash/stream) via SSE.

Text to Text

generate-social-video-script

Generate viral short-form video scripts for social media based on a topic and niche.

Text to Text

claude-opus-4-8

Claude Opus 4.8 is Anthropic's most capable model for complex coding, long-context reasoning, and agentic workflows. Supports text and image inputs. Token-based pricing: $3.00/M input tokens, $15.00/M output tokens. Two endpoints: standard async (/claude-opus-4-8) and live streaming (/claude-opus-4-8/stream) via SSE.

Text to Text

gemini-3-pro

Gemini 3 Pro is Google's powerful multimodal reasoning model, designed for complex problem solving, coding, and logical tasks. Supports text and image inputs. Token-based pricing: $4.00/M input tokens, $24.00/M output tokens. Two endpoints: standard async (/gemini-3-pro) and live streaming (/gemini-3-pro/stream) via SSE.

Text to Text

gpt-codex

OpenAI GPT Codex delivers advanced coding capabilities with scalable reasoning depth. Supports multiple model variants (gpt-5-codex through gpt-5.4-codex) and multimodal inputs. Token-based pricing: $1.25/M input tokens, $9.00/M output tokens. Two endpoints: standard async (/gpt-codex) and live streaming (/gpt-codex/stream) via SSE.

Text to Text

gpt-5-2

GPT 5.2 is a lightweight reasoning model with fast response times and deep coding capabilities. Supports image inputs, system prompts, web search capabilities, and reasoning effort control. Pricing: $1.25/M input tokens, $9.00/M output tokens.

Text to Text

claude-fable-5

Claude Fable 5 is the latest flagship model from Anthropic. Supports text and image inputs with advanced reasoning and creative capabilities. Token-based pricing: $8.00/M input tokens, $40.00/M output tokens. Two endpoints: standard async (/claude-fable-5) and live streaming (/claude-fable-5/stream) via SSE.

Text to Text

claude-sonnet-4-6

Claude Sonnet 4.6 delivers strong reasoning, advanced coding, and native computer-use functionality. Supports text and image inputs with up to 1M token context. Token-based pricing: $1.80/M input tokens, $9.00/M output tokens. Two endpoints: standard async (/claude-sonnet-4-6) and live streaming (/claude-sonnet-4-6/stream) via SSE.

Text to Text

claude-opus-4-5

Claude Opus 4.5 is Anthropic's highly capable model for complex coding, long-context reasoning, and agentic workflows. Supports text and image inputs. Token-based pricing: $3.00/M input tokens, $15.00/M output tokens. Two endpoints: standard async (/claude-opus-4-5) and live streaming (/claude-opus-4-5/stream) via SSE.

Text to Text

gemini-3-5-flash

Gemini 3.5 Flash is a high-speed, multimodal language model built for real-time text generation, supporting text and image inputs natively. Token-based pricing: $0.60/M input tokens and $3.60/M output tokens. Two endpoints: standard async (/gemini-3-5-flash) and live streaming (/gemini-3-5-flash/stream) via SSE.

Text to Text

gemini-3-flash

Gemini 3 Flash is a fast, multimodal language model for real-time text generation. Supports text and image inputs, function calling, and Google Search grounding. Token-based pricing: $0.30/M input tokens and $1.80/M output tokens. Two endpoints: standard async (/gemini-3-flash) and live streaming (/gemini-3-flash/stream) via SSE.

Text to Text

gpt-5-4

GPT-5.4 delivers powerful reasoning, coding, and professional knowledge work. Supports multimodal inputs (text and image) with adjustable reasoning depth. Token-based pricing: $1.25/M input tokens, $9.00/M output tokens. Two endpoints: standard async (/gpt-5-4) and live streaming (/gpt-5-4/stream) via SSE.

Text to Text

📝

Overview

About this model

GPT 5.5 is OpenAI's state-of-the-art flagship reasoning model for high-complexity problems. It supports image and file uploads, system prompts, web search capabilities, and reasoning effort control. Pricing: $2.40 per million input tokens and $16.00 per million output tokens.

1. Advanced Reasoning: Solve highly complex mathematics, programming, and logic puzzles.

2. Document Parsing: Upload PDFs, Word documents, or text files for analysis.

3. Visual Cognition: Reason over diagrams, charts, and complex screenshots.

4. Real-Time Data: Integrate web search to answer questions with the latest information.

💰

Pricing & Value

Cost analysis

Provider	Cost	Notes
muapiapp	$2.40/M input tokens, $16.00/M output tokens	Token-based billing. Minimum $0.00045 per call. ~68% of official input pricing.
OpenAI (official)	~$3.50/M input tokens, ~$25.00/M output tokens	Official pricing via api.openai.com.

* Competitor pricing is estimated based on similar model architectures and usage tiers.

⚙️

Technical Details

Configuration schema

Parameter	Type	Description	Default
Prompt	string	The user message or instruction.	`Provide a comprehensive analysis of the attached documents.`
Image URL	string	Optional image URL for multimodal requests.	`undefined`
System Prompt	string	Optional system-level instruction to guide model behavior.	`You are a professional financial analyst.`
Web Search Switch	boolean	Enable web search capability.	`false`
Reasoning Effort	Enum (3 options)	Level of reasoning depth: low, medium, or high.	`low`
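
The table above lists display labels rather than wire-level field names. As a rough sketch of how a request body might be assembled, the helper below assumes snake_case keys. Only `file_url` and `reasoning_effort` are named on this page; `prompt`, `image_url`, `system_prompt`, and `web_search` are guesses and should be checked against the live API before use.

```python
from typing import Any, Dict, Optional

# The three values the schema lists for Reasoning Effort.
REASONING_EFFORTS = ("low", "medium", "high")


def build_payload(
    prompt: str,
    image_url: Optional[str] = None,
    file_url: Optional[str] = None,
    system_prompt: Optional[str] = None,
    web_search: bool = False,
    reasoning_effort: str = "low",
) -> Dict[str, Any]:
    """Assemble a request body, leaving out optional fields that are unset.

    Key names other than file_url and reasoning_effort are assumptions.
    """
    if not prompt:
        raise ValueError("prompt must be a non-empty string")
    if reasoning_effort not in REASONING_EFFORTS:
        raise ValueError(f"reasoning_effort must be one of {REASONING_EFFORTS}")

    payload: Dict[str, Any] = {
        "prompt": prompt,                      # assumed key name
        "web_search": web_search,              # assumed key name
        "reasoning_effort": reasoning_effort,  # named in the FAQ
    }
    if image_url is not None:
        payload["image_url"] = image_url       # assumed key name
    if file_url is not None:
        payload["file_url"] = file_url         # named in the FAQ
    if system_prompt is not None:
        payload["system_prompt"] = system_prompt  # assumed key name
    return payload
```

Validating `reasoning_effort` on the client side avoids spending a call on a value the schema does not list.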

📖

Implementation Guide

Developer documentation

Standard (Async)

POST /api/v1/gpt-5-5 returns a request_id. Poll /api/v1/predictions/{id}/result with that ID to retrieve the result.
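
The submit-then-poll flow can be sketched as follows. This is a minimal illustration, not the official client: the host `https://api.muapi.ai`, the `x-api-key` header, and the `status` field with its `completed` and `failed` values are all assumptions that this page does not confirm. Only the two paths and the `request_id` field come from the text above.

```python
import json
import time
import urllib.request
from typing import Any, Callable, Dict, Optional

BASE_URL = "https://api.muapi.ai"  # assumption: the host is not stated on this page
Transport = Callable[[str, str, Optional[Dict[str, Any]]], Dict[str, Any]]


def make_http_transport(api_key: str) -> Transport:
    """Return a JSON-over-HTTP transport. The x-api-key header is an assumption."""
    def http_json(method: str, url: str, body: Optional[Dict[str, Any]] = None) -> Dict[str, Any]:
        data = json.dumps(body).encode("utf-8") if body is not None else None
        req = urllib.request.Request(
            url,
            data=data,
            method=method,
            headers={"Content-Type": "application/json", "x-api-key": api_key},
        )
        with urllib.request.urlopen(req, timeout=60) as resp:
            return json.loads(resp.read().decode("utf-8"))
    return http_json


def run_async(
    payload: Dict[str, Any],
    transport: Transport,
    poll_interval: float = 2.0,
    max_polls: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Submit a request, then poll the result endpoint until it finishes."""
    submitted = transport("POST", f"{BASE_URL}/api/v1/gpt-5-5", payload)
    request_id = submitted["request_id"]
    result_url = f"{BASE_URL}/api/v1/predictions/{request_id}/result"

    for _ in range(max_polls):
        result = transport("GET", result_url, None)
        # Assumption: the result carries a "status" field; these values are hypothetical.
        status = result.get("status")
        if status == "completed":
            return result
        if status == "failed":
            raise RuntimeError(f"request {request_id} failed: {result}")
        sleep(poll_interval)
    raise TimeoutError(f"request {request_id} did not finish after {max_polls} polls")
```

Passing the transport in as a parameter keeps the polling logic testable without network access; a real call would look like `run_async({"prompt": "Hello"}, make_http_transport("YOUR_API_KEY"))`, with the payload keys subject to the same caveats.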

Streaming (SSE)

POST /api/v1/gpt-5-5/stream returns a live SSE stream. Each chunk has the form data: {"choices":[{"delta":{"content":"text"}}]}, and the stream ends with data: [DONE].
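
To show how the chunk format above can be consumed, here is a small parser sketch. It handles only the shape documented here (single-line `data:` events carrying a `choices[].delta.content` string, terminated by `data: [DONE]`) and assumes the lines have already been decoded to text; the function name `iter_sse_text` is illustrative.

```python
import json
from typing import Iterable, Iterator


def iter_sse_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield the text fragments carried by a stream of SSE lines."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators and any non-data fields
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            return  # end-of-stream sentinel
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            text = choice.get("delta", {}).get("content")
            if text:
                yield text


sample = [
    'data: {"choices":[{"delta":{"content":"Hel"}}]}',
    "",
    'data: {"choices":[{"delta":{"content":"lo"}}]}',
    "data: [DONE]",
]
print("".join(iter_sse_text(sample)))  # prints "Hello"
```

Because the function accepts any iterable of strings, it can be fed from a decoded HTTP response body line by line as chunks arrive.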

❓

Common Questions

Frequently asked

How is pricing calculated?

Pricing is token-based: $2.40 per million input tokens and $16.00 per million output tokens. The minimum charge per call is $0.00045. Actual cost is deducted after each call based on token counts returned by the model.
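
As a worked illustration of the rates above, the sketch below estimates the charge for a single call. It assumes the $0.00045 minimum acts as a simple floor on the computed amount and that no further rounding is applied; neither detail is spelled out here, so treat the result as an estimate rather than a billing guarantee.

```python
from decimal import Decimal

INPUT_RATE = Decimal("2.40")     # USD per million input tokens
OUTPUT_RATE = Decimal("16.00")   # USD per million output tokens
MIN_CHARGE = Decimal("0.00045")  # USD per call, assumed to be a floor
PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD charge for one call from its token counts."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    usage = (INPUT_RATE * input_tokens + OUTPUT_RATE * output_tokens) / PER_MILLION
    return max(usage, MIN_CHARGE)


# 1,200 input tokens and 350 output tokens:
# (2.40 * 1200 + 16.00 * 350) / 1,000,000 = 0.00848
print(estimate_cost(1200, 350))

# A tiny call falls below the minimum, so the floor applies.
print(estimate_cost(10, 5))
```

`Decimal` is used instead of floats so that sub-cent amounts are not distorted by binary rounding.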

How do I upload a file?

Pass a file_url in the request body that points to the document, such as a PDF or Word file. The model parses and analyzes its contents.

What is the reasoning effort option?

You can control the depth of reasoning by passing a reasoning_effort value of 'low', 'medium', or 'high'. The default value is 'low'.
