Explore/muapi.ai/claude-sonnet-4-6

muapi/claude-sonnet-4-6

Text to Text

Claude Sonnet 4.6 delivers strong reasoning, advanced coding, and native computer-use functionality. Supports text and image inputs with up to 1M token context. Token-based pricing: $1.80/M input tokens, $9.00/M output tokens. Two endpoints: standard async (/claude-sonnet-4-6) and live streaming (/claude-sonnet-4-6/stream) via SSE.

Input

Configure the model parameters below.

Append past messages to the context to remember conversation history.

claude-sonnet-4-6 Chat

🚀Related Models

View all
claude-opus-4-6

claude-opus-4-6

Claude Opus 4.6 is Anthropic's most capable model for complex coding, long-context reasoning, and agentic workflows. Supports text and image inputs. Token-based pricing: $3.00/M input tokens, $15.00/M output tokens. Two endpoints: standard async (/claude-opus-4-6) and live streaming (/claude-opus-4-6/stream) via SSE.

Text to Text
claude-haiku-4-5

claude-haiku-4-5

Claude Haiku 4.5 is Anthropic's fastest and most cost-effective model, designed for high-frequency queries, simple tasks, and near-instant response times. Supports text and image inputs. Token-based pricing: $0.60/M input tokens, $3.00/M output tokens. Two endpoints: standard async (/claude-haiku-4-5) and live streaming (/claude-haiku-4-5/stream) via SSE.

Text to Text
claude-opus-4-8

claude-opus-4-8

Claude Opus 4.8 is Anthropic's most capable model for complex coding, long-context reasoning, and agentic workflows. Supports text and image inputs. Token-based pricing: $3.00/M input tokens, $15.00/M output tokens. Two endpoints: standard async (/claude-opus-4-8) and live streaming (/claude-opus-4-8/stream) via SSE.

Text to Text
gemini-3-5-flash

gemini-3-5-flash

Gemini 3.5 Flash is a high-speed, multimodal language model built for real-time text generation, supporting text and image inputs natively. Token-based pricing: $0.60/M input tokens and $3.60/M output tokens. Two endpoints: standard async (/gemini-3-5-flash) and live streaming (/gemini-3-5-flash/stream) via SSE.

Text to Text
gemini-3-5-flash-openai

gemini-3-5-flash-openai

Gemini 3.5 Flash (OpenAI-compatible) is a high-speed, multimodal language model built for real-time text generation, supporting text and image inputs natively. Token-based pricing: $0.60/M input tokens and $3.60/M output tokens. Two endpoints: standard async (/gemini-3-5-flash-openai) and live streaming (/gemini-3-5-flash-openai/stream) via SSE.

Text to Text
claude-opus-4-7

claude-opus-4-7

Claude Opus 4.7 is Anthropic's highly capable model for complex coding, long-context reasoning, and agentic workflows. Supports text and image inputs. Token-based pricing: $3.00/M input tokens, $15.00/M output tokens. Two endpoints: standard async (/claude-opus-4-7) and live streaming (/claude-opus-4-7/stream) via SSE.

Text to Text
gemini-3-1-pro

gemini-3-1-pro

Gemini 3.1 Pro is Google's next-generation multimodal model, optimized for complex reasoning, planning, coding, and multi-turn conversation. Supports text and image inputs. Token-based pricing: $4.00/M input tokens, $24.00/M output tokens. Two endpoints: standard async (/gemini-3-1-pro) and live streaming (/gemini-3-1-pro/stream) via SSE.

Text to Text
gemini-3-pro

gemini-3-pro

Gemini 3 Pro is Google's powerful multimodal reasoning model, designed for complex problem solving, coding, and logical tasks. Supports text and image inputs. Token-based pricing: $4.00/M input tokens, $24.00/M output tokens. Two endpoints: standard async (/gemini-3-pro) and live streaming (/gemini-3-pro/stream) via SSE.

Text to Text
claude-opus-4-5

claude-opus-4-5

Claude Opus 4.5 is Anthropic's highly capable model for complex coding, long-context reasoning, and agentic workflows. Supports text and image inputs. Token-based pricing: $3.00/M input tokens, $15.00/M output tokens. Two endpoints: standard async (/claude-opus-4-5) and live streaming (/claude-opus-4-5/stream) via SSE.

Text to Text
claude-sonnet-4-5

claude-sonnet-4-5

Claude Sonnet 4.5 is Anthropic's state-of-the-art model offering high intelligence, speed, and efficiency for code generation, writing, and logical analysis. Supports text and image inputs. Token-based pricing: $1.80/M input tokens, $9.00/M output tokens. Two endpoints: standard async (/claude-sonnet-4-5) and live streaming (/claude-sonnet-4-5/stream) via SSE.

Text to Text
gemini-2-5-pro

gemini-2-5-pro

Gemini 2.5 Pro is Google's advanced multimodal reasoning model, optimized for complex coding, logical tasks, and deep analysis. Supports text and image inputs. Token-based pricing: $1.25/M input tokens, $10.00/M output tokens. Two endpoints: standard async (/gemini-2-5-pro) and live streaming (/gemini-2-5-pro/stream) via SSE.

Text to Text
gemini-2-5-flash

gemini-2-5-flash

Gemini 2.5 Flash is Google's high-speed multimodal language model, optimized for rapid text generation, real-time image understanding, and high-frequency tasks. Supports text and image inputs. Token-based pricing: $0.30/M input tokens, $2.50/M output tokens. Two endpoints: standard async (/gemini-2-5-flash) and live streaming (/gemini-2-5-flash/stream) via SSE.

Text to Text
gpt-5-2

gpt-5-2

GPT 5.2 is a lightweight reasoning model with fast response times and deep coding capabilities. Supports image inputs, system prompts, web search capabilities, and reasoning effort control. Pricing: $1.25/M input tokens, $9.00/M output tokens.

Text to Text
claude-fable-5

claude-fable-5

Claude Fable 5 is the latest flagship model from Anthropic. Supports text and image inputs with advanced reasoning and creative capabilities. Token-based pricing: $8.00/M input tokens, $40.00/M output tokens. Two endpoints: standard async (/claude-fable-5) and live streaming (/claude-fable-5/stream) via SSE.

Text to Text
gpt-5-5

gpt-5-5

GPT 5.5 is OpenAI's state-of-the-art flagship reasoning model for high-complexity problems. Supports image and file uploads, system prompts, web search capabilities, and reasoning effort control. Pricing: $2.40/M input tokens, $16.00/M output tokens.

Text to Text
generate-social-video-script

generate-social-video-script

Generate viral short-form video scripts for social media based on a topic and niche.

Text to Text
gemini-3-flash

gemini-3-flash

Gemini 3 Flash is a fast, multimodal language model for real-time text generation. Supports text and image inputs, function calling, and Google Search grounding. Token-based pricing: $0.30/M input tokens and $1.80/M output tokens. Two endpoints: standard async (/gemini-3-flash) and live streaming (/gemini-3-flash/stream) via SSE.

Text to Text
gpt-5-4

gpt-5-4

GPT-5.4 delivers powerful reasoning, coding, and professional knowledge work. Supports multimodal inputs (text and image) with adjustable reasoning depth. Token-based pricing: $1.25/M input tokens, $9.00/M output tokens. Two endpoints: standard async (/gpt-5-4) and live streaming (/gpt-5-4/stream) via SSE.

Text to Text
gpt-codex

gpt-codex

OpenAI GPT Codex delivers advanced coding capabilities with scalable reasoning depth. Supports multiple model variants (gpt-5-codex through gpt-5.4-codex) and multimodal inputs. Token-based pricing: $1.25/M input tokens, $9.00/M output tokens. Two endpoints: standard async (/gpt-codex) and live streaming (/gpt-codex/stream) via SSE.

Text to Text
📝

Overview

About this model

Claude Sonnet 4.6 is Anthropic's high-performance model combining strong reasoning, advanced coding, and native computer-use functionality. With an adaptive thinking mode and up to 1 million token context, it excels at building reliable AI agents, complex automation, and large-scale content workflows. Pricing is token-based: $1.80 per million input tokens and $9.00 per million output tokens — significantly below official Anthropic pricing.

1Coding & Engineering: Write, review, and debug code with context-aware reasoning across large codebases.
2Agentic Workflows: Build autonomous agents that use tools, call APIs, and complete multi-step tasks.
3Content Generation: Draft long-form articles, summaries, reports, and structured documents.
4Multimodal Analysis: Combine image and text inputs to extract insights from diagrams, screenshots, and documents.
💰

Pricing & Value

Cost analysis

muapiapp$1.80/M input tokens, $9.00/M output tokens

Token-based billing. Minimum $0.00035 per call. ~60% of official Anthropic pricing.

Anthropic (official)~$3.00/M input tokens, ~$15.00/M output tokens

Official pricing via api.anthropic.com.

Fal.aiNot available

Claude Sonnet 4.6 is not available on Fal.ai.

* Competitor pricing is estimated based on similar model architectures and usage tiers.

⚙️

Technical Details

Configuration schema

Promptstring

The user message or instruction.

Default ValueExplain the concept of recursion with a simple example.
Image URLstring

Optional image URL for multimodal requests.

Default Valueundefined
System Promptstring

Optional system-level instruction to guide model behavior.

Default ValueYou are a concise coding assistant.
📖

Implementation Guide

Developer documentation

Standard (Async)

POST /api/v1/claude-sonnet-4-6 — returns request_id, poll for result via /api/v1/predictions/{id}/result.

Streaming (SSE)

POST /api/v1/claude-sonnet-4-6/stream — returns a live Server-Sent Events stream (Content-Type: text/event-stream). Each chunk has the format data: {"choices":[{"delta":{"content":"text"}}]}, ending with data: [DONE].

See Streaming Documentation for full code examples.

Common Questions

Frequently asked

How is pricing calculated?

Pricing is token-based: $1.80 per million input tokens and $9.00 per million output tokens. The minimum charge per call is $0.00035. Actual cost is deducted after each call based on token counts returned by the model.

What is the difference between /claude-sonnet-4-6 and /claude-sonnet-4-6/stream?

/claude-sonnet-4-6 is async — you receive a request_id and poll for the result. /claude-sonnet-4-6/stream returns a live SSE stream. Use streaming for chat UIs; use the async endpoint for workflows and automation.

Does Claude Sonnet 4.6 support images?

Yes. Pass an image_url in the request body to include an image as part of the user message.