Explore/muapi.ai/claude-fable-5

muapi/claude-fable-5

Text to Text

Claude Fable 5 is the latest flagship model from Anthropic. Supports text and image inputs with advanced reasoning and creative capabilities. Token-based pricing: $8.00/M input tokens, $40.00/M output tokens. Two endpoints: standard async (/claude-fable-5) and live streaming (/claude-fable-5/stream) via SSE.

Input

Configure the model parameters below.

Append past messages to the context to remember conversation history.

Token-based pricing

TypeRate
Input tokens$8.00/M
Output tokens$40.00/M
Minimum per run$0.0010

🚀Related Models

View all
claude-opus-4-6

claude-opus-4-6

Claude Opus 4.6 is Anthropic's most capable model for complex coding, long-context reasoning, and agentic workflows. Supports text and image inputs. Token-based pricing: $3.00/M input tokens, $15.00/M output tokens. Two endpoints: standard async (/claude-opus-4-6) and live streaming (/claude-opus-4-6/stream) via SSE.

Text to Text
gemini-2-5-pro

gemini-2-5-pro

Gemini 2.5 Pro is Google's advanced multimodal reasoning model, optimized for complex coding, logical tasks, and deep analysis. Supports text and image inputs. Token-based pricing: $1.25/M input tokens, $10.00/M output tokens. Two endpoints: standard async (/gemini-2-5-pro) and live streaming (/gemini-2-5-pro/stream) via SSE.

Text to Text
gemini-3-5-flash-openai

gemini-3-5-flash-openai

Gemini 3.5 Flash (OpenAI-compatible) is a high-speed, multimodal language model built for real-time text generation, supporting text and image inputs natively. Token-based pricing: $0.60/M input tokens and $3.60/M output tokens. Two endpoints: standard async (/gemini-3-5-flash-openai) and live streaming (/gemini-3-5-flash-openai/stream) via SSE.

Text to Text
claude-opus-4-7

claude-opus-4-7

Claude Opus 4.7 is Anthropic's highly capable model for complex coding, long-context reasoning, and agentic workflows. Supports text and image inputs. Token-based pricing: $3.00/M input tokens, $15.00/M output tokens. Two endpoints: standard async (/claude-opus-4-7) and live streaming (/claude-opus-4-7/stream) via SSE.

Text to Text
gemini-3-1-pro

gemini-3-1-pro

Gemini 3.1 Pro is Google's next-generation multimodal model, optimized for complex reasoning, planning, coding, and multi-turn conversation. Supports text and image inputs. Token-based pricing: $4.00/M input tokens, $24.00/M output tokens. Two endpoints: standard async (/gemini-3-1-pro) and live streaming (/gemini-3-1-pro/stream) via SSE.

Text to Text
claude-sonnet-4-5

claude-sonnet-4-5

Claude Sonnet 4.5 is Anthropic's state-of-the-art model offering high intelligence, speed, and efficiency for code generation, writing, and logical analysis. Supports text and image inputs. Token-based pricing: $1.80/M input tokens, $9.00/M output tokens. Two endpoints: standard async (/claude-sonnet-4-5) and live streaming (/claude-sonnet-4-5/stream) via SSE.

Text to Text
claude-haiku-4-5

claude-haiku-4-5

Claude Haiku 4.5 is Anthropic's fastest and most cost-effective model, designed for high-frequency queries, simple tasks, and near-instant response times. Supports text and image inputs. Token-based pricing: $0.60/M input tokens, $3.00/M output tokens. Two endpoints: standard async (/claude-haiku-4-5) and live streaming (/claude-haiku-4-5/stream) via SSE.

Text to Text
gemini-2-5-flash

gemini-2-5-flash

Gemini 2.5 Flash is Google's high-speed multimodal language model, optimized for rapid text generation, real-time image understanding, and high-frequency tasks. Supports text and image inputs. Token-based pricing: $0.30/M input tokens, $2.50/M output tokens. Two endpoints: standard async (/gemini-2-5-flash) and live streaming (/gemini-2-5-flash/stream) via SSE.

Text to Text
gpt-5-5

gpt-5-5

GPT 5.5 is OpenAI's state-of-the-art flagship reasoning model for high-complexity problems. Supports image and file uploads, system prompts, web search capabilities, and reasoning effort control. Pricing: $2.40/M input tokens, $16.00/M output tokens.

Text to Text
generate-social-video-script

generate-social-video-script

Generate viral short-form video scripts for social media based on a topic and niche.

Text to Text
claude-opus-4-8

claude-opus-4-8

Claude Opus 4.8 is Anthropic's most capable model for complex coding, long-context reasoning, and agentic workflows. Supports text and image inputs. Token-based pricing: $3.00/M input tokens, $15.00/M output tokens. Two endpoints: standard async (/claude-opus-4-8) and live streaming (/claude-opus-4-8/stream) via SSE.

Text to Text
gemini-3-pro

gemini-3-pro

Gemini 3 Pro is Google's powerful multimodal reasoning model, designed for complex problem solving, coding, and logical tasks. Supports text and image inputs. Token-based pricing: $4.00/M input tokens, $24.00/M output tokens. Two endpoints: standard async (/gemini-3-pro) and live streaming (/gemini-3-pro/stream) via SSE.

Text to Text
gpt-codex

gpt-codex

OpenAI GPT Codex delivers advanced coding capabilities with scalable reasoning depth. Supports multiple model variants (gpt-5-codex through gpt-5.4-codex) and multimodal inputs. Token-based pricing: $1.25/M input tokens, $9.00/M output tokens. Two endpoints: standard async (/gpt-codex) and live streaming (/gpt-codex/stream) via SSE.

Text to Text
gpt-5-2

gpt-5-2

GPT 5.2 is a lightweight reasoning model with fast response times and deep coding capabilities. Supports image inputs, system prompts, web search capabilities, and reasoning effort control. Pricing: $1.25/M input tokens, $9.00/M output tokens.

Text to Text
claude-sonnet-4-6

claude-sonnet-4-6

Claude Sonnet 4.6 delivers strong reasoning, advanced coding, and native computer-use functionality. Supports text and image inputs with up to 1M token context. Token-based pricing: $1.80/M input tokens, $9.00/M output tokens. Two endpoints: standard async (/claude-sonnet-4-6) and live streaming (/claude-sonnet-4-6/stream) via SSE.

Text to Text
claude-opus-4-5

claude-opus-4-5

Claude Opus 4.5 is Anthropic's highly capable model for complex coding, long-context reasoning, and agentic workflows. Supports text and image inputs. Token-based pricing: $3.00/M input tokens, $15.00/M output tokens. Two endpoints: standard async (/claude-opus-4-5) and live streaming (/claude-opus-4-5/stream) via SSE.

Text to Text
gemini-3-5-flash

gemini-3-5-flash

Gemini 3.5 Flash is a high-speed, multimodal language model built for real-time text generation, supporting text and image inputs natively. Token-based pricing: $0.60/M input tokens and $3.60/M output tokens. Two endpoints: standard async (/gemini-3-5-flash) and live streaming (/gemini-3-5-flash/stream) via SSE.

Text to Text
gemini-3-flash

gemini-3-flash

Gemini 3 Flash is a fast, multimodal language model for real-time text generation. Supports text and image inputs, function calling, and Google Search grounding. Token-based pricing: $0.30/M input tokens and $1.80/M output tokens. Two endpoints: standard async (/gemini-3-flash) and live streaming (/gemini-3-flash/stream) via SSE.

Text to Text
gpt-5-4

gpt-5-4

GPT-5.4 delivers powerful reasoning, coding, and professional knowledge work. Supports multimodal inputs (text and image) with adjustable reasoning depth. Token-based pricing: $1.25/M input tokens, $9.00/M output tokens. Two endpoints: standard async (/gpt-5-4) and live streaming (/gpt-5-4/stream) via SSE.

Text to Text
📝

Overview

About this model

Claude Fable 5 is the latest flagship model from Anthropic, delivering top-tier reasoning, creative writing, and multimodal analysis. With support for both text and image inputs, it excels at complex multi-step tasks, technical work, and long-form content generation. Pricing is token-based: $8.00 per million input tokens and $40.00 per million output tokens.

1Complex Reasoning: Tackle multi-step logic, deep analysis, and research synthesis with state-of-the-art accuracy.
2Creative Writing: Generate long-form stories, scripts, essays, and creative content with nuanced style control.
3Coding & Engineering: Write, review, debug, and refactor code across any language with expert-level understanding.
4Multimodal Analysis: Combine image and text inputs to extract insights from diagrams, screenshots, and documents.
💰

Pricing & Value

Cost analysis

muapiapp$8.00/M input tokens, $40.00/M output tokens

Token-based billing. Minimum $0.001 per call.

Anthropic (official)~$15.00/M input tokens, ~$75.00/M output tokens

Official pricing via api.anthropic.com. Muapiapp pricing is significantly lower.

Fal.aiNot available

Claude Fable 5 is not available on Fal.ai.

* Competitor pricing is estimated based on similar model architectures and usage tiers.

⚙️

Technical Details

Configuration schema

Promptstring

The user message or instruction.

Default ValueExplain quantum entanglement in simple terms.
Image URLstring

Optional image URL for multimodal requests.

Default Valueundefined
System Promptstring

Optional system-level instruction to guide model behavior.

Default ValueYou are a concise and precise assistant.
📖

Implementation Guide

Developer documentation

Standard (Async)

POST /api/v1/claude-fable-5 — returns request_id, poll for result via /api/v1/predictions/{id}/result.

Streaming (SSE)

POST /api/v1/claude-fable-5/stream — returns a live Server-Sent Events stream (Content-Type: text/event-stream). Each chunk has the format data: {"choices":[{"delta":{"content":"text"}}]}, ending with data: [DONE].

See Streaming Documentation for full code examples.

Common Questions

Frequently asked

How is pricing calculated?

Pricing is token-based: $8.00 per million input tokens and $40.00 per million output tokens. The minimum charge per call is $0.001. Actual cost is deducted after each call based on token counts returned by the model.

What is the difference between /claude-fable-5 and /claude-fable-5/stream?

/claude-fable-5 is async — you receive a request_id and poll for the result. /claude-fable-5/stream returns a live SSE stream. Use streaming for chat UIs; use the async endpoint for workflows and automation.

Does Claude Fable 5 support images?

Yes. Pass an image_url in the request body to include an image as part of the user message for multimodal analysis.