MiniMax M3 API pricing, capabilities, context, and OpenAI-compatible code

Input price: ¥2.81 / 1M tokens

Output price: ¥11.23 / 1M tokens

Context length: 128k tokens

Max output: 8.2k tokens

What is MiniMax M3 in NextModel?

MiniMax M3 is a MiniMax model listed in the NextModel catalog for high-volume chat, agentic tool use, and classification workloads. Its listed price is ¥2.81 per 1M input tokens and ¥11.23 per 1M output tokens, with a 128k-token context window.
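As a rough illustration of how the listed rates translate into per-request cost, the sketch below multiplies token counts by the prices above. The function name estimate_cost_cny and the example token counts are hypothetical, and the rates are simply the ones shown on this page, which may change.

```python
# Listed NextModel rates for minimax-m3, in CNY per 1M tokens (copied from this page).
INPUT_PRICE_PER_M = 2.81
OUTPUT_PRICE_PER_M = 11.23


def estimate_cost_cny(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in CNY of one request at the listed rates."""
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Hypothetical request: 2,000 prompt tokens and 500 completion tokens.
print(f"{estimate_cost_cny(2_000, 500):.6f} CNY")
```

Because output tokens cost roughly four times as much as input tokens at these rates, short completions (as in classification) keep the bill dominated by prompt size.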

Best use cases

high-volume chat
agentic tool use
classification

OpenAI-compatible code example

Use the OpenAI SDK as usual: set base_url to the NextModel endpoint and pass the catalog model ID minimax-m3.

Python

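from openai import OpenAI

# Point the standard OpenAI client at the NextModel gateway.
client = OpenAI(
    api_key="YOUR_API_KEY",  # replace with your NextModel API key
    base_url="https://api.nextmodel.app/v1",
)

# Request a chat completion using the catalog model ID.
resp = client.chat.completions.create(
    model="minimax-m3",
    messages=[{"role": "user", "content": "Hello from NextModel"}],
)

# Print the text of the first returned choice.
print(resp.choices[0].message.content)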
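Given the listed 8.2k max output, it can help to cap max_tokens before sending a request. The following is a minimal sketch, not part of the NextModel documentation: build_request is a hypothetical helper, and it assumes that "8.2k" corresponds to 8,192 tokens, since the page gives only the rounded figure.

```python
# Assumption: the listed "8.2k" max output means 8,192 tokens.
MAX_OUTPUT_TOKENS = 8192


def build_request(prompt: str, max_tokens: int = 1024) -> dict:
    """Build keyword arguments for a chat completion call (hypothetical helper)."""
    if max_tokens < 1:
        raise ValueError("max_tokens must be positive")
    return {
        "model": "minimax-m3",
        "messages": [{"role": "user", "content": prompt}],
        # Clamp the requested length to the assumed output limit.
        "max_tokens": min(max_tokens, MAX_OUTPUT_TOKENS),
    }


params = build_request("Classify this ticket: 'refund not received'", max_tokens=20_000)
print(params["max_tokens"])
```

The resulting dictionary can be passed to the client from the example above as client.chat.completions.create(**params).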

Similar alternatives

MiniMax (Production)

MiniMax M2.7

78

MiniMax M2.7 is available only through the Volcengine Agent Plan (no public ARK price list); its listed price tracks the OpenRouter reference rate for the same model. Unlike M3, this model has no cache-hit discount.

Input: ¥1.68 / 1M tokens
Output: ¥6.74 / 1M tokens
Context: 128k

Best for: high-volume chat, agentic tool use, classification

Routing: Configured

Features: Tool calling, JSON mode, Streaming, Low cost

Platform curated: NextModel production gateway; price referenced from OpenRouter minimax/minimax-m2.7


DeepSeek (Production)

DeepSeek V4 Flash

83

DeepSeek V4 Flash is the low-cost, low-latency member of the DeepSeek V4 family, onboarded via the Volcengine Agent Plan.

Input: ¥1 / 1M tokens
Output: ¥2 / 1M tokens
Context: 128k

Best for: high-volume chat, lightweight agent steps, classification

Routing: Configured

Features: Tool calling, JSON mode, Streaming, Low cost

Platform curated: NextModel production gateway and Volcengine Agent Plan pricing config


Volcengine (Production)

Doubao Seed 2.0 Lite

80

Doubao Seed 2.0 Lite is the low-cost long-context member of the Seed 2.0 family, onboarded through the Volcengine Agent Plan. Audio input pricing is not modeled in this catalog.

Input: starting at ¥0.6 / 1M tokens
Output: starting at ¥3.60 / 1M tokens
Context: 256k

Best for: high-volume chat, classification, lightweight agent steps

Routing: Configured

Features: Tool calling, JSON mode, Long context, Streaming

Platform curated: NextModel production gateway and Volcengine Agent Plan pricing config


Compare MiniMax M3

MiniMax M3 vs MiniMax M2.7
MiniMax M3 vs DeepSeek V4 Flash
MiniMax M3 vs Doubao Seed 2.0 Lite
MiniMax M3 vs OpenAI: GPT-4o-mini
MiniMax M3 vs GLM-5.2
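To make the price side of these comparisons concrete, the sketch below ranks the models whose prices appear on this page by estimated cost for one workload. The workload size and the helper name workload_cost are assumptions for illustration. Doubao Seed 2.0 Lite uses its "starting at" rates, so its real cost may be higher, and GPT-4o-mini and GLM-5.2 are omitted because this page does not list their prices.

```python
# CNY per 1M tokens as (input, output), copied from the listings on this page.
PRICES = {
    "MiniMax M3": (2.81, 11.23),
    "MiniMax M2.7": (1.68, 6.74),
    "DeepSeek V4 Flash": (1.00, 2.00),
    "Doubao Seed 2.0 Lite": (0.60, 3.60),  # "starting at" rates
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimated cost in CNY for the given token volumes (hypothetical helper)."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical monthly workload: 50M input tokens and 10M output tokens.
ranked = sorted(PRICES, key=lambda m: workload_cost(m, 50_000_000, 10_000_000))
for model in ranked:
    print(f"{model}: {workload_cost(model, 50_000_000, 10_000_000):.2f} CNY")
```

Price is only one axis: the ranking says nothing about output quality, latency, or the context-length differences between these models.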

FAQ

MiniMax M3 API questions

What is MiniMax M3 best for?

Cost-sensitive high-volume chat, classification, and agentic tool use.

How is MiniMax M3 priced through NextModel?

¥2.81 per 1M input tokens and ¥11.23 per 1M output tokens, tracking MiniMax's OpenRouter reference rate.