MiniMax model

MiniMax M3

MiniMax M3 is a MiniMax model listed in the NextModel catalog for high-volume chat, agentic tool use, and classification workloads. Its listed price is ¥2.81 per 1M input tokens and ¥11.23 per 1M output tokens, with a 128k-token context window.

MiniMax | Platform curated | Production
Tool calling | JSON mode | Streaming | Low cost
Input price: ¥2.81 / 1M tokens
Output price: ¥11.23 / 1M tokens
Context length: 128k tokens
Max output: 8.2k tokens


Best use cases

  • high-volume chat
  • agentic tool use
  • classification

OpenAI-compatible code example

Use the OpenAI SDK as usual: set base_url to the NextModel endpoint and pass the catalog model ID minimax-m3.

Python
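from openai import OpenAI

# Point the standard OpenAI client at the NextModel gateway.
client = OpenAI(
    api_key="YOUR_API_KEY",  # replace with your own API key
    base_url="https://api.nextmodel.app/v1",
)

# Request a single chat completion from MiniMax M3 by its catalog model ID.
resp = client.chat.completions.create(
    model="minimax-m3",
    messages=[{"role": "user", "content": "Hello from NextModel"}],
)

# Print the assistant's reply text.
print(resp.choices[0].message.content)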
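
The catalog tags MiniMax M3 with Streaming, so a streamed variant of the request above may be useful. The following is a minimal sketch, not a confirmed NextModel recipe. It assumes the gateway returns chunks in the same shape as the OpenAI SDK when stream=True is passed (each chunk exposing choices[0].delta.content). The helper names collect_stream and stream_reply are hypothetical and introduced only for illustration.

```python
from typing import Any, Iterable


def collect_stream(chunks: Iterable[Any]) -> str:
    """Join the text deltas from a stream of chat completion chunks.

    Assumes OpenAI-style chunks: chunk.choices[0].delta.content holds the
    next piece of text, or None for chunks that carry no text.
    """
    parts = []
    for chunk in chunks:
        if not chunk.choices:  # some chunks may carry no choices at all
            continue
        delta = chunk.choices[0].delta.content
        if delta:
            parts.append(delta)
    return "".join(parts)


def stream_reply(client: Any, prompt: str, model: str = "minimax-m3") -> str:
    """Send one user message with stream=True and return the full reply.

    `client` is expected to be an openai.OpenAI instance configured with the
    NextModel base_url, as in the example above.
    """
    stream = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": prompt}],
        stream=True,  # assumption: the gateway honors OpenAI-style streaming
    )
    return collect_stream(stream)
```

With the client from the example above, stream_reply(client, "Hello from NextModel") would return the assembled reply. To show text as it arrives, print each delta inside the loop instead of collecting it.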

Similar alternatives

MiniMax | Production

MiniMax M2.7 is available only through the Volcengine Agent Plan (no public ARK price list); its listed price tracks the OpenRouter reference rate for the same model. Unlike M3, this model has no cache-hit discount.

Input: ¥1.68 / 1M tokens | Output: ¥6.74 / 1M tokens | Context: 128k
Best for: high-volume chat, agentic tool use, classification
Routing: Configured
Tool calling | JSON mode | Streaming | Low cost
Platform curated: NextModel production gateway; price referenced from OpenRouter minimax/minimax-m2.7
DeepSeek | Production

DeepSeek V4 Flash is the low-cost, low-latency member of the DeepSeek V4 family, onboarded via the Volcengine Agent Plan.

Input: ¥1 / 1M tokens | Output: ¥2 / 1M tokens | Context: 128k
Best for: high-volume chat, lightweight agent steps, classification
Routing: Configured
Tool calling | JSON mode | Streaming | Low cost
Platform curated: NextModel production gateway and Volcengine Agent Plan pricing config
Volcengine | Production

Doubao Seed 2.0 Lite is the low-cost long-context member of the Seed 2.0 family, onboarded through the Volcengine Agent Plan. Audio input pricing is not modeled in this catalog.

Input: starting at ¥0.6 / 1M tokens | Output: starting at ¥3.60 / 1M tokens | Context: 256k
Best for: high-volume chat, classification, lightweight agent steps
Routing: Configured
Tool calling | JSON mode | Long context | Streaming
Platform curated: NextModel production gateway and Volcengine Agent Plan pricing config
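
To compare the listed rates on this page for a concrete workload, the arithmetic can be sketched as below. This is a rough estimate only. The prices are copied from this page, the workload of 50M input and 10M output tokens is a hypothetical example, and the Doubao Seed 2.0 Lite figures are "starting at" rates, so its result is a lower bound. Cache-hit discounts and any tiered pricing are not modeled, and estimate_cost is a hypothetical helper name.

```python
# Listed prices in CNY per 1M tokens (input, output), copied from this page.
# Doubao Seed 2.0 Lite is listed as "Starting at", so treat it as a lower bound.
PRICES = {
    "MiniMax M3": (2.81, 11.23),
    "MiniMax M2.7": (1.68, 6.74),
    "DeepSeek V4 Flash": (1.00, 2.00),
    "Doubao Seed 2.0 Lite": (0.60, 3.60),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in CNY for a workload at the listed rates."""
    in_price, out_price = PRICES[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


if __name__ == "__main__":
    # Hypothetical monthly workload: 50M input tokens, 10M output tokens.
    workload = (50_000_000, 10_000_000)
    for name in sorted(PRICES, key=lambda m: estimate_cost(m, *workload)):
        print(f"{name}: ¥{estimate_cost(name, *workload):.2f}")
```

At these listed rates the example workload comes to about ¥252.80 on MiniMax M3, ¥151.40 on MiniMax M2.7, ¥70.00 on DeepSeek V4 Flash, and at least ¥66.00 on Doubao Seed 2.0 Lite. Output tokens dominate the difference, so the ranking can shift for workloads with a different input-to-output ratio.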

FAQ

MiniMax M3 API questions

What is MiniMax M3 best for?

Cost-sensitive high-volume chat, classification, and agentic tool use.

How is MiniMax M3 priced through NextModel?

¥2.81 per 1M input tokens and ¥11.23 per 1M output tokens, tracking MiniMax's OpenRouter reference rate.