MiniMax M3
MiniMax M3 is a MiniMax model listed in the NextModel catalog for high-volume chat, agentic tool use, classification workloads. Its listed price is ¥2.81 / 1M tokens input and ¥11.23 / 1M tokens output per 1M tokens, with a 128k token context window.
What is MiniMax M3 in NextModel?
MiniMax M3 is a MiniMax model listed in the NextModel catalog for high-volume chat, agentic tool use, classification workloads. Its listed price is ¥2.81 / 1M tokens input and ¥11.23 / 1M tokens output per 1M tokens, with a 128k token context window.
Best use cases
- high-volume chat
- agentic tool use
- classification
OpenAI-compatible code example
Keep the OpenAI SDK style, set base_url to NextModel, and use the catalog model ID minimax-m3.
from openai import OpenAI
client = OpenAI(
api_key="YOUR_API_KEY",
base_url="https://api.nextmodel.app/v1"
)
resp = client.chat.completions.create(
model="minimax-m3",
messages=[{"role": "user", "content": "Hello from NextModel"}]
)
print(resp.choices[0].message.content)Similar alternatives
MiniMax M2.7 is available only through the Volcengine Agent Plan (no public ARK price list); its listed price tracks the OpenRouter reference rate for the same model. Unlike M3, this model has no cache-hit discount.
DeepSeek V4 Flash is the low-cost, low-latency member of the DeepSeek V4 family, onboarded via the Volcengine Agent Plan.
Doubao Seed 2.0 Lite is the low-cost long-context member of the Seed 2.0 family, onboarded through the Volcengine Agent Plan. Audio input pricing is not modeled in this catalog.
FAQ
MiniMax M3 API questions
What is MiniMax M3 best for?
Cost-sensitive high-volume chat, classification, and agentic tool use.
How is MiniMax M3 priced through NextModel?
¥2.81 per 1M input tokens and ¥11.23 per 1M output tokens, tracking MiniMax's OpenRouter reference rate.