Mistral AI model

Mistral Small 3.2 24B

Mistral-Small-3.2-24B-Instruct-2506 is an updated 24B parameter model from Mistral optimized for instruction following, repetition reduction, and improved function calling. Compared to the 3.1 release, version 3.2 significantly improves accuracy on...

Mistral AIOpenRouter if availableCatalog
Tool callingJSON modeStreamingLow costVisionLong context
Input price$0.1 / 1M tokens
Output price$0.3 / 1M tokens
Context length128k tokens
AvailabilityCatalog

Best use cases

  • translation
  • classification
  • short-form summarization

OpenAI-compatible code example

Keep the OpenAI SDK style, set base_url to NextModel, and use the catalog model ID mistral-small-3-2.

Python
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.nextmodel.app/v1"
)

resp = client.chat.completions.create(
    model="mistral-small-3-2",
    messages=[{"role": "user", "content": "Hello from NextModel"}]
)

print(resp.choices[0].message.content)

Similar alternatives

VolcengineProduction

Doubao Seed 2.0 Mini is the lowest-cost production model currently exposed through the NextModel public gateway. It is a practical default for Chinese Q&A, classification, summarization, and lightweight multimodal tasks.

¥0.2 / 1M tokensInput¥2 / 1M tokensOutput128kContext
Best forChinese Q&A, low-cost general chat, multimodal understanding
Routingconfigured
Tool callingVisionJSON modeLong context
Platform curatedNextModel production gateway and Volcengine pricing config
View details
OpenRouterCatalog

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

$0.15 / 1M tokensInput$0.6 / 1M tokensOutput128kContext
Best forlow-cost chat, image understanding, classification
Routingconfigured
Tool callingVisionJSON modeLong context
OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule
View details
MetaCatalog

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

$0.15 / 1M tokensInput$0.6 / 1M tokensOutput1MContext
Best foropen-model workflows, cost-sensitive long context, classification
Routingconfigured
JSON modeLong contextStreamingLow cost
OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule
View details

FAQ

Mistral: Mistral Small 3.2 24B API questions

What workloads fit Mistral Small?

It is a practical candidate for translation, short summaries, classification, and other cost-sensitive text tasks.