Mistral AI model

Mistral Small 3.2 24B

Mistral-Small-3.2-24B-Instruct-2506 is an updated 24B parameter model from Mistral optimized for instruction following, repetition reduction, and improved function calling. Compared to the 3.1 release, version 3.2 significantly improves accuracy on...

Read quickstart Estimate cost

Mistral AIOpenRouter if availableCatalog

Tool callingJSON modeStreamingLow costVisionLong context

Input price$0.1 / 1M tokens

Output price$0.3 / 1M tokens

Context length128k tokens

AvailabilityCatalog

Best use cases

translation
classification
short-form summarization

OpenAI-compatible code example

Keep the OpenAI SDK style, set base_url to NextModel, and use the catalog model ID mistral-small-3-2.

Python

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.nextmodel.app/v1"
)

resp = client.chat.completions.create(
    model="mistral-small-3-2",
    messages=[{"role": "user", "content": "Hello from NextModel"}]
)

print(resp.choices[0].message.content)

Similar alternatives

VolcengineProduction

Doubao Seed 2.0 Mini

Doubao Seed 2.0 Mini is the lowest-cost production model currently exposed through the NextModel public gateway. It is a practical default for Chinese Q&A, classification, summarization, and lightweight multimodal tasks.

¥0.2 / 1M tokensInput¥2 / 1M tokensOutput128kContext

Best forChinese Q&A, low-cost general chat, multimodal understanding

Routingconfigured

Tool callingVisionJSON modeLong context

Platform curatedNextModel production gateway and Volcengine pricing config

View details

OpenRouterCatalog

OpenAI: GPT-4o-mini

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

$0.15 / 1M tokensInput$0.6 / 1M tokensOutput128kContext

Best forlow-cost chat, image understanding, classification

Routingconfigured

Tool callingVisionJSON modeLong context

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

MetaCatalog

Meta: Llama 4 Maverick

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

$0.15 / 1M tokensInput$0.6 / 1M tokensOutput1MContext

Best foropen-model workflows, cost-sensitive long context, classification

Routingconfigured

JSON modeLong contextStreamingLow cost

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

FAQ

Mistral: Mistral Small 3.2 24B API questions

What workloads fit Mistral Small?

It is a practical candidate for translation, short summaries, classification, and other cost-sensitive text tasks.