OpenRouter model

GPT-4o-mini

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

OpenRouter · OpenRouter if available · Catalog
Tool calling · Vision · JSON mode · Long context · Streaming · Low cost
Input price: $0.15 / 1M tokens
Output price: $0.60 / 1M tokens
Context length: 128k tokens
Availability: Catalog
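
To get a feel for what the listed prices mean per request, the arithmetic can be sketched as follows. The helper name and the sample token counts are illustrative assumptions; only the two rates come from the listing, and actual billing through the gateway may differ.

```python
# Rates copied from the listing above (USD per 1M tokens).
INPUT_USD_PER_1M = 0.15
OUTPUT_USD_PER_1M = 0.60


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Rough per-request cost estimate; hypothetical helper, not a NextModel API."""
    input_cost = input_tokens / 1_000_000 * INPUT_USD_PER_1M
    output_cost = output_tokens / 1_000_000 * OUTPUT_USD_PER_1M
    return input_cost + output_cost


# Illustrative request: 10,000 prompt tokens and 2,000 completion tokens.
cost = estimate_cost_usd(10_000, 2_000)
print(f"${cost:.4f}")  # $0.0027
```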

Best use cases

  • low-cost chat
  • image understanding
  • classification
  • structured extraction
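
For the classification and structured-extraction use cases, one possible shape for the request is sketched below. It assumes the gateway passes the OpenAI Chat Completions `response_format` parameter through unchanged, which this page does not confirm; `build_classification_request` and `parse_label` are hypothetical helper names, and the labels are made up for illustration.

```python
import json
from typing import Optional


def build_classification_request(text: str, labels: list) -> dict:
    """Build keyword arguments for client.chat.completions.create (hypothetical helper)."""
    system_prompt = (
        "Classify the user's message. Respond with a JSON object of the form "
        '{"label": "<one of: ' + ", ".join(labels) + '>"}.'
    )
    return {
        "model": "gpt-4o-mini",
        "messages": [
            {"role": "system", "content": system_prompt},
            {"role": "user", "content": text},
        ],
        # JSON mode in the OpenAI Chat Completions API; the prompt must mention JSON.
        "response_format": {"type": "json_object"},
        "temperature": 0,
    }


def parse_label(raw_content: str, labels: list) -> Optional[str]:
    """Return the label from the model's JSON reply, or None if it is not usable."""
    try:
        label = json.loads(raw_content).get("label")
    except (json.JSONDecodeError, AttributeError):
        return None
    return label if label in labels else None


# Usage, assuming an OpenAI client configured for the gateway:
# labels = ["billing", "bug", "other"]
# resp = client.chat.completions.create(**build_classification_request("I was charged twice", labels))
# print(parse_label(resp.choices[0].message.content, labels))
```

Validating the reply against the allowed labels keeps a malformed or unexpected answer from flowing into downstream code.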

OpenAI-compatible code example

Keep the OpenAI SDK style, set base_url to the NextModel endpoint, and use the catalog model ID gpt-4o-mini.

Python
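from openai import OpenAI

# Point the standard OpenAI client at the NextModel gateway.
client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.nextmodel.app/v1",
)

# Send a single-turn chat request using the catalog model ID.
resp = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello from NextModel"}],
)

# Print the assistant's reply text.
print(resp.choices[0].message.content)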
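
Since the model accepts image inputs, an image-understanding request could look roughly like the sketch below. It uses the OpenAI Chat Completions content-part format (`text` plus `image_url` with a base64 data URL); whether the gateway accepts inline data URLs is an assumption, and `build_image_message` is a hypothetical helper name.

```python
import base64


def build_image_message(question: str, image_bytes: bytes, mime_type: str = "image/png") -> dict:
    """Build one user message that pairs a text question with an inline image."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": question},
            {
                "type": "image_url",
                "image_url": {"url": f"data:{mime_type};base64,{encoded}"},
            },
        ],
    }


# Usage, assuming an OpenAI client configured for the gateway and a local file:
# with open("chart.png", "rb") as f:
#     message = build_image_message("What does this chart show?", f.read())
# resp = client.chat.completions.create(model="gpt-4o-mini", messages=[message])
# print(resp.choices[0].message.content)
```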

Similar alternatives

Google · Catalog

Gemini 2.5 Flash is a lower-cost long-context and vision candidate for teams that need multimodal coverage without always using a premium model.

Input: $0.30 / 1M tokens
Output: $2.50 / 1M tokens
Context: 1M
Best for: long-document summarization, image Q&A, fast multimodal routing
Routing: configured
Tool calling · Vision · JSON mode · Long context
OpenRouter if available: OpenRouter public Models API when available; curated fallback otherwise

Volcengine · Production

Doubao Seed 2.0 Mini is the lowest-cost production model currently exposed through the NextModel public gateway. It is a practical default for Chinese Q&A, classification, summarization, and lightweight multimodal tasks.

Input: ¥0.2 / 1M tokens
Output: ¥2 / 1M tokens
Context: 128k
Best for: Chinese Q&A, low-cost general chat, multimodal understanding
Routing: configured
Tool calling · Vision · JSON mode · Long context
Platform curated: NextModel production gateway and Volcengine pricing config

Google · Catalog

Gemini 2.5 Pro is a strong long-context and multimodal candidate for teams comparing quality, context length, and vision capability.

Input: $1.25 / 1M tokens
Output: $10 / 1M tokens
Context: 1M
Best for: long-context analysis, vision workflows, scientific reasoning
Routing: configured
Tool calling · Vision · JSON mode · Long context
OpenRouter if available: OpenRouter public Models API when available; curated fallback otherwise

FAQ

OpenAI: GPT-4o-mini API questions

When should I choose GPT-4o mini?

Choose it when you need a low-cost, OpenAI-compatible model with mature tool-calling, JSON-mode, and vision support.
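
As a rough illustration of the tool-calling side, the sketch below defines one tool in the OpenAI Chat Completions `tools` format and a local dispatcher for the call the model requests. The `get_weather` tool, its canned result, and `dispatch_tool_call` are all hypothetical; whether the gateway forwards `tools` unchanged is an assumption.

```python
import json

# Hypothetical tool definition in the OpenAI Chat Completions "tools" format.
WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}


def get_weather(city: str) -> dict:
    """Stand-in implementation that returns canned data instead of calling a real service."""
    return {"city": city, "forecast": "unknown (demo stub)"}


def dispatch_tool_call(name: str, arguments_json: str) -> str:
    """Run the named local function with the JSON arguments the model supplied."""
    handlers = {"get_weather": get_weather}
    if name not in handlers:
        raise ValueError(f"Unknown tool: {name}")
    result = handlers[name](**json.loads(arguments_json))
    return json.dumps(result)


# Usage, assuming an OpenAI client configured for the gateway:
# resp = client.chat.completions.create(
#     model="gpt-4o-mini",
#     messages=[{"role": "user", "content": "Weather in Paris?"}],
#     tools=[WEATHER_TOOL],
# )
# for call in resp.choices[0].message.tool_calls or []:
#     print(dispatch_tool_call(call.function.name, call.function.arguments))
```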
