OpenRouter model

GPT-4o-mini

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...


OpenRouter, OpenRouter if available, Catalog

Tool calling, Vision, JSON mode, Long context, Streaming, Low cost

Input price: $0.15 / 1M tokens

Output price: $0.60 / 1M tokens

Context length: 128k tokens

Availability: Catalog
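To make the listed prices concrete, here is a minimal sketch of the per-request arithmetic. The two price constants are copied from the listing above; the function name `estimate_cost_usd` and the example workload are assumptions made for illustration, and actual billing may differ from this simple estimate.

```python
# Prices copied from the listing above (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 0.60


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed prices."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical workload: 10,000 requests of 1,200 input and 300 output tokens each.
per_request = estimate_cost_usd(1_200, 300)
print(f"per request: ${per_request:.6f}")
print(f"10,000 requests: ${per_request * 10_000:.2f}")
```

At these prices the hypothetical workload comes to about $0.00036 per request, or roughly $3.60 for 10,000 requests.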

Best use cases

low-cost chat
image understanding
classification
structured extraction
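The image understanding and structured extraction use cases can be combined in one request. The sketch below only builds the request arguments, using the message shape of the OpenAI Chat Completions API (an `image_url` content part plus `response_format` for JSON mode). Whether the gateway forwards these fields unchanged is an assumption, and the helper names, field names, and example URL are hypothetical.

```python
import base64
import json


def image_data_url(image_bytes: bytes, mime_type: str = "image/png") -> str:
    """Encode raw image bytes as a data URL for inline image input."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


def build_extraction_request(image_url: str, fields: list[str]) -> dict:
    """Build chat.completions keyword arguments for image-based extraction."""
    instruction = (
        "Extract the following fields from the image and reply with a JSON "
        "object using exactly these keys: " + ", ".join(fields)
    )
    return {
        "model": "gpt-4o-mini",
        "response_format": {"type": "json_object"},  # JSON mode
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": instruction},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
    }


# Hypothetical image URL and field names, for illustration only.
request = build_extraction_request(
    "https://example.com/invoice.png", ["invoice_number", "total"]
)
print(json.dumps(request, indent=2))
```

With an OpenAI SDK client configured for the gateway, the dictionary could be passed as `client.chat.completions.create(**request)`, and the reply parsed with `json.loads(resp.choices[0].message.content)`. For a local file, `image_data_url` produces a value that can be used in place of the URL.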

OpenAI-compatible code example

Keep the OpenAI SDK call style, set base_url to the NextModel endpoint, and use the catalog model ID gpt-4o-mini.

Python

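from openai import OpenAI

# Point the standard OpenAI client at the NextModel gateway.
client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.nextmodel.app/v1",
)

# Request a chat completion using the catalog model ID.
resp = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello from NextModel"}],
)

# Print the text of the first returned choice.
print(resp.choices[0].message.content)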

Similar alternatives

Google (Catalog)

Gemini 2.5 Flash

Gemini 2.5 Flash is a lower-cost long-context and vision candidate for teams that need multimodal coverage without always using a premium model.

Input: $0.30 / 1M tokens, Output: $2.50 / 1M tokens, Context: 1M tokens

Best for: long-document summarization, image Q&A, fast multimodal routing

Routing: configured

Tool calling, Vision, JSON mode, Long context

OpenRouter if available: OpenRouter public Models API when available; curated fallback otherwise


Volcengine (Production)

Doubao Seed 2.0 Mini

Doubao Seed 2.0 Mini is the lowest-cost production model currently exposed through the NextModel public gateway. It is a practical default for Chinese Q&A, classification, summarization, and lightweight multimodal tasks.

Input: ¥0.2 / 1M tokens, Output: ¥2 / 1M tokens, Context: 128k tokens

Best for: Chinese Q&A, low-cost general chat, multimodal understanding

Routing: configured

Tool calling, Vision, JSON mode, Long context

Platform curated: NextModel production gateway and Volcengine pricing config


Google (Catalog)

Gemini 2.5 Pro

Gemini 2.5 Pro is a strong long-context and multimodal candidate for teams comparing quality, context length, and vision capability.

Input: $1.25 / 1M tokens, Output: $10 / 1M tokens, Context: 1M tokens

Best for: long-context analysis, vision workflows, scientific reasoning

Routing: configured

Tool calling, Vision, JSON mode, Long context

OpenRouter if available: OpenRouter public Models API when available; curated fallback otherwise


FAQ

OpenAI: GPT-4o-mini API questions

When should I choose GPT-4o mini?

Choose it when you need a low-cost OpenAI-compatible model with mature tool-calling, JSON-mode, and vision support.
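As a rough illustration of the tool support mentioned here, the sketch below defines one tool in the OpenAI function-calling schema and dispatches a model-issued call to a local function. The tool `get_order_status`, its parameters, and the canned result are hypothetical, and the simulated arguments stand in for what a real response would carry.

```python
import json

# Hypothetical tool definition in the OpenAI function-calling schema.
ORDER_STATUS_TOOL = {
    "type": "function",
    "function": {
        "name": "get_order_status",
        "description": "Look up the shipping status of an order by its ID.",
        "parameters": {
            "type": "object",
            "properties": {"order_id": {"type": "string"}},
            "required": ["order_id"],
        },
    },
}


def get_order_status(order_id: str) -> dict:
    """Stand-in for a real lookup; returns canned data."""
    return {"order_id": order_id, "status": "shipped"}


def dispatch_tool_call(name: str, arguments_json: str) -> str:
    """Run the named local function with JSON arguments; return a JSON string."""
    handlers = {"get_order_status": get_order_status}
    if name not in handlers:
        raise KeyError(f"unknown tool: {name}")
    args = json.loads(arguments_json)
    return json.dumps(handlers[name](**args))


# Tool arguments arrive as a JSON string; this call simulates one.
print(dispatch_tool_call("get_order_status", '{"order_id": "A-1001"}'))
```

In a live call, `tools=[ORDER_STATUS_TOOL]` would be passed to `client.chat.completions.create`, and each entry in `resp.choices[0].message.tool_calls` supplies `function.name` and `function.arguments` for the dispatcher. The result is then sent back in a message with role `tool` and the matching `tool_call_id`.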