Google model

Gemini 2.5 Flash

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Read quickstart Estimate cost

GoogleOpenRouter if availableCatalog

Tool callingVisionJSON modeLong contextStreamingLow cost

Input price$0.3 / 1M tokens

Output price$2.50 / 1M tokens

Context length1M tokens

AvailabilityCatalog

Best use cases

long-document summarization
image Q&A
fast multimodal routing

OpenAI-compatible code example

Keep the OpenAI SDK style, set base_url to NextModel, and use the catalog model ID gemini-2-5-flash.

Python

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.nextmodel.app/v1"
)

resp = client.chat.completions.create(
    model="gemini-2-5-flash",
    messages=[{"role": "user", "content": "Hello from NextModel"}]
)

print(resp.choices[0].message.content)

Similar alternatives

VolcengineProduction

Doubao Seed 2.0 Mini

Doubao Seed 2.0 Mini is the lowest-cost production model currently exposed through the NextModel public gateway. It is a practical default for Chinese Q&A, classification, summarization, and lightweight multimodal tasks.

¥0.2 / 1M tokensInput¥2 / 1M tokensOutput128kContext

Best forChinese Q&A, low-cost general chat, multimodal understanding

Routingconfigured

Tool callingVisionJSON modeLong context

Platform curatedNextModel production gateway and Volcengine pricing config

View details

OpenRouterCatalog

OpenAI: GPT-4o-mini

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

$0.15 / 1M tokensInput$0.6 / 1M tokensOutput128kContext

Best forlow-cost chat, image understanding, classification

Routingconfigured

Tool callingVisionJSON modeLong context

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

GoogleCatalog

Google: Gemini 2.5 Pro

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

$1.25 / 1M tokensInput$10 / 1M tokensOutput1MContext

Best forlong-context analysis, vision workflows, scientific reasoning

Routingconfigured

Tool callingVisionJSON modeLong context

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

FAQ

Google: Gemini 2.5 Flash API questions

Is Gemini 2.5 Flash a low-cost vision option?

Yes. It is categorized as a low-cost multimodal candidate with a large context window.