Google model

Gemini 2.5 Flash

Google: Gemini 2.5 Flash is a Google model listed in the NextModel catalog for long-document summarization, image Q&A, and fast multimodal routing workloads. Its listed price is $0.30 / 1M input tokens and $2.50 / 1M output tokens, with a 1M-token context window.

Google, OpenRouter if available, Catalog

Tool calling, Vision, JSON mode, Long context, Streaming, Low cost

Input price: $0.30 / 1M tokens
Output price: $2.50 / 1M tokens
Context length: 1M tokens
Max output: 8.2k tokens
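The listed prices and limits translate directly into a per-request estimate. The following is a minimal sketch, not an official NextModel calculator: the function names are hypothetical, the exact token limits are rounded from the "1M" and "8.2k" figures above, and it assumes input and output tokens share the context window.

```python
# Listed catalog prices for gemini-2-5-flash, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.30
OUTPUT_PRICE_PER_M = 2.50

# Assumed exact values for the rounded "1M" and "8.2k" figures in the catalog.
CONTEXT_LIMIT = 1_000_000
MAX_OUTPUT = 8_200


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed prices."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


def fits_limits(input_tokens: int, output_tokens: int) -> bool:
    """Check a request against the listed context and max-output limits.

    Assumes input and output tokens both count toward the context window.
    """
    return (output_tokens <= MAX_OUTPUT
            and input_tokens + output_tokens <= CONTEXT_LIMIT)


# Example: summarizing a 200,000-token document into a 2,000-token summary.
cost = estimate_cost(200_000, 2_000)
print(f"${cost:.4f}")  # $0.0650 (0.06 for input + 0.005 for output)
```

Actual billing may differ from this estimate, for example if the provider counts tokens differently from your local tokenizer.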

What is Gemini 2.5 Flash in NextModel?

Best use cases

long-document summarization
image Q&A
fast multimodal routing

OpenAI-compatible code example

Keep the OpenAI SDK style, set base_url to NextModel, and use the catalog model ID gemini-2-5-flash.

Python

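from openai import OpenAI

# Point the standard OpenAI client at the NextModel endpoint.
client = OpenAI(
    api_key="YOUR_API_KEY",  # replace with your own API key
    base_url="https://api.nextmodel.app/v1",
)

# Request a chat completion using the catalog model ID.
resp = client.chat.completions.create(
    model="gemini-2-5-flash",
    messages=[{"role": "user", "content": "Hello from NextModel"}],
)

# Print the text of the first returned choice.
print(resp.choices[0].message.content)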
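For the image Q&A use case, a request would plausibly carry an image alongside the question. The sketch below uses the OpenAI chat format for image inputs (a content list with `text` and `image_url` parts). Whether the NextModel endpoint accepts this format for gemini-2-5-flash is an assumption to verify, and the helper names and the example URL are hypothetical.

```python
def build_image_question(question: str, image_url: str) -> list[dict]:
    """Build a chat `messages` list with one text part and one image part."""
    return [{
        "role": "user",
        "content": [
            {"type": "text", "text": question},
            {"type": "image_url", "image_url": {"url": image_url}},
        ],
    }]


def ask_about_image(client, question: str, image_url: str) -> str:
    """Send the question through `client`, an OpenAI client pointed at NextModel."""
    resp = client.chat.completions.create(
        model="gemini-2-5-flash",
        messages=build_image_question(question, image_url),
    )
    return resp.choices[0].message.content


# Build the payload without sending it; the URL is a placeholder.
messages = build_image_question(
    "What is shown in this chart?",
    "https://example.com/chart.png",
)
```

Passing the `client` from the example above to `ask_about_image` would send the request. Keeping the message builder separate makes the payload easy to inspect before any tokens are billed.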

Similar alternatives

OpenRouter, Catalog

OpenAI: GPT-4o-mini

GPT-4o mini is OpenAI's newest model after GPT-4 Omni, supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

Input: $0.15 / 1M tokens, Output: $0.60 / 1M tokens, Context: 128k

Best for: low-cost chat, image understanding, classification

Routing: Configured

Tool calling, Vision, JSON mode, Long context

OpenRouter if available: OpenRouter public Models API live metadata; public price comes from the registry pricing rule

Google, Catalog

Google: Gemini 2.5 Pro

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

Input: $1.25 / 1M tokens, Output: $10.00 / 1M tokens, Context: 1M

Best for: long-context analysis, vision workflows, scientific reasoning

Routing: Configured

Tool calling, Vision, JSON mode, Long context

OpenRouter if available: OpenRouter public Models API live metadata; public price comes from the registry pricing rule

Moonshot AI, Catalog

MoonshotAI: Kimi K2.6

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and...

Input: $0.73 / 1M tokens, Output: $3.49 / 1M tokens, Context: 262.1k

Best for: long Chinese documents, contract review, knowledge-base Q&A

Routing: Configured

JSON mode, Long context, Streaming, Tool calling

OpenRouter if available: OpenRouter public Models API live metadata; public price comes from the registry pricing rule

Compare Google: Gemini 2.5 Flash

Google: Gemini 2.5 Flash vs OpenAI: GPT-4o-mini
Google: Gemini 2.5 Flash vs Google: Gemini 2.5 Pro
Google: Gemini 2.5 Flash vs MoonshotAI: Kimi K2.6
Google: Gemini 2.5 Flash vs Meta: Llama 4 Maverick
Google: Gemini 2.5 Flash vs DeepSeek: DeepSeek V4 Flash
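
As a rough illustration of how these comparisons play out on price alone, the sketch below ranks the four models whose prices appear on this page for one hypothetical workload. Llama 4 Maverick and DeepSeek V4 Flash are left out because their prices are not listed here. The workload size and the function name are assumptions, and price is only one of several factors worth comparing.

```python
# (input, output) listed prices in USD per 1M tokens, copied from this page.
PRICES = {
    "Google: Gemini 2.5 Flash": (0.30, 2.50),
    "OpenAI: GPT-4o-mini": (0.15, 0.60),
    "Google: Gemini 2.5 Pro": (1.25, 10.00),
    "MoonshotAI: Kimi K2.6": (0.73, 3.49),
}


def workload_cost(prices: tuple[float, float],
                  input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request given (input, output) prices per 1M tokens."""
    input_price, output_price = prices
    return (input_tokens * input_price
            + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 50,000 input tokens and 1,000 output tokens per request.
costs = {name: workload_cost(p, 50_000, 1_000) for name, p in PRICES.items()}

# Sort from cheapest to most expensive for this workload.
ranked = sorted(costs.items(), key=lambda item: item[1])
for name, cost in ranked:
    print(f"{name}: ${cost:.4f}")
```

The ranking shifts with the input-to-output ratio, so it is worth rerunning with token counts from your own traffic.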

FAQ

Google: Gemini 2.5 Flash API questions

Is Gemini 2.5 Flash a low-cost vision option?

Yes. The catalog tags it as Low cost and Vision, with a listed price of $0.30 / 1M input tokens and $2.50 / 1M output tokens and a 1M-token context window.