Google models

Gemini 2.5 Flash

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Google, OpenRouter if available, Catalog
Tool calling, Vision, JSON mode, Long context, Streaming, Low cost
Input price: $0.30 / 1M tokens
Output price: $2.50 / 1M tokens
Context length: 1M tokens
Availability: Catalog
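As a rough illustration of how the listed prices translate into a per-request cost, the sketch below applies the two per-million-token rates shown above. The function name `estimate_cost_usd` and the example token counts are hypothetical; actual billing may differ (for example, if thinking tokens are counted as output).

```python
# Prices copied from the listing above, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.30
OUTPUT_PRICE_PER_M = 2.50


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical workload: a 200k-token document summarized into 2k tokens.
print(f"${estimate_cost_usd(200_000, 2_000):.4f}")
```

With these assumed counts, the input side accounts for most of the cost even though the output rate is higher per token, which is typical for long-document summarization.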

Use cases

  • long-document summarization
  • image Q&A
  • fast multimodal routing

OpenAI-compatible code example

Keep your OpenAI SDK calls unchanged: point base_url at NextModel and use the model catalog ID gemini-2-5-flash.

Python
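from openai import OpenAI

# Point the standard OpenAI client at the NextModel gateway.
client = OpenAI(
    api_key="YOUR_API_KEY",  # replace with your own key
    base_url="https://api.nextmodel.app/v1",
)

# Request a chat completion using the catalog ID for Gemini 2.5 Flash.
resp = client.chat.completions.create(
    model="gemini-2-5-flash",
    messages=[{"role": "user", "content": "Hello from NextModel"}],
)

# Print the text of the first returned choice.
print(resp.choices[0].message.content)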
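For the image Q&A use case listed earlier, the request body might be built as in the sketch below. It uses the OpenAI chat-completions format for image inputs and assumes the NextModel gateway passes `image_url` content parts through to this model, which this page does not confirm. The helper name `build_image_question` and the example URL are hypothetical.

```python
def build_image_question(question: str, image_url: str) -> list[dict]:
    """Build an OpenAI-style chat message that pairs a question with an image.

    Assumption: the gateway accepts the standard `image_url` content part.
    """
    return [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": question},
                {"type": "image_url", "image_url": {"url": image_url}},
            ],
        }
    ]


# Hypothetical image URL, for illustration only.
messages = build_image_question(
    "What is shown in this picture?",
    "https://example.com/photo.jpg",
)
print(messages[0]["content"][1]["image_url"]["url"])
```

The resulting list can be passed as `messages=` to `client.chat.completions.create(...)` with `model="gemini-2-5-flash"`, in the same way as the text-only call.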

Similar alternatives

Volcengine, Production

Doubao Seed 2.0 Mini is the lowest-cost production model currently exposed through the NextModel public gateway. It is a practical default for Chinese Q&A, classification, summarization, and lightweight multimodal tasks.

Input: ¥0.2 / 1M tokens
Output: ¥2 / 1M tokens
Context: 128k
Best for: Chinese Q&A, low-cost general chat, multimodal understanding
Routing: configured
Tool calling, Vision, JSON mode, Long context
Platform curated: NextModel production gateway and Volcengine pricing config

OpenRouter, Catalog

GPT-4o mini is a mature low-cost multimodal option for teams that already use OpenAI-compatible SDKs and need a balanced default model for product workflows.

Input: $0.15 / 1M tokens
Output: $0.60 / 1M tokens
Context: 128k
Best for: low-cost chat, image understanding, classification
Routing: configured
Tool calling, Vision, JSON mode, Long context
OpenRouter if available: OpenRouter public Models API when available; curated fallback otherwise

Google, Catalog

Gemini 2.5 Pro is a strong long-context and multimodal candidate for teams comparing quality, context length, and vision capability.

Input: $1.25 / 1M tokens
Output: $10 / 1M tokens
Context: 1M
Best for: long-context analysis, vision workflows, scientific reasoning
Routing: configured
Tool calling, Vision, JSON mode, Long context
OpenRouter if available: OpenRouter public Models API when available; curated fallback otherwise

FAQ

Google: Gemini 2.5 Flash API frequently asked questions

Is Gemini 2.5 Flash a low-cost vision option?

Yes. It is categorized as a low-cost multimodal candidate with a large context window.