模型候選名單

Best agent model APIs for tool-calling workflows

Compare model APIs for agent workflows that need tool calling, JSON mode, long context, and budget policies.

這份候選名單適合什麼用途？

Agent workflows are output-heavy and can become expensive quickly. Teams should compare tool calling, JSON support, context length, latency, and output price before routing agent tasks to a model.

來源依據: NextModel capability mapping and supported-parameter metadata when available.

匹配分

推薦候選 agent models

先從候選名單開始，再以真實提示詞測試，並在接入正式環境路由前比較月度成本。

AnthropicCatalog

Anthropic: Claude Opus 4.7

Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on...

$5 / 1M tokensInput$25 / 1M tokensOutput1MContext

Best forfrontier reasoning, large codebase review, strategy analysis

RoutingConfigured

Tool callingJSON modeLong contextReasoningStreamingVision

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

AnthropicCatalog

Anthropic: Claude Sonnet 4.5

Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with...

$3 / 1M tokensInput$15 / 1M tokensOutput1MContext

Best forcoding agents, code review, complex writing

RoutingConfigured

Tool callingJSON modeLong contextReasoningStreamingVision

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

GoogleCatalog

Google: Gemini 2.5 Pro

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

$1.25 / 1M tokensInput$10 / 1M tokensOutput1MContext

Best forlong-context analysis, vision workflows, scientific reasoning

RoutingConfigured

Tool callingVisionJSON modeLong contextReasoningStreaming

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

Alibaba Cloud / QwenCatalog

Qwen: Qwen3 Coder Plus

Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...

$0.65 / 1M tokensInput$3.25 / 1M tokensOutput1MContext

Best forChinese engineering workflows, code generation, codebase Q&A

RoutingConfigured

Tool callingJSON modeLong contextStreaming

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

比較表

按價格、提供者、上下文、能力與來源比較這份候選名單。

當你在縮小正式環境候選名單、建立備援策略或比較模型經濟性時，可使用此視圖。

Model	Provider	Input	Output	Context	Capabilities	Best for	Latency	Status	Source
Anthropic: Claude Opus 4.7anthropic/claude-opus-4.7	Anthropic	$5 / 1M tokens	$25 / 1M tokens	1M	Tool callingJSON modeLong contextReasoning	frontier reasoning, large codebase review	2300-6800ms	Catalog	OpenRouter if available
Anthropic: Claude Sonnet 4.5anthropic/claude-sonnet-4.5	Anthropic	$3 / 1M tokens	$15 / 1M tokens	1M	Tool callingJSON modeLong contextReasoning	coding agents, code review	1600-4800ms	Catalog	OpenRouter if available
Google: Gemini 2.5 Progoogle/gemini-2.5-pro	Google	$1.25 / 1M tokens	$10 / 1M tokens	1M	Tool callingVisionJSON modeLong context	long-context analysis, vision workflows	1500-5000ms	Catalog	OpenRouter if available
Qwen: Qwen3 Coder Plusqwen/qwen3-coder-plus	Alibaba Cloud / Qwen	$0.65 / 1M tokens	$3.25 / 1M tokens	1M	Tool callingJSON modeLong contextStreaming	Chinese engineering workflows, code generation	1200-3900ms	Catalog	OpenRouter if available
Qwen: Qwen3 Maxqwen/qwen3-max	Alibaba Cloud / Qwen	$0.78 / 1M tokens	$3.90 / 1M tokens	262.1k	Tool callingJSON modeLong contextReasoning	Chinese agent workflows, business analysis	1300-4200ms	Catalog	OpenRouter if available
OpenAI: GPT-4o-miniopenai/gpt-4o-mini	OpenRouter	$0.15 / 1M tokens	$0.6 / 1M tokens	128k	Tool callingVisionJSON modeLong context	low-cost chat, image understanding	800-2400ms	Catalog	OpenRouter if available

常見問題

Agent models 常見問題

Which capabilities matter most for agent models?

Tool calling, structured JSON output, long context, and reliable instruction following matter most.

全部模型價格計算器 OpenAI 相容快速開始