Model shortlist

Agent model APIs for tool-calling workflows

Compares model APIs for agent workflows that need tool calling, JSON mode, long context, and budget policies.

What is this shortlist for?

Agent workflows generate a lot of output, so costs can grow quickly. Before routing agent work to a specific model, teams should compare tool calling, JSON support, context length, latency, and output price.

Source basis: NextModel capability mapping and, where available, supported-parameter metadata.

Fit score

Recommended agent model candidates

Start from the shortlist, then test with real prompts and compare monthly cost before routing production traffic.
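The monthly cost comparison can be sketched with a few lines of arithmetic. The per-token prices below are the ones listed on this page; the model IDs come from the comparison table. The workload figures (10,000 runs per month, 20,000 input tokens and 4,000 output tokens per run) and the `monthly_cost` helper are hypothetical and only illustrate the calculation.

```python
# USD per 1M tokens (input, output), as listed on this page.
PRICES = {
    "anthropic/claude-opus-4.7": (5.00, 25.00),
    "anthropic/claude-sonnet-4.5": (3.00, 15.00),
    "google/gemini-2.5-pro": (1.25, 10.00),
    "qwen/qwen3-coder-plus": (0.65, 3.25),
}


def monthly_cost(model, runs_per_month, input_tokens_per_run, output_tokens_per_run):
    """Estimate monthly spend in USD for one model under a fixed workload."""
    input_price, output_price = PRICES[model]
    input_millions = runs_per_month * input_tokens_per_run / 1_000_000
    output_millions = runs_per_month * output_tokens_per_run / 1_000_000
    return round(input_millions * input_price + output_millions * output_price, 2)


if __name__ == "__main__":
    # Hypothetical workload: replace with numbers measured from real prompts.
    for model in PRICES:
        print(model, monthly_cost(model, 10_000, 20_000, 4_000))
```

Under this assumed workload, output tokens account for about half of the bill on most of these models even though they are only a fifth of the input volume, which is why output price deserves its own line in the comparison.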

Anthropic · Catalog

Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on...

Input: $5 / 1M tokens | Output: $25 / 1M tokens | Context: 1M
Best for: frontier reasoning, large codebase review, strategy analysis
Routing: Configured
Capabilities: Tool calling, JSON mode, Long context, Reasoning, Streaming, Vision
Source: OpenRouter if available. OpenRouter public Models API live metadata; public price comes from the registry pricing rule
Anthropic · Catalog

Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with...

Input: $3 / 1M tokens | Output: $15 / 1M tokens | Context: 1M
Best for: coding agents, code review, complex writing
Routing: Configured
Capabilities: Tool calling, JSON mode, Long context, Reasoning, Streaming, Vision
Source: OpenRouter if available. OpenRouter public Models API live metadata; public price comes from the registry pricing rule
Google · Catalog

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

Input: $1.25 / 1M tokens | Output: $10 / 1M tokens | Context: 1M
Best for: long-context analysis, vision workflows, scientific reasoning
Routing: Configured
Capabilities: Tool calling, Vision, JSON mode, Long context, Reasoning, Streaming
Source: OpenRouter if available. OpenRouter public Models API live metadata; public price comes from the registry pricing rule
Alibaba Cloud / Qwen · Catalog

Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...

Input: $0.65 / 1M tokens | Output: $3.25 / 1M tokens | Context: 1M
Best for: Chinese engineering workflows, code generation, codebase Q&A
Routing: Configured
Capabilities: Tool calling, JSON mode, Long context, Streaming
Source: OpenRouter if available. OpenRouter public Models API live metadata; public price comes from the registry pricing rule

Comparison table

Compare candidates by price, provider, context, capabilities, and source.

Use it to narrow down production candidates, build a fallback policy, or compare model economics.

| Model | Provider | Input | Output | Context | Capabilities | Best for | Latency | Status | Source |
| Anthropic: Claude Opus 4.7 (anthropic/claude-opus-4.7) | Anthropic | $5 / 1M tokens | $25 / 1M tokens | 1M | Tool calling, JSON mode, Long context, Reasoning | frontier reasoning, large codebase review | 2300-6800 ms | Catalog | OpenRouter if available |
| Anthropic: Claude Sonnet 4.5 (anthropic/claude-sonnet-4.5) | Anthropic | $3 / 1M tokens | $15 / 1M tokens | 1M | Tool calling, JSON mode, Long context, Reasoning | coding agents, code review | 1600-4800 ms | Catalog | OpenRouter if available |
| Google: Gemini 2.5 Pro (google/gemini-2.5-pro) | Google | $1.25 / 1M tokens | $10 / 1M tokens | 1M | Tool calling, Vision, JSON mode, Long context | long-context analysis, vision workflows | 1500-5000 ms | Catalog | OpenRouter if available |
| Qwen: Qwen3 Coder Plus (qwen/qwen3-coder-plus) | Alibaba Cloud / Qwen | $0.65 / 1M tokens | $3.25 / 1M tokens | 1M | Tool calling, JSON mode, Long context, Streaming | Chinese engineering workflows, code generation | 1200-3900 ms | Catalog | OpenRouter if available |
| Qwen: Qwen3 Max (qwen/qwen3-max) | Alibaba Cloud / Qwen | $0.78 / 1M tokens | $3.90 / 1M tokens | 262.1k | Tool calling, JSON mode, Long context, Reasoning | Chinese agent workflows, business analysis | 1300-4200 ms | Catalog | OpenRouter if available |
| OpenAI: GPT-4o-mini (openai/gpt-4o-mini) | OpenRouter | $0.15 / 1M tokens | $0.6 / 1M tokens | 128k | Tool calling, Vision, JSON mode, Long context | low-cost chat, image understanding | 800-2400 ms | Catalog | OpenRouter if available |
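One way to turn the comparison rows above into a fallback policy is to filter candidates by required capabilities, minimum context, and an output-price ceiling, then order the survivors from cheapest to most expensive. The sketch below is an assumption-laden illustration: the `Candidate` type and `fallback_chain` function are hypothetical, the capability sets contain only the tags shown in the table, and the context sizes treat "1M" as 1,000,000 tokens, "262.1k" as 262,100, and "128k" as 128,000.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    model_id: str
    output_price: float      # USD per 1M output tokens, from the table
    context_tokens: int      # approximate, derived from the Context column
    capabilities: frozenset  # only the tags listed in the table row


CANDIDATES = [
    Candidate("anthropic/claude-opus-4.7", 25.00, 1_000_000,
              frozenset({"tool_calling", "json_mode", "long_context", "reasoning"})),
    Candidate("anthropic/claude-sonnet-4.5", 15.00, 1_000_000,
              frozenset({"tool_calling", "json_mode", "long_context", "reasoning"})),
    Candidate("google/gemini-2.5-pro", 10.00, 1_000_000,
              frozenset({"tool_calling", "vision", "json_mode", "long_context"})),
    Candidate("qwen/qwen3-coder-plus", 3.25, 1_000_000,
              frozenset({"tool_calling", "json_mode", "long_context", "streaming"})),
    Candidate("qwen/qwen3-max", 3.90, 262_100,
              frozenset({"tool_calling", "json_mode", "long_context", "reasoning"})),
    Candidate("openai/gpt-4o-mini", 0.60, 128_000,
              frozenset({"tool_calling", "vision", "json_mode", "long_context"})),
]


def fallback_chain(required, min_context, max_output_price):
    """Return eligible model IDs ordered from cheapest to most expensive output."""
    eligible = [
        c for c in CANDIDATES
        if set(required) <= c.capabilities
        and c.context_tokens >= min_context
        and c.output_price <= max_output_price
    ]
    return [c.model_id for c in sorted(eligible, key=lambda c: c.output_price)]


if __name__ == "__main__":
    # Hypothetical policy: reasoning agents, 200k+ context, at most $20 per 1M output tokens.
    print(fallback_chain({"tool_calling", "json_mode", "reasoning"}, 200_000, 20.0))
```

Ordering by output price is only one possible design choice; a production policy might instead put the strongest model first and fall back to cheaper ones, or weigh the latency ranges from the table as well.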

FAQ

Agent model FAQ

Which capabilities matter most for an agent model?

Tool calling, structured JSON output, long context, and reliable instruction following matter most.
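When testing candidates with real prompts, one simple check for structured JSON output and instruction following is to confirm that each response parses as a well-formed tool call. The sketch below uses only the Python standard library. The `{"name": ..., "arguments": {...}}` shape and the `parse_tool_call` helper are hypothetical and do not represent any provider's actual response format.

```python
import json


def parse_tool_call(raw, allowed_tools):
    """Return (name, arguments) if raw is a well-formed tool call, else None."""
    try:
        payload = json.loads(raw)
    except json.JSONDecodeError:
        return None  # the model did not produce valid JSON
    if not isinstance(payload, dict):
        return None
    name = payload.get("name")
    arguments = payload.get("arguments")
    # Reject unknown tools and non-object arguments.
    if not isinstance(name, str) or name not in allowed_tools:
        return None
    if not isinstance(arguments, dict):
        return None
    return name, arguments


if __name__ == "__main__":
    tools = {"search"}
    print(parse_tool_call('{"name": "search", "arguments": {"q": "pricing"}}', tools))
    print(parse_tool_call("Sure, here is the JSON you asked for", tools))
```

Counting how often each candidate's responses return `None` across a fixed prompt set gives a rough reliability signal to weigh alongside price and latency.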