Loading...Working on your request
Danh sach rut gon model

Cac model API LLM gia re tot nhat cho san pham nhay cam ve chi phi

So sanh cac model API LLM chi phi thap theo gia input, gia output, boi canh, capability, nguon va do phu hop voi production.

Danh sach rut gon nay dung de lam gi?

Viec chon API LLM gia re nen bat dau tu hinh dang workload, khong chi tu muc gia thap nhat dang hien thi. Doi voi classification, summarization, routing, support draft va batch transformation, mot model re hon co the giam chi phi hang thang ma khong can thay doi giao dien ung dung. Doi voi final answer, reasoning phuc tap hoac coding agent, doi ngu nen so sanh model gia re voi mot fallback manh hon. NextModel tap hop gia, boi canh, capability, nguon provider va vi du ma nguon trong mot noi truoc khi len production.

Co so nguon: Catalog duoc chon loc cua NextModel, gia cong khai tu provider va metadata OpenRouter khi co san.

Blended price

Ung vien de xuat api llm gia re

Bat dau voi danh sach rut gon, thu prompt thuc te va so sanh chi phi hang thang truoc khi routing production.

DeepSeekCatalog

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

$0.112 / 1M tokensInput$0.224 / 1M tokensOutput1MContext
Best forlow-cost Chinese tasks, long-context summary, batch code assistance
RoutingConfigured
Tool callingJSON modeLong contextReasoningLow cost
OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule
View details
Mistral AICatalog

Mistral-Small-3.2-24B-Instruct-2506 is an updated 24B parameter model from Mistral optimized for instruction following, repetition reduction, and improved function calling. Compared to the 3.1 release, version 3.2 significantly improves accuracy on...

$0.1 / 1M tokensInput$0.3 / 1M tokensOutput128kContext
Best fortranslation, classification, short-form summarization
RoutingConfigured
Tool callingJSON modeStreamingLow costVisionLong context
OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule
View details
OpenRouterCatalog

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

$0.15 / 1M tokensInput$0.6 / 1M tokensOutput128kContext
Best forlow-cost chat, image understanding, classification
RoutingConfigured
Tool callingVisionJSON modeLong contextStreamingLow cost
OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule
View details
MetaCatalog

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

$0.15 / 1M tokensInput$0.6 / 1M tokensOutput1MContext
Best foropen-model workflows, cost-sensitive long context, classification
RoutingConfigured
JSON modeLong contextStreamingLow costTool callingVision
OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule
View details

Bang so sanh

So sanh danh sach rut gon theo gia, nha cung cap, context, kha nang va nguon.

Dung giao dien nay de thu hep shortlist production, xay dung chinh sach fallback hoac so sanh kinh te model.

ModelProviderInputOutputContextCapabilitiesBest forLatencyStatusSource
DeepSeek: DeepSeek V4 Flashdeepseek/deepseek-v4-flashDeepSeek$0.112 / 1M tokens$0.224 / 1M tokens1M
Tool callingJSON modeLong contextReasoning
low-cost Chinese tasks, long-context summary800-2600msCatalogOpenRouter if available
Mistral: Mistral Small 3.2 24Bmistralai/mistral-small-3.2-24b-instructMistral AI$0.1 / 1M tokens$0.3 / 1M tokens128k
Tool callingJSON modeStreamingLow cost
translation, classification700-2300msCatalogOpenRouter if available
OpenAI: GPT-4o-miniopenai/gpt-4o-miniOpenRouter$0.15 / 1M tokens$0.6 / 1M tokens128k
Tool callingVisionJSON modeLong context
low-cost chat, image understanding800-2400msCatalogOpenRouter if available
Meta: Llama 4 Maverickmeta-llama/llama-4-maverickMeta$0.15 / 1M tokens$0.6 / 1M tokens1M
JSON modeLong contextStreamingLow cost
open-model workflows, cost-sensitive long context950-2800msCatalogOpenRouter if available
Google: Gemini 2.5 Flashgoogle/gemini-2.5-flashGoogle$0.3 / 1M tokens$2.50 / 1M tokens1M
Tool callingVisionJSON modeLong context
long-document summarization, image Q&A900-2800msCatalogOpenRouter if available
MoonshotAI: Kimi K2.6moonshotai/kimi-k2.6Moonshot AI$0.73 / 1M tokens$3.49 / 1M tokens262.1k
JSON modeLong contextStreamingTool calling
long Chinese documents, contract review1400-4400msCatalogOpenRouter if available

FAQ

API LLM gia re FAQ

Model nao re nhat trong catalog nay?

Dieu nay phu thuoc vao ty gia va do dai output. Doubao Seed 2.0 Mini van la lua chon production bang CNY co chi phi thap nhat trong catalog nay.

Cac doi ngu co nen luon chon API LLM re nhat khong?

Khong. Model gia re phu hop voi cong viec lap lai va rui ro thap; doi voi final answer, reasoning phuc tap va coding agent, can so sanh chung voi cac model manh hon.