模型候選名單

適合台灣團隊的低成本 LLM API 模型

依輸入價格、輸出價格、上下文長度、能力、來源與正式環境適配度，為台灣團隊比較低成本 LLM API 模型。

這份候選名單適合什麼用途？

Cheap LLM API selection should start with workload shape, not only the lowest posted rate. For classification, summarization, routing, support drafts, and batch transformations, a lower-cost model can reduce monthly spend without changing your application interface. For final answers, complex reasoning, or coding agents, teams should benchmark a low-cost model against a stronger fallback. NextModel keeps price, context, capability, provider source, and code examples in one place so developers can make that tradeoff before deployment.

來源依據: NextModel curated catalog, provider public pricing, and OpenRouter metadata when available.

综合价格

按價格、提供者、上下文、能力與來源比較這份候選名單。

當你在縮小正式環境候選名單、建立備援策略或比較模型經濟性時，可使用此視圖。

Model	Provider	Input	Output	Context	Capabilities	Best for	Latency	Status	Source
DeepSeek: DeepSeek V4 Flashdeepseek/deepseek-v4-flash	DeepSeek	$0.112 / 1M tokens	$0.224 / 1M tokens	1M	Tool callingJSON modeLong contextReasoning	low-cost Chinese tasks, long-context summary	800-2600ms	Catalog	OpenRouter if available
Mistral: Mistral Small 3.2 24Bmistralai/mistral-small-3.2-24b-instruct	Mistral AI	$0.1 / 1M tokens	$0.3 / 1M tokens	128k	Tool callingJSON modeStreamingLow cost	translation, classification	700-2300ms	Catalog	OpenRouter if available
OpenAI: GPT-4o-miniopenai/gpt-4o-mini	OpenRouter	$0.15 / 1M tokens	$0.6 / 1M tokens	128k	Tool callingVisionJSON modeLong context	low-cost chat, image understanding	800-2400ms	Catalog	OpenRouter if available
Meta: Llama 4 Maverickmeta-llama/llama-4-maverick	Meta	$0.15 / 1M tokens	$0.6 / 1M tokens	1M	JSON modeLong contextStreamingLow cost	open-model workflows, cost-sensitive long context	950-2800ms	Catalog	OpenRouter if available
Google: Gemini 2.5 Flashgoogle/gemini-2.5-flash	Google	$0.3 / 1M tokens	$2.50 / 1M tokens	1M	Tool callingVisionJSON modeLong context	long-document summarization, image Q&A	900-2800ms	Catalog	OpenRouter if available
MoonshotAI: Kimi K2.6moonshotai/kimi-k2.6	Moonshot AI	$0.73 / 1M tokens	$3.49 / 1M tokens	262.1k	JSON modeLong contextStreamingTool calling	long Chinese documents, contract review	1400-4400ms	Catalog	OpenRouter if available

常見問題

Cheap LLM API 常見問題

What is the cheapest model in this catalog?

The cheapest option depends on currency conversion and output length. Doubao Seed 2.0 Mini is the lowest-cost CNY production entry in this catalog.

Should teams always pick the cheapest LLM API?

No. Use cheap models for repeatable low-risk work, then compare quality against stronger models for final answers, complex reasoning, and coding agents.

全部模型價格計算器 OpenAI 相容快速開始

適合台灣團隊的低成本 LLM API 模型

這份候選名單適合什麼用途？

推薦候選 cheap llm api

DeepSeek: DeepSeek V4 Flash

Mistral: Mistral Small 3.2 24B

OpenAI: GPT-4o-mini

Meta: Llama 4 Maverick

按價格、提供者、上下文、能力與來源比較這份候選名單。

Cheap LLM API 常見問題

What is the cheapest model in this catalog?

Should teams always pick the cheapest LLM API?