モデル候補リスト

コスト重視プロダクト向けの低価格 LLM API モデル

入力価格、出力価格、コンテキスト長、機能、ソース、運用適性で低コスト LLM API モデルを比較します。

この候補リストは何に使う？

低価格 LLM API を選ぶときは、表示価格の安さだけでなく workload の形から考えるべきです。分類、要約、ルーティング、サポート下書き、バッチ変換では、より安いモデルでもアプリのインターフェースを変えずに月額コストを下げられます。一方、最終回答、複雑な推論、コード Agent では、低価格モデルをより強い fallback と並べて検証する必要があります。NextModel は価格、コンテキスト、機能、提供元、コード例を一か所にまとめ、開発者が本番投入前に比較できるようにしています。

ソース基準: NextModel の curated catalog、各 provider の公開価格、および利用可能な OpenRouter metadata。

Blended price

推奨候補低価格 llm api

まず候補リストから始め、実際のプロンプトで試し、本番ルーティング前に月額コストを比較します。

DeepSeekCatalog

DeepSeek: DeepSeek V4 Flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

$0.112 / 1M tokensInput$0.224 / 1M tokensOutput1MContext

Best forlow-cost Chinese tasks, long-context summary, batch code assistance

RoutingConfigured

Tool callingJSON modeLong contextReasoningLow cost

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

Mistral AICatalog

Mistral: Mistral Small 3.2 24B

Mistral-Small-3.2-24B-Instruct-2506 is an updated 24B parameter model from Mistral optimized for instruction following, repetition reduction, and improved function calling. Compared to the 3.1 release, version 3.2 significantly improves accuracy on...

$0.1 / 1M tokensInput$0.3 / 1M tokensOutput128kContext

Best fortranslation, classification, short-form summarization

RoutingConfigured

Tool callingJSON modeStreamingLow costVisionLong context

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

OpenRouterCatalog

OpenAI: GPT-4o-mini

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

$0.15 / 1M tokensInput$0.6 / 1M tokensOutput128kContext

Best forlow-cost chat, image understanding, classification

RoutingConfigured

Tool callingVisionJSON modeLong contextStreamingLow cost

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

MetaCatalog

Meta: Llama 4 Maverick

Llama 4 Maverick 17B Instruct (128E) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

$0.15 / 1M tokensInput$0.6 / 1M tokensOutput1MContext

Best foropen-model workflows, cost-sensitive long context, classification

RoutingConfigured

JSON modeLong contextStreamingLow costTool callingVision

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

比較表

価格、プロバイダー、コンテキスト、機能、ソースで候補を比較します。

本番候補を絞り込むとき、フォールバック方針を作るとき、モデル経済性を比べるときに使います。

Model	Provider	Input	Output	Context	Capabilities	Best for	Latency	Status	Source
DeepSeek: DeepSeek V4 Flashdeepseek/deepseek-v4-flash	DeepSeek	$0.112 / 1M tokens	$0.224 / 1M tokens	1M	Tool callingJSON modeLong contextReasoning	low-cost Chinese tasks, long-context summary	800-2600ms	Catalog	OpenRouter if available
Mistral: Mistral Small 3.2 24Bmistralai/mistral-small-3.2-24b-instruct	Mistral AI	$0.1 / 1M tokens	$0.3 / 1M tokens	128k	Tool callingJSON modeStreamingLow cost	translation, classification	700-2300ms	Catalog	OpenRouter if available
OpenAI: GPT-4o-miniopenai/gpt-4o-mini	OpenRouter	$0.15 / 1M tokens	$0.6 / 1M tokens	128k	Tool callingVisionJSON modeLong context	low-cost chat, image understanding	800-2400ms	Catalog	OpenRouter if available
Meta: Llama 4 Maverickmeta-llama/llama-4-maverick	Meta	$0.15 / 1M tokens	$0.6 / 1M tokens	1M	JSON modeLong contextStreamingLow cost	open-model workflows, cost-sensitive long context	950-2800ms	Catalog	OpenRouter if available
Google: Gemini 2.5 Flashgoogle/gemini-2.5-flash	Google	$0.3 / 1M tokens	$2.50 / 1M tokens	1M	Tool callingVisionJSON modeLong context	long-document summarization, image Q&A	900-2800ms	Catalog	OpenRouter if available
MoonshotAI: Kimi K2.6moonshotai/kimi-k2.6	Moonshot AI	$0.73 / 1M tokens	$3.49 / 1M tokens	262.1k	JSON modeLong contextStreamingTool calling	long Chinese documents, contract review	1400-4400ms	Catalog	OpenRouter if available

FAQ

低価格 LLM API FAQ

この catalog で最も安いモデルはどれですか？

最安候補は為替と出力長によって変わります。Doubao Seed 2.0 Mini はこの catalog で最も低コストな CNY 本番候補です。

常に一番安い LLM API を選ぶべきですか？

いいえ。低価格モデルは反復的で低リスクな処理に向いていますが、最終回答、複雑な推論、コード Agent では、より強いモデルと比較すべきです。

すべてのモデル料金計算 OpenAI互換クイックスタート

コスト重視プロダクト向けの低価格 LLM API モデル

この候補リストは何に使う？

推奨候補 低価格 llm api

DeepSeek: DeepSeek V4 Flash

Mistral: Mistral Small 3.2 24B

OpenAI: GPT-4o-mini

Meta: Llama 4 Maverick

価格、プロバイダー、コンテキスト、機能、ソースで候補を比較します。

低価格 LLM API FAQ

この catalog で最も安いモデルはどれですか？

常に一番安い LLM API を選ぶべきですか？

推奨候補低価格 llm api