モデル候補リスト

Best Chinese LLM API models for developer teams

Compare Chinese-language LLM API candidates across domestic and global providers, including pricing, context, latency estimates, and best use cases.

モデルを見るコストを見積もる

この候補リストは何に使う？

Chinese LLM API selection has different constraints from English-only workloads. Teams often need domestic provider coverage, Chinese-language quality, CNY budgeting, long document handling, and predictable API behavior. NextModel compares Chinese-friendly models across source type, price, context, and capability so developers can pick candidates for real business samples before committing production traffic.

ソース基準: NextModel catalog taxonomy, provider public pricing, and OpenRouter metadata when available.

Fit score

推奨候補 chinese llm api

まず候補リストから始め、実際のプロンプトで試し、本番ルーティング前に月額コストを比較します。

DeepSeekCatalog

DeepSeek: R1

DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass....

$0.7 / 1M tokensInput$2.50 / 1M tokensOutput163.8kContext

Best forChinese reasoning, math, analysis

RoutingConfigured

JSON modeLong contextReasoningStreamingTool calling

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

DeepSeekCatalog

DeepSeek: DeepSeek V4 Flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

$0.112 / 1M tokensInput$0.224 / 1M tokensOutput1MContext

Best forlow-cost Chinese tasks, long-context summary, batch code assistance

RoutingConfigured

Tool callingJSON modeLong contextReasoningLow cost

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

Alibaba Cloud / QwenCatalog

Qwen: Qwen3 Coder Plus

Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...

$0.65 / 1M tokensInput$3.25 / 1M tokensOutput1MContext

Best forChinese engineering workflows, code generation, codebase Q&A

RoutingConfigured

Tool callingJSON modeLong contextStreaming

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

Alibaba Cloud / QwenCatalog

Qwen: Qwen3 Max

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...

$0.78 / 1M tokensInput$3.90 / 1M tokensOutput262.1kContext

Best forChinese agent workflows, business analysis, structured output

RoutingConfigured

Tool callingJSON modeLong contextReasoningStreaming

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

比較表

価格、プロバイダー、コンテキスト、機能、ソースで候補を比較します。

本番候補を絞り込むとき、フォールバック方針を作るとき、モデル経済性を比べるときに使います。

Model	Provider	Input	Output	Context	Capabilities	Best for	Latency	Status	Source
DeepSeek: R1deepseek/deepseek-r1	DeepSeek	$0.7 / 1M tokens	$2.50 / 1M tokens	163.8k	JSON modeLong contextReasoningStreaming	Chinese reasoning, math	1800-6000ms	Catalog	OpenRouter if available
DeepSeek: DeepSeek V4 Flashdeepseek/deepseek-v4-flash	DeepSeek	$0.112 / 1M tokens	$0.224 / 1M tokens	1M	Tool callingJSON modeLong contextReasoning	low-cost Chinese tasks, long-context summary	800-2600ms	Catalog	OpenRouter if available
Qwen: Qwen3 Coder Plusqwen/qwen3-coder-plus	Alibaba Cloud / Qwen	$0.65 / 1M tokens	$3.25 / 1M tokens	1M	Tool callingJSON modeLong contextStreaming	Chinese engineering workflows, code generation	1200-3900ms	Catalog	OpenRouter if available
Qwen: Qwen3 Maxqwen/qwen3-max	Alibaba Cloud / Qwen	$0.78 / 1M tokens	$3.90 / 1M tokens	262.1k	Tool callingJSON modeLong contextReasoning	Chinese agent workflows, business analysis	1300-4200ms	Catalog	OpenRouter if available
MoonshotAI: Kimi K2.6moonshotai/kimi-k2.6	Moonshot AI	$0.73 / 1M tokens	$3.49 / 1M tokens	262.1k	JSON modeLong contextStreamingTool calling	long Chinese documents, contract review	1400-4400ms	Catalog	OpenRouter if available

FAQ

Chinese LLM API FAQ

Which model should I test first for Chinese support workloads?

Start with Doubao Seed 2.0 Mini for high-volume low-cost Chinese tasks, then compare DeepSeek, Qwen, or Kimi for reasoning and long documents.

Can one gateway cover domestic and global models?

Yes. The public site positions NextModel as one interface for domestic and global model sources, with source labels instead of partnership claims.

すべてのモデル料金計算 OpenAI互換クイックスタート