Zhipu AI (GLM) Models · Hong Kong Edition

GLM-5.2

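A Zhipu AI (GLM) model in the NextModel catalog, available to Hong Kong teams and commonly used for general-purpose reasoning, Chinese-language Q&A, and agentic tool use workloads. The currently listed price is ¥8 per 1M input tokens and ¥28 per 1M output tokens, with a 128k-token context window.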


Zhipu AI (GLM) · Platform curated · Production ready

Tool calling · JSON mode · Streaming · Reasoning

Input price: ¥8 / 1M tokens

Output price: ¥28 / 1M tokens

Context length: 128k tokens

Max output: 8.2k tokens
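The context and output limits listed above can be enforced client-side before a request is sent. The following is a minimal sketch, not a documented NextModel behavior: it assumes the rounded figures on this page correspond to 128,000 and 8,192 tokens, that the context window covers prompt and output together, and that the caller already knows the prompt's token count. The function name `clamp_max_tokens` is hypothetical.

```python
# Limits as listed on this page. The exact token counts are assumptions,
# since the page only shows the rounded values "128k" and "8.2k".
CONTEXT_WINDOW = 128_000
MAX_OUTPUT = 8_192


def clamp_max_tokens(prompt_tokens: int, requested_output: int) -> int:
    """Return an output budget that fits the output cap and the context window."""
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    # Assumption: prompt and output share the same context window.
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt alone fills or exceeds the context window")
    return min(requested_output, MAX_OUTPUT, remaining)
```

The returned value could be passed as `max_tokens` in a chat completion request, so an oversized request is trimmed locally rather than rejected by the gateway.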

What is GLM-5.2 in NextModel?

Use cases

general-purpose reasoning
Chinese-language Q&A
agentic tool use

OpenAI-compatible call example

Keep your OpenAI SDK calls unchanged: point base_url at NextModel and use the model catalog ID glm-5-2.

Python

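from openai import OpenAI

# Point the standard OpenAI client at the NextModel gateway.
client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.nextmodel.app/v1",
)

# Request a chat completion using the catalog ID for GLM-5.2.
resp = client.chat.completions.create(
    model="glm-5-2",
    messages=[{"role": "user", "content": "Hello from NextModel"}],
)

# Print the text of the first returned choice.
print(resp.choices[0].message.content)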
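The capability badges on this page list streaming and JSON mode. The sketch below shows how those options are usually expressed with the OpenAI Python SDK (`stream=True` and `response_format={"type": "json_object"}`). Whether the NextModel gateway honors them exactly as the OpenAI API does is an assumption here, and the helper names `build_request` and `stream_reply` are hypothetical.

```python
def build_request(prompt: str, *, stream: bool = False, json_mode: bool = False) -> dict:
    """Build keyword arguments for client.chat.completions.create()."""
    kwargs = {
        "model": "glm-5-2",  # catalog ID from this page
        "messages": [{"role": "user", "content": prompt}],
        "stream": stream,
    }
    if json_mode:
        # Standard OpenAI SDK option; gateway support is assumed from the badges.
        kwargs["response_format"] = {"type": "json_object"}
    return kwargs


def stream_reply(client, prompt: str) -> str:
    """Print text fragments as they arrive and return the full reply."""
    parts = []
    for chunk in client.chat.completions.create(**build_request(prompt, stream=True)):
        if not chunk.choices:
            continue  # some chunks carry no choices (for example usage-only chunks)
        delta = chunk.choices[0].delta.content
        if delta:
            print(delta, end="", flush=True)
            parts.append(delta)
    return "".join(parts)
```

With the `client` object from the example above, `stream_reply(client, "Hello from NextModel")` would print the reply incrementally. Keeping the request construction in a separate function makes it possible to inspect or test it without a network call.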

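Similar alternatives

Moonshot AI · Production ready

Kimi K2.6

Kimi K2.6 is a candidate model for long-context Chinese-language scenarios, suited to document-heavy teams weighing cost, context length, and coverage of mainland Chinese models.

Input: ¥6.18 / 1M tokens · Output: ¥31.92 / 1M tokens · Context: 128k

Use cases: general-purpose reasoning, Chinese-language Q&A, agentic tool use

Routing configured

Tool calling · JSON mode · Long context · Streaming

Platform curated: NextModel production gateway; price referenced from OpenRouter moonshotai/kimi-k2.6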


Volcengine · Production ready

Doubao Seed 2.0 Code

Doubao Seed 2.0 Code is Volcengine's coding-focused variant, onboarded through the Volcengine Agent Plan for agentic and long-context coding workloads.

Input: starting at ¥3.20 / 1M tokens · Output: starting at ¥16 / 1M tokens · Context: 256k

Use cases: agentic coding, repository-scale refactors, long-context code review

Routing configured

Tool calling · JSON mode · Long context · Streaming

Platform curated: NextModel production gateway and Volcengine Agent Plan pricing config


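Alibaba Cloud / Qwen · Catalog

Qwen: Qwen3 Max

Qwen3 Max is a general-purpose, high-capability Chinese-language model candidate, suited to teams comparing providers inside and outside mainland China.

Input: $0.78 / 1M tokens · Output: $3.90 / 1M tokens · Context: 262.1k

Use cases: Chinese-language agent workflows, business analysis, structured output

Routing configured

Tool calling · JSON mode · Long context · Reasoning

OpenRouter (when available): live metadata from OpenRouter's public Models API; public prices come from the registry's pricing rules.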


Compare with GLM-5.2

GLM-5.2 vs Kimi K2.6
GLM-5.2 vs Doubao Seed 2.0 Code
GLM-5.2 vs Qwen: Qwen3 Max
GLM-5.2 vs Doubao Seed 2.0 Pro
GLM-5.2 vs DeepSeek V4 Flash
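For the GLM-5.2 and Kimi K2.6 pair, the listed prices pull in opposite directions: Kimi K2.6 is cheaper on input and GLM-5.2 is cheaper on output. A rough sketch of the break-even point follows, using only the ¥ prices shown on this page. The function names are hypothetical, and the other models are left out because their prices are listed in USD, as "starting at" values, or not at all.

```python
# Listed prices in CNY per 1M tokens, as (input, output), taken from this page.
PRICES = {
    "GLM-5.2": (8.00, 28.00),
    "Kimi K2.6": (6.18, 31.92),
}


def cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in CNY for the given token counts at the listed prices."""
    in_price, out_price = PRICES[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


def break_even_ratio(a: str, b: str) -> float:
    """Input:output token ratio at which models a and b cost the same.

    Derived from a_in*i + a_out*o == b_in*i + b_out*o.
    Raises ZeroDivisionError if both models share the same input price.
    """
    a_in, a_out = PRICES[a]
    b_in, b_out = PRICES[b]
    return (b_out - a_out) / (a_in - b_in)
```

At these prices the ratio works out to roughly 2.15 input tokens per output token. Workloads that are more input-heavy than that (long documents, short answers) come out cheaper on Kimi K2.6, while more output-heavy workloads come out cheaper on GLM-5.2.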

FAQ

GLM-5.2 API FAQ

What is GLM-5.2 best for?

General-purpose Chinese reasoning and agentic tool-use tasks.

How is GLM-5.2 priced through NextModel?

Flat ¥8 per 1M input tokens and ¥28 per 1M output tokens.
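As a worked example of the flat pricing, a monthly bill can be estimated from average request sizes. This is a sketch under stated assumptions: the traffic figures are illustrative rather than measurements, the function name `monthly_cost` is hypothetical, and any gateway fees or discounts beyond the listed prices are ignored.

```python
INPUT_PRICE = 8.0    # CNY per 1M input tokens, as listed on this page
OUTPUT_PRICE = 28.0  # CNY per 1M output tokens, as listed on this page


def monthly_cost(requests_per_day: int, avg_input: int, avg_output: int, days: int = 30) -> float:
    """Estimate cost in CNY from average per-request token counts."""
    total_in = requests_per_day * avg_input * days
    total_out = requests_per_day * avg_output * days
    return (total_in * INPUT_PRICE + total_out * OUTPUT_PRICE) / 1_000_000


# Illustrative traffic: 10,000 requests/day, 2,000 input and 500 output tokens each.
print(monthly_cost(10_000, 2_000, 500))  # 9000.0
```

In this example, input accounts for ¥4,800 and output for ¥4,200, so output tokens make up nearly half the bill despite being a fifth of the volume.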