Pay as you go
Model rates
Start with the input and output token prices for each model.
- No large upfront payment
- Estimate before launch
- OpenAI-compatible requests
If you're comparing models for a live product, use the calculator first, then choose the plan that fits your spend pattern.
Credits
Prepaid balance
Make spending more predictable for experiments and small teams.
Team
Managed usage
Manage projects, keys, budgets, and model policies for production teams.
BYOK
Bring your own keys
Bring your existing provider accounts together in a single comparison and governance layer.
Enterprise
Custom
Private commercial terms for high-volume workloads or strict governance requirements.
Calculator
Use this as a pre-production estimate. Final billing should be reconciled against provider and platform usage data.
Cost = requests × ((input tokens × input price) + (output tokens × output price)) / 1,000,000, where prices are quoted per 1M tokens.
The standard Doubao Seed 2.0 Mini estimate for 1M input and 1M output tokens is ¥2.20.
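The formula above can be sketched in a few lines of Python. This is a minimal illustration, not the platform's calculator: the function name `estimate_cost` and its parameter names are hypothetical, and `Decimal` is used only to avoid floating-point rounding in currency amounts.

```python
from decimal import Decimal

# Prices are quoted per 1M tokens, so the total is divided by 1,000,000.
TOKENS_PER_PRICE_UNIT = Decimal(1_000_000)


def estimate_cost(requests, input_tokens, output_tokens, input_price, output_price):
    """Hypothetical helper: cost for `requests` calls, with token counts per request.

    Prices are passed as strings (e.g. "0.2") so Decimal parses them exactly.
    """
    per_request = (
        input_tokens * Decimal(input_price)
        + output_tokens * Decimal(output_price)
    )
    return requests * per_request / TOKENS_PER_PRICE_UNIT


# Doubao Seed 2.0 Mini: one request with 1M input and 1M output tokens.
doubao = estimate_cost(1, 1_000_000, 1_000_000, "0.2", "2")
print(f"¥{doubao:.2f}")  # ¥2.20
```

The result matches the ¥2.20 reference estimate: ¥0.20 for input plus ¥2.00 for output.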
Estimate monthly spend from model price, token counts, and request volume.
AI API cost is estimated by multiplying the number of requests by the input and output tokens per request, then applying each model's public price per 1M tokens. Before routing production traffic, a team should cost out a low-cost model, a quality fallback, and the expected monthly volume.
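As a rough sketch of that comparison, the snippet below costs a month of traffic on a low-cost model, on a fallback model, and on a mix of the two. The prices come from the model table on this page (Mistral Small 3.2 24B and Gemini 2.5 Flash). The workload figures (100,000 requests, 1,500 input and 400 output tokens per request, 10,000 requests sent to the fallback) and the function name `monthly_cost` are assumptions chosen for illustration only.

```python
from decimal import Decimal


def monthly_cost(requests, input_tokens, output_tokens, input_price, output_price):
    """Hypothetical helper: monthly cost from per-request tokens and per-1M prices."""
    per_request = (
        input_tokens * Decimal(input_price)
        + output_tokens * Decimal(output_price)
    )
    return requests * per_request / Decimal(1_000_000)


# Assumed workload, not taken from the page.
REQUESTS = 100_000
FALLBACK_REQUESTS = 10_000  # assumed share of traffic that needs the fallback
INPUT_TOKENS, OUTPUT_TOKENS = 1_500, 400

# (input price, output price) in USD per 1M tokens, from the model table.
LOW_COST = ("0.1", "0.3")   # mistralai/mistral-small-3.2-24b-instruct
FALLBACK = ("0.3", "2.50")  # google/gemini-2.5-flash

low_only = monthly_cost(REQUESTS, INPUT_TOKENS, OUTPUT_TOKENS, *LOW_COST)
fallback_only = monthly_cost(REQUESTS, INPUT_TOKENS, OUTPUT_TOKENS, *FALLBACK)
blended = (
    monthly_cost(REQUESTS - FALLBACK_REQUESTS, INPUT_TOKENS, OUTPUT_TOKENS, *LOW_COST)
    + monthly_cost(FALLBACK_REQUESTS, INPUT_TOKENS, OUTPUT_TOKENS, *FALLBACK)
)

print(f"low-cost only: ${low_only:.2f}")       # $27.00
print(f"fallback only: ${fallback_only:.2f}")  # $145.00
print(f"blended:       ${blended:.2f}")        # $38.80
```

Under these assumed numbers, routing a tenth of the traffic to the fallback raises the monthly estimate from $27.00 to $38.80, which is still well below running everything on the fallback model.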
Run CacheSafety Bench before enabling a cache policy in production. Bad Hit Rate matters more than the raw hit rate.
Low-cost reference
Price is only one parameter. Before going to production, check context length, capabilities, source labels, and planned use cases.
| Model | Provider | Input | Output | Context | Capabilities | Best for | Latency | Status | Source |
|---|---|---|---|---|---|---|---|---|---|
| Doubao Seed 2.0 Mini (`doubao-seed-2-0-mini`) | Volcengine | ¥0.2 / 1M tokens | ¥2 / 1M tokens | 128k | Streaming, JSON mode | Coding | 900-2600ms | Catalog | Platform curated |
| DeepSeek: DeepSeek V4 Flash (`deepseek/deepseek-v4-flash`) | DeepSeek | $0.112 / 1M tokens | $0.224 / 1M tokens | 1M | Tool calling, JSON mode, Long context, Reasoning | low-cost Chinese tasks, long-context summary | 800-2600ms | Catalog | OpenRouter if available |
| Mistral: Mistral Small 3.2 24B (`mistralai/mistral-small-3.2-24b-instruct`) | Mistral AI | $0.1 / 1M tokens | $0.3 / 1M tokens | 128k | Tool calling, JSON mode, Streaming, Low cost | translation, classification | 700-2300ms | Catalog | OpenRouter if available |
| OpenAI: GPT-4o-mini (`openai/gpt-4o-mini`) | OpenRouter | $0.15 / 1M tokens | $0.6 / 1M tokens | 128k | Tool calling, Vision, JSON mode, Long context | low-cost chat, image understanding | 800-2400ms | Catalog | OpenRouter if available |
| Meta: Llama 4 Maverick (`meta-llama/llama-4-maverick`) | Meta | $0.15 / 1M tokens | $0.6 / 1M tokens | 1M | JSON mode, Long context, Streaming, Low cost | open-model workflows, cost-sensitive long context | 950-2800ms | Catalog | OpenRouter if available |
| Google: Gemini 2.5 Flash (`google/gemini-2.5-flash`) | | $0.3 / 1M tokens | $2.50 / 1M tokens | 1M | Tool calling, Vision, JSON mode, Long context | long-document summarization, image Q&A | 900-2800ms | Catalog | OpenRouter if available |
| DeepSeek: R1 (`deepseek/deepseek-r1`) | DeepSeek | $0.7 / 1M tokens | $2.50 / 1M tokens | 163.8k | JSON mode, Long context, Reasoning, Streaming | Chinese reasoning, math | 1800-6000ms | Catalog | OpenRouter if available |
| Qwen: Qwen3 Coder Plus (`qwen/qwen3-coder-plus`) | Alibaba Cloud / Qwen | $0.65 / 1M tokens | $3.25 / 1M tokens | 1M | Tool calling, JSON mode, Long context, Streaming | Chinese engineering workflows, code generation | 1200-3900ms | Catalog | OpenRouter if available |
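Because price is only one of the columns, a shortlist usually starts by filtering on capabilities and then ranking by cost. The sketch below does that for a subset of rows copied from the table above. The `CATALOG` structure and the `shortlist` function are hypothetical and not a platform API. The ranking uses the same 1M input + 1M output reference estimate as the calculator, and it never compares prices across currencies.

```python
from decimal import Decimal

# Subset of table rows: (model id, currency, input price, output price, capabilities).
# Prices are per 1M tokens; the layout of this list is an assumption for illustration.
CATALOG = [
    ("doubao-seed-2-0-mini", "CNY", "0.2", "2",
     {"Streaming", "JSON mode"}),
    ("deepseek/deepseek-v4-flash", "USD", "0.112", "0.224",
     {"Tool calling", "JSON mode", "Long context", "Reasoning"}),
    ("mistralai/mistral-small-3.2-24b-instruct", "USD", "0.1", "0.3",
     {"Tool calling", "JSON mode", "Streaming", "Low cost"}),
    ("openai/gpt-4o-mini", "USD", "0.15", "0.6",
     {"Tool calling", "Vision", "JSON mode", "Long context"}),
    ("meta-llama/llama-4-maverick", "USD", "0.15", "0.6",
     {"JSON mode", "Long context", "Streaming", "Low cost"}),
    ("google/gemini-2.5-flash", "USD", "0.3", "2.50",
     {"Tool calling", "Vision", "JSON mode", "Long context"}),
]


def shortlist(currency, required):
    """Return (model id, 1M+1M estimate) pairs that have every required
    capability, cheapest first, restricted to a single currency."""
    rows = [
        (model_id, Decimal(inp) + Decimal(out))
        for model_id, cur, inp, out, caps in CATALOG
        if cur == currency and required <= caps  # subset test on capability sets
    ]
    return sorted(rows, key=lambda row: row[1])


for model_id, estimate in shortlist("USD", {"Tool calling"}):
    print(f"{model_id}: ${estimate:.3f} per 1M input + 1M output")
```

Context length, latency, and source labels from the table would still need a manual check before a model from this shortlist carries production traffic.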
FAQ
The calculator multiplies input and output tokens by the selected model's price per 1M tokens, then applies the number of requests.
Yes. ¥0.20 for input plus ¥2.00 for output gives ¥2.20 for this single 1M + 1M estimate.
Yes. The BYOK plan is intended for teams that already have provider accounts and want to keep policies and usage reporting consistent.
Yes. Enterprise pricing can be negotiated based on volume, provider mix, region, support requirements, and governance.