Meta model

Llama 4 Maverick

Meta | OpenRouter if available | Catalog
JSON mode | Long context | Streaming | Low cost | Tool calling | Vision
Input price: $0.15 / 1M tokens
Output price: $0.60 / 1M tokens
Context length: 1M tokens
Max output: 8.2k tokens
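
As a rough illustration of how the listed prices translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` is hypothetical. The exact token counts behind "1M" and "8.2k" (taken here as 1,000,000 and 8,192) are assumptions, as is the idea that input and output tokens share the context window.

```python
# Prices from the listing, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.15
OUTPUT_PRICE_PER_M = 0.60

# Limits from the listing. The exact values behind "1M" and "8.2k"
# are assumptions made for this sketch.
CONTEXT_LIMIT = 1_000_000
MAX_OUTPUT = 8_192


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a single request."""
    if output_tokens > MAX_OUTPUT:
        raise ValueError("output_tokens exceeds the listed maximum output")
    # Assumption: input and output tokens both count against the context.
    if input_tokens + output_tokens > CONTEXT_LIMIT:
        raise ValueError("request does not fit in the listed context length")
    return (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000


if __name__ == "__main__":
    # A 200k-token document summarized into 2k tokens.
    print(f"${estimate_cost(200_000, 2_000):.4f}")  # prints $0.0312
```

At these rates the input side dominates long-context workloads: the 200k input tokens account for $0.03 of the $0.0312 total.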

What is Llama 4 Maverick on NextModel?

Llama 4 Maverick is a Meta model listed in the NextModel catalog. This page compares its API pricing, provider, context length, capabilities, use cases, latency, and alternatives. It is best suited to open-model workflows, cost-sensitive long-context work, and classification. Input costs $0.15 per 1M tokens, output costs $0.60 per 1M tokens, and the context length is 1M tokens.

Best use cases

  • open-model workflows
  • cost-sensitive long context
  • classification

OpenAI-compatible example

Keep the OpenAI SDK style, point base_url at NextModel, and use the catalog ID llama-4-maverick.

Python
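from openai import OpenAI

# Point the standard OpenAI client at the NextModel endpoint.
client = OpenAI(
    api_key="YOUR_API_KEY",  # replace with your NextModel API key
    base_url="https://api.nextmodel.app/v1",
)

# Request a chat completion using the catalog ID for this model.
resp = client.chat.completions.create(
    model="llama-4-maverick",
    messages=[{"role": "user", "content": "Hello from NextModel"}],
)

# Print the text of the first returned choice.
print(resp.choices[0].message.content)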
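
Since the listing names classification as a use case and JSON mode as a capability, the two can be combined. The following is a sketch only: the helper names `build_classification_request` and `parse_label` are hypothetical, and it assumes NextModel honors the OpenAI-style `response_format` parameter for this model, which this page does not confirm.

```python
import json

MODEL_ID = "llama-4-maverick"  # catalog ID from the example above


def build_classification_request(text, labels):
    """Build a chat-completions payload that asks for a JSON label."""
    system = (
        "Classify the user's text. Reply with a JSON object of the form "
        '{"label": "<one of: ' + ", ".join(labels) + '>"}.'
    )
    return {
        "model": MODEL_ID,
        "messages": [
            {"role": "system", "content": system},
            {"role": "user", "content": text},
        ],
        # OpenAI-style JSON mode; assumed (not confirmed) to be supported.
        "response_format": {"type": "json_object"},
        "temperature": 0,
    }


def parse_label(content, labels):
    """Return the label from the model's reply, or None if it is invalid."""
    try:
        label = json.loads(content).get("label")
    except (json.JSONDecodeError, AttributeError, TypeError):
        # Not JSON, not a JSON object, or no content at all.
        return None
    return label if label in labels else None
```

With the client from the example above, the payload could be sent as `client.chat.completions.create(**payload)` and the reply checked with `parse_label(resp.choices[0].message.content, labels)`. Validating the label against the allowed set guards against replies that are valid JSON but outside the expected categories.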

Similar alternatives

Google | Catalog

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

Input: $0.30 / 1M tokens | Output: $2.50 / 1M tokens | Context: 1M
Best for: long-document summarization, image Q&A, fast multimodal routing
Routing: Configured
Tool calling | Vision | JSON mode | Long context
OpenRouter if available. OpenRouter public Models API live metadata; public price comes from the registry pricing rule.

Moonshot AI | Catalog

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and...

Input: $0.73 / 1M tokens | Output: $3.49 / 1M tokens | Context: 262.1k
Best for: long Chinese documents, contract review, knowledge-base Q&A
Routing: Configured
JSON mode | Long context | Streaming | Tool calling
OpenRouter if available. OpenRouter public Models API live metadata; public price comes from the registry pricing rule.

Mistral AI | Catalog

Mistral-Small-3.2-24B-Instruct-2506 is an updated 24B parameter model from Mistral optimized for instruction following, repetition reduction, and improved function calling. Compared to the 3.1 release, version 3.2 significantly improves accuracy on...

Input: $0.10 / 1M tokens | Output: $0.30 / 1M tokens | Context: 128k
Best for: translation, classification, short-form summarization
Routing: Configured
Tool calling | JSON mode | Streaming | Low cost
OpenRouter if available. OpenRouter public Models API live metadata; public price comes from the registry pricing rule.
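
To make the comparison with these alternatives concrete, the sketch below ranks the four listed models by estimated cost for a given workload, skipping any whose context length is too small. The `Listing` class and `rank_by_cost` function are hypothetical names. Prices are copied from this page, while the exact token counts for "1M", "262.1k", and "128k" are assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Listing:
    name: str
    input_price: float   # USD per 1M input tokens
    output_price: float  # USD per 1M output tokens
    context: int         # context length in tokens (assumed exact values)


LISTINGS = [
    Listing("Llama 4 Maverick", 0.15, 0.60, 1_000_000),
    Listing("Gemini 2.5 Flash", 0.30, 2.50, 1_000_000),
    Listing("Kimi K2.6", 0.73, 3.49, 262_100),
    Listing("Mistral-Small-3.2-24B-Instruct-2506", 0.10, 0.30, 128_000),
]


def rank_by_cost(input_tokens, output_tokens):
    """Return (name, cost) pairs, cheapest first, for models that fit."""
    # Assumption: input and output tokens both count against the context.
    needed = input_tokens + output_tokens
    costs = [
        (
            m.name,
            (input_tokens * m.input_price + output_tokens * m.output_price)
            / 1_000_000,
        )
        for m in LISTINGS
        if m.context >= needed
    ]
    return sorted(costs, key=lambda pair: pair[1])


if __name__ == "__main__":
    for name, cost in rank_by_cost(200_000, 2_000):
        print(f"{name}: ${cost:.4f}")
```

For a 200k-token input the Mistral model drops out because the request exceeds its 128k context, which leaves Llama 4 Maverick as the cheapest option. For short inputs the Mistral model is cheaper.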

FAQ

Meta: Llama 4 Maverick FAQ

Why include Llama 4 Maverick?

It gives teams an open-model candidate when comparing cost, context length, and provider optionality.