Meta: Llama 4 Maverick precos de API, capacidades, contexto e codigo compativel com OpenAI

Preco de entrada$0.15 / 1M tokens

Preco de saida$0.6 / 1M tokens

Comprimento de contexto1M tokens

Saida maxima8.2k tokens

O que e Llama 4 Maverick no NextModel?

Meta: Llama 4 Maverick Compare precos de API, fornecedor, comprimento de contexto, capacidades, casos de uso, latencia e alternativas. open-model workflows, cost-sensitive long context, classification. $0.15 / 1M tokens / $0.6 / 1M tokens. 1M tokens.

Melhores casos de uso

open-model workflows
cost-sensitive long context
classification

Exemplo compativel com OpenAI

Mantenha o estilo do SDK OpenAI, aponte base_url para NextModel e use o ID do catalogo llama-4-maverick.

Python

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="https://api.nextmodel.app/v1"
)

resp = client.chat.completions.create(
    model="llama-4-maverick",
    messages=[{"role": "user", "content": "Hello from NextModel"}]
)

print(resp.choices[0].message.content)

Alternativas semelhantes

GoogleCatalog

Google: Gemini 2.5 Flash

86

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

$0.3 / 1M tokensInput$2.50 / 1M tokensOutput1MContext

Best forlong-document summarization, image Q&A, fast multimodal routing

RoutingConfigured

Tool callingVisionJSON modeLong context

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

Moonshot AICatalog

MoonshotAI: Kimi K2.6

84

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and...

$0.73 / 1M tokensInput$3.49 / 1M tokensOutput262.1kContext

Best forlong Chinese documents, contract review, knowledge-base Q&A

RoutingConfigured

JSON modeLong contextStreamingTool calling

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

Mistral AICatalog

Mistral: Mistral Small 3.2 24B

79

Mistral-Small-3.2-24B-Instruct-2506 is an updated 24B parameter model from Mistral optimized for instruction following, repetition reduction, and improved function calling. Compared to the 3.1 release, version 3.2 significantly improves accuracy on...

$0.1 / 1M tokensInput$0.3 / 1M tokensOutput128kContext

Best fortranslation, classification, short-form summarization

RoutingConfigured

Tool callingJSON modeStreamingLow cost

OpenRouter if availableOpenRouter public Models API live metadata; public price comes from the registry pricing rule

View details

Comparar Meta: Llama 4 Maverick

Meta: Llama 4 Maverick / Google: Gemini 2.5 Flash Meta: Llama 4 Maverick / MoonshotAI: Kimi K2.6 Meta: Llama 4 Maverick / Mistral: Mistral Small 3.2 24B Meta: Llama 4 Maverick / OpenAI: GPT-4o-mini Meta: Llama 4 Maverick / DeepSeek: DeepSeek V4 Flash

FAQ

Meta: Llama 4 Maverick FAQ

Why include Llama 4 Maverick?

It gives teams an open-model candidate when comparing cost, context length, and provider optionality.