Nova Lite 2

Nova Lite 2
Amazon · Text Generation
POST /v1/chat/completions

Fast, cost-effective multimodal reasoning model for text, images, documents, and video on a 1M context (long docs and ~90 min clips).

At a glance

FieldValue
Model idnova-lite-2
Input modalitiesText, Image, Video, Document
Output modalitiesText
Context window1M
Weight precision-
Max output tokens32,000
Featuresvision
Native inferenceNo
NewNo
Supported endpointsPOST /v1/chat/completions, POST /v1/responses, POST /v1/messages

Pricing

ChargeSpecRate
Inputper 1M prompt tokens$0.38
Outputper 1M generated tokens$3.16
Cached inputper 1M tokens$0.2128
Web Search (Linkup)per call when invoked$0.013

Example request

$curl https://api.empiriolabs.ai/v1/chat/completions \
> -H 'Authorization: Bearer $EMPIRIOLABS_API_KEY' \
> -H 'Content-Type: application/json' \
> -d '{"model": "nova-lite-2", "messages": [{"role":"user","content":"Hello"}]}'

Parameters

ParameterTypeRequiredDefaultDescription
temperaturenumberno0.7Sampling temperature. 0 = deterministic, 2 = maximum randomness. · Range: 0 – 2
top_pnumberno0.9Nucleus sampling probability mass. Lower = more focused. · Range: 0 – 1
max_tokensnumberno4096Maximum tokens in the response. · Range: 1 – 65536
stopstringno-Up to 4 strings where the model will stop generating further tokens.
enable_reasoningbooleannotrueEnable the model’s reasoning mode. Slower but improves multi-step problems.
enable_thinkingbooleannotrueEnable extended reasoning before the final answer. Alias of enable_reasoning.
reasoning_effortenumno"medium"Reasoning effort level (low | medium | high). Higher = more thinking time. · Allowed: low, medium, high
reasoningstringno-Responses API reasoning object: {“effort”:“low|medium|high”}
web_search_linkupbooleannofalseOptional web search powered by Linkup. When enabled, recent web sources are retrieved using your latest user message as the query and provided to the model as additional context. Adds a flat $0.013 per request on top of the model’s normal token cost. Disabled by default.
disable_formattingbooleannofalseWhen enabled, the gateway will not append the “Sources” footer to assistant responses that used Linkup web search. Useful when the model output is piped to another system that expects no decoration.

Notes

Reasoning traces are NOT exposed from AWS. Video uploads up to ~1 GB.


Machine-readable schema: GET https://api.empiriolabs.ai/v1/models/nova-lite-2.