Nova Micro 1.0 | EmpirioLabs AI Docs

Amazon · Text Generation

POST /v1/chat/completions

Text-only foundation model tuned for ultra-low latency and cost on 128K context. Strong for summarization, translation, and chat with 44% cache discount.

At a glance

Field	Value
Model id	`nova-micro-1-0`
Model release date	2024-12-03
Input modalities	Text
Output modalities	Text
Context window	128K
Weight precision	-
Max output tokens	5,000
Features	fast, function_calling
Native inference	No
New	No
Structured output	JSON Mode
Batch API	35% off list price
Supported endpoints	`POST /v1/chat/completions`, `POST /v1/responses`, `POST /v1/messages`, `POST /v1beta/models/nova-micro-1-0:generateContent`
Alternate model ids	`amazon-nova-micro`, `amazon/nova-micro`, `nova-micro`, `us.amazon.nova-micro-v1:0`

Pricing

Charge	Spec	Rate
Input	per 1M prompt tokens	$0.040
Output	per 1M generated tokens	$0.16
Cached input	per 1M tokens	$0.0224
Web Search (Linkup)	per call when invoked	$0.013

Example request

$ curl https://api.empiriolabs.ai/v1/chat/completions \
>   -H 'Authorization: Bearer $EMPIRIOLABS_API_KEY' \
>   -H 'Content-Type: application/json' \
>   -d '{"model": "nova-micro-1-0", "messages": [{"role":"user","content":"Hello"}]}'

Parameters

Parameter	Type	Required	Default	Description
`temperature`	number	no	`0.7`	Sampling temperature. 0 = deterministic, 2 = maximum randomness. · Range: 0 – 2
`top_p`	number	no	`0.9`	Nucleus sampling probability mass. Lower = more focused. · Range: 0 – 1
`max_tokens`	number	no	`4096`	Maximum tokens in the response. · Range: 1 – 65536
`stop`	string	no	-	Up to 4 strings where the model will stop generating further tokens.
`response_format`	enum	no	-	Return the output as a valid JSON object (JSON mode). Describe the fields you want in your prompt.
`web_search_linkup`	boolean	no	false	Optional web search powered by Linkup. When enabled, recent web sources are retrieved using your latest user message as the query and provided to the model as additional context. Adds $0.013 per call when invoked on top of the model’s normal token cost. Disabled by default.
`disable_formatting`	boolean	no	false	When enabled, the gateway will not append the “Sources” footer to assistant responses that used Linkup web search. Useful when the model output is piped to another system that expects no decoration.

Notes

44% discount on cached chat.

Machine-readable schema: GET https://api.empiriolabs.ai/v1/models/nova-micro-1-0.