Qwen3.5 Flash

POST /v1/chat/completionsVision-language model with hybrid linear-attention plus sparse MoE, 1M context, and fast multimodal text/image/video inference.
At a glance
Pricing
Example request
Parameters
Notes
Built-in tools (billed only when invoked)
- Web search: $0.015/call
- Web extractor: free
- Code interpreter: free
- Text-to-image search: $0.012/call
- Image-to-image search: $0.012/call
Other
- Thinking tokens are billed as output tokens
Text-to-Image Search and Image-to-Image Search use the Image Search pricing row. Each invoked image search is billed at that listed per-call rate.
Per-tool billing (usage.tool_usage)
When this model invokes tools (web search, code interpreter, etc.) inside a single request, the response carries a normalized usage.tool_usage map alongside the token counts. The example below shows the shape — exact field names, units, and which tools appear can vary slightly per provider:
The tool counts are already factored into cost_usd — they are surfaced for transparency so you can audit per-tool billing. The field is omitted when no tools were invoked.
Variants
:variant1
Pricing
Parameters
Machine-readable schema: GET https://api.empiriolabs.ai/v1/models/qwen3-5-flash.
