For AI agents: a documentation index is available at the root level at /llms.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Logo
WebsiteModelsPricingGet Started
DocumentationAPI Reference
DocumentationAPI Reference
  • Overview
    • Welcome
    • Getting Started
    • Authentication
    • Concepts
  • Platform
    • Models and Pricing
    • Billing and Credits
    • Limits and API Keys
    • Account Usage API
    • Generation Templates
    • GPU Cloud
    • Hosted Agents
    • OpenAI and Anthropic Compatibility
    • Integrations
  • Providers and Models
    • All providers
  • Reference
    • API Reference Overview
    • AI Agent Access
    • Support
    • Changelog
  • Welcome
  • Getting Started
  • Authentication
  • Concepts
  • Models and Pricing
  • Billing and Credits
  • Limits and API Keys
  • Account Usage API
  • Generation Templates
  • GPU Cloud
  • Hosted Agents
  • OpenAI and Anthropic Compatibility
  • Integrations
  • All providers
  • ACE-Step overview
  • ACE-Step 1.5 XL
  • Alibaba Cloud overview
  • HappyHorse 1.0
  • Qwen Image 2.0
  • Qwen3 Max
  • Qwen3 Max Preview
  • Qwen3 Max Thinking
  • Qwen3 Rerank
  • Qwen3.5 122B-A10B
  • Qwen3.5 27B
  • Qwen3.5 35B-A3B
  • Qwen3.5 397B-A17B
  • Qwen3.5 4B
  • Qwen3.5 9B
  • Qwen3.5 Flash
  • Qwen3.5 Omni Flash
  • Qwen3.5 Omni Plus
  • Qwen3.5 Plus
  • Qwen3.6 27B
  • Qwen3.6 Flash
  • Qwen3.6 Max Preview
  • Qwen3.6 Plus
  • Qwen3.7 Max
  • Qwen3.7 Plus
  • Text Embedding v4
  • Tongyi Embedding Vision Flash
  • Tongyi Embedding Vision Plus
  • Wan 2.6
  • Wan 2.7
  • Wan2.7 Image
  • Amazon overview
  • Amazon Nova Canvas
  • Amazon Nova Reel 1.1
  • Nova Lite 1.0
  • Nova Lite 2
  • Nova Micro 1.0
  • Nova Premier 1.0
  • Nova Pro 1.0
  • Black Forest Labs overview
  • FLUX.2 Klein 4B
  • ByteDance overview
  • Seed 2.0 Code
  • Seed 2.0 Lite
  • Seed 2.0 Mini
  • Seed 2.0 Pro
  • Seedance 2.0 Fast
  • Seedance 2.0 Pro
  • Seedream 5.0 Lite
  • Deepgram overview
  • Deepgram Nova 3
  • DeepSeek overview
  • DeepSeek Prover V2
  • DeepSeek V3.2
  • DeepSeek V4 Flash
  • DeepSeek V4 Pro
  • Janus-Pro DeepSeek
  • Exa overview
  • Exa Answer
  • Exa Research
  • Exa Search
  • Google overview
  • Gemini 2.5 Flash TTS
  • Gemini 2.5 Pro TTS
  • Gemini 3.1 Flash TTS
  • Gemma 3 27B
  • Gemma 4 26B-A4B
  • Gemma 4 E4B
  • GPTZero overview
  • GPTZero
  • Inworld overview
  • TTS 1.5 Max
  • TTS 1.5 Mini
  • Kling AI overview
  • Kling O3
  • Kling v3 Motion Control
  • Linkup overview
  • Linkup Deep Search
  • Linkup Standard
  • Manus overview
  • Manus
  • Microsoft overview
  • TRELLIS.2 4B
  • MiniMax overview
  • MiniMax M2.7
  • MiniMax M2.7 Highspeed
  • MiniMax M3
  • Mistral AI overview
  • Magistral Medium 2509 Thinking
  • Mistral Medium 3
  • Mistral Medium 3.1
  • Mistral Small 3.1
  • Mistral Small 4
  • Moonshot AI overview
  • Kimi K2.6
  • Kimi K2.7 Code
  • OpenAI overview
  • OpenAI Whisper 1
  • Whisper Large v3 Turbo
  • OpenMOSS overview
  • MOSS Video and Audio
  • Perplexity overview
  • Perplexity Advanced Deep Research
  • Perplexity Deep Research
  • Perplexity Pro Search
  • Perplexity Search
  • Perplexity Sonar
  • Perplexity Sonar Pro
  • Perplexity Sonar Reasoning Pro
  • PixVerse overview
  • Pixverse v5
  • Pixverse v5.6
  • Soul AI Lab overview
  • SoulX Podcast
  • Stability AI overview
  • Stable Audio 2.0
  • Stable Audio 2.5
  • Tavily overview
  • Tavily Research
  • Tavily Search
  • Tencent overview
  • Hunyuan Image 3
  • Hunyuan Video 1.5
  • VITA-Group / EPFL overview
  • SVI 2.0 Pro
  • WinFunc overview
  • DeepReasoning
  • xAI overview
  • Grok Imagine Video 1.5
  • Xiaomi overview
  • MiMo V2 Flash
  • MiMo V2.5
  • MiMo V2.5 Pro
  • Z.ai overview
  • GLM 4.5 Flash
  • GLM 4.6V Flash
  • GLM 4.7 Flash
  • GLM 5.1
  • GLM 5.2
  • GLM TTS
  • API Reference Overview
  • AI Agent Access
  • Support
  • Changelog
  • June 7, 2026
  • May 24, 2026
  • May 7, 2026
WebsiteModelsPricingGet Started
On this page
  • June 7, 2026
  • GPU Cloud and Hosted Agents are now available
  • May 24, 2026
  • Playground improvements and creative templates
  • May 7, 2026
  • Integration Setup Support

Changelog

June 7, 2026
June 7, 2026

May 24, 2026
May 24, 2026

May 7, 2026
May 7, 2026
Built with
June 7, 2026

GPU Cloud and Hosted Agents are now available

GPU Cloud and Hosted Agents are out of early access and available to every account, with full API coverage and in-dashboard chat for deployed models.

What’s new

  • GPU Cloud is generally available. Deploy a managed GPU instance in a click, then serve any Hugging Face model behind an OpenAI-compatible endpoint, run a one-click template (JupyterLab, ComfyUI, Web Terminal, Ollama), or bring your own CUDA Docker image. Pricing is per second, the rate is locked when you launch, and you reach the workload through the authenticated connect path at api.empiriolabs.ai/v1/gpu/connect/{instance_id}/....
  • Chat with your deployed model in the dashboard. Any GPU instance running an OpenAI-compatible model gets a built-in chat page, so you can test prompts, attach images or audio, and stream responses without leaving the dashboard.
  • Hosted Agents is generally available. Deploy a private OpenClaw or Hermes agent that lives in your chat apps, runs code safely in an isolated sandbox, generates media across image, video, speech, and more, browses the web, and connects your tools through remote MCP connectors. Each agent is its own monthly subscription billed to your credits.
  • Full API coverage. Anything you can do in the dashboard you can do programmatically: deploy and manage GPU instances under /v1/gpu/*, and deploy, message, and configure agents (model, skills, connectors, channels, and access) under /v1/hosted-agents/*. Usage and spend for both appear in /v1/account/usage.

Get started

Open GPU Cloud or Hosted Agents in the dashboard, or read the GPU Cloud and Hosted Agents guides.

May 24, 2026

Playground improvements and creative templates

A batch of playground quality-of-life updates plus the new creative templates feature for image and video generation.

What’s new

  • Creative templates for image and video generation. Pre-curated effect recipes you can apply with a single click in the playground or by passing template: "<slug>" to /v1/images/generations or /v1/videos/generations. Each template picks a recommended model, ships a supported model list, and applies sensible default parameters so you can ship a polished generation without tuning every knob.
  • Live cost estimate in the playground. A running estimate now appears next to the Send button before you fire a request, so you can see what each call will cost against your live balance.
  • Save generated media from your usage logs. A Save button on usage log entries downloads the file (image, video, audio) for any past generation, so you can keep your output locally before the standard retention window expires.
  • Edit, regenerate, and delete on text chat messages. Hover any user or assistant bubble in the playground to edit a prompt and resubmit, regenerate the last response, or mark messages for deletion.
May 7, 2026

Integration Setup Support

We added new integration setup guidance to help teams connect EmpirioLabs with coding agents, IDE extensions, CLIs, and OpenAI-compatible tools more quickly.

What’s new

  • Added an Integrations guide with setup paths for popular agent tools, including OpenCode, Claude Code, Cline, Qwen Code, Codex CLI, Aider, Continue, OpenHands, Hermes Agent, goose, Zed, and Kilo/Roo/Cursor-style IDEs.
  • Added a setup helper that can generate selected local configuration files for project-level and user-level workflows, with configurable tool selection, model choice, and an optional smoke test.
  • Clarified the core connection values for OpenAI-compatible clients, Anthropic-style Messages clients, API keys, model IDs, and live model catalog checks.