GPU Cloud and Hosted Agents are now available

GPU Cloud and Hosted Agents are out of early access and available to every account, with full API coverage and in-dashboard chat for deployed models.

What’s new

  • GPU Cloud is generally available. Deploy a managed GPU instance in a click, then serve any Hugging Face model behind an OpenAI-compatible endpoint, run a one-click template (JupyterLab, ComfyUI, Web Terminal, Ollama), or bring your own CUDA Docker image. Pricing is per second, the rate is locked when you launch, and you reach the workload through the authenticated connect path at api.empiriolabs.ai/v1/gpu/connect/{instance_id}/....
  • Chat with your deployed model in the dashboard. Any GPU instance running an OpenAI-compatible model gets a built-in chat page, so you can test prompts, attach images or audio, and stream responses without leaving the dashboard.
  • Hosted Agents is generally available. Deploy a private OpenClaw or Hermes agent that lives in your chat apps, runs code safely in an isolated sandbox, generates media across image, video, speech, and more, browses the web, and connects your tools through remote MCP connectors. Each agent is its own monthly subscription billed to your credits.
  • Full API coverage. Anything you can do in the dashboard you can do programmatically: deploy and manage GPU instances under /v1/gpu/*, and deploy, message, and configure agents (model, skills, connectors, channels, and access) under /v1/hosted-agents/*. Usage and spend for both appear in /v1/account/usage.

Get started

Open GPU Cloud or Hosted Agents in the dashboard, or read the GPU Cloud and Hosted Agents guides.