Deploy a GPU

Start a GPU Cloud instance. Choose a curated model, paste any Hugging Face repo id (served with vLLM, OpenAI-compatible at `/v1`), pick a template (JupyterLab, ComfyUI, Web Terminal, Ollama), or run a custom Docker image. Billing starts when the GPU reaches `running` and is metered by the second against your credit balance. Your account's current GPU limit is enforced at deploy and start time.

Autenticación

AuthorizationBearer

Pass your EmpirioLabs API key as a bearer token. The Anthropic-style x-api-key header is also accepted on every endpoint.

Solicitud

This endpoint expects an object.
gpu_slugstringRequerido
The GPU type to deploy from the catalog.
modeenumOpcional
How to provision the GPU.
hf_idstringOpcional

A Hugging Face repo id to serve with vLLM (mode model). Set HF_TOKEN in env for gated repos.

template_slugstringOpcional

A curated model or template slug (mode model or template).

imagestringOpcional

A CUDA Docker image to run (mode custom).

portslist of integersOpcional

Ports the workload listens on (mode custom).

envmap from strings to stringsOpcional
Environment variables for the workload.
num_gpusintegerOpcional1-64Valor predeterminado: 1
Number of GPUs. Your current account limit is enforced at deploy and start time.
disk_gbintegerOpcional100-300Valor predeterminado: 150

Requested runtime disk in GB (100-300).

namestringOpcional
Optional label for the GPU Cloud instance.

Respuesta

Instance accepted and provisioning.
instanceobject

Errores

402
Payment Required Error
404
Not Found Error
409
Conflict Error
422
Unprocessable Entity Error