Skip to main content

Base URLs

PurposeURL
Inference (OpenAI-compatible)http://localhost:8000/v1
Management APIhttp://localhost:3000/api
Prometheus metricshttp://localhost:8000/metrics

Authentication

Disabled by default. Enable for internet-exposed instances:
ainode auth enable    # generates an API key
ainode auth new-key   # rotate
Pass the key as Authorization: Bearer <key> on all /v1/* requests.

OpenAI-compatible endpoints

EndpointMethodDescription
/v1/chat/completionsPOSTChat completions (streaming supported)
/v1/completionsPOSTText completions
/v1/embeddingsPOSTEmbeddings
/v1/modelsGETList loaded models

Management endpoints

EndpointDescription
GET /api/statusNode status, GPU info, engine state
GET /api/nodesAll discovered cluster nodes
GET /api/cluster/resourcesAggregated VRAM across cluster
GET /api/metricsJSON metrics snapshot
GET /metricsPrometheus text exposition
POST /api/engine/set-modelSwitch active model
POST /api/engine/updateTrigger docker pull + restart
GET /api/version/checkCheck for newer version on GHCR