GET /ai/deployment/{id}

Get Deployment details

Path parameters

  • id string(uuid) Required

Responses

  • 404 application/json

    404

    Hide response attributes Show response attributes object
    • type string(uri-reference) Required
    • title string Required
    • status integer Required

      Minimum value is 100, maximum value is 599.

    • detail string Required
    • instance string(uri-reference)
    • errors array[object]
      Hide errors attributes Show errors attributes object
      • path string
      • detail string
      • pointer string
      • location string
  • 200 application/json

    200

    Hide response attributes Show response attributes object
    • gpu-count integer(int64)

      Number of GPUs

      Minimum value is 1.

    • updated-at string(date-time)

      Update time

    • deployment-url string

      Deployment URL (nullable)

    • service-level string

      Service level

      Minimum length is 1.

    • inference-engine-version string

      Inference engine version

      Values are 0.12.0, 0.15.1, 0.16.0, or 0.17.0. Default value is 0.17.0.

    • name string

      Deployment name

      Minimum length is 1.

    • state string

      Deployment state

      Values are ready, creating, error, or deploying.

    • gpu-type string

      GPU type family

      Minimum length is 1.

    • id string(uuid)

      Deployment ID

    • replicas integer(int64)

      Number of replicas (>=0)

      Minimum value is 0.

    • state-details string

      Deployment state details

    • created-at string(date-time)

      Creation time

    • inference-engine-parameters array[string]

      Optional extra inference engine server CLI args

    • model object
      Hide model attributes Show model attributes object
      • name string

        Associated model name

        Minimum length is 1.

      • id string(uuid)

        Associated model ID

GET /ai/deployment/{id}
curl \
 --request GET 'https://api-ch-gva-2.exoscale.com/v2/ai/deployment/{id}'
Response examples (404)
{
  "type": "string",
  "title": "string",
  "status": 42,
  "detail": "string",
  "instance": "string",
  "errors": [
    {
      "path": "string",
      "detail": "string",
      "pointer": "string",
      "location": "string"
    }
  ]
}
Response examples (200)
{
  "gpu-count": 42,
  "updated-at": "2026-05-04T09:42:00Z",
  "deployment-url": "string",
  "service-level": "string",
  "inference-engine-version": "0.17.0",
  "name": "string",
  "state": "ready",
  "gpu-type": "string",
  "id": "string",
  "replicas": 42,
  "state-details": "string",
  "created-at": "2026-05-04T09:42:00Z",
  "inference-engine-parameters": [
    "string"
  ],
  "model": {
    "name": "string",
    "id": "string"
  }
}