[BETA] Get Deployment

GET /ai/deployment/{id}

Get Deployment details

Path parameters

id string(uuid) Required

Responses

404 application/json

404
Hide response attributes Show response attributes object
- type string(uri-reference) Required
- title string Required
- status integer Required
  
  Minimum value is 100, maximum value is 599.
- detail string Required
- instance string(uri-reference)
- errors array[object]
  
  Hide errors attributes Show errors attributes object
  
  path string
  
  detail string
  
  pointer string
  
  location string
200 application/json

200
Hide response attributes Show response attributes object
- gpu-count integer(int64)
  
  Number of GPUs
  
  Minimum value is 1.
- updated-at string(date-time)
  
  Update time
- deployment-url string
  
  Deployment URL (nullable)
- service-level string
  
  Service level
  
  Minimum length is 1.
- inference-engine-version string
  
  Inference engine version
  
  Values are 0.12.0, 0.15.1, 0.16.0, or 0.17.0. Default value is 0.17.0.
- name string
  
  Deployment name
  
  Minimum length is 1.
- state string
  
  Deployment state
  
  Values are ready, creating, error, or deploying.
- gpu-type string
  
  GPU type family
  
  Minimum length is 1.
- id string(uuid)
  
  Deployment ID
- replicas integer(int64)
  
  Number of replicas (>=0)
  
  Minimum value is 0.
- state-details string
  
  Deployment state details
- created-at string(date-time)
  
  Creation time
- inference-engine-parameters array[string]
  
  Optional extra inference engine server CLI args
- model object
  
  Hide model attributes Show model attributes object
  
  name string
  
  Associated model name
  
  Minimum length is 1.
  
  id string(uuid)
  
  Associated model ID

GET /ai/deployment/{id}

curl \
 --request GET 'https://api-ch-gva-2.exoscale.com/v2/ai/deployment/{id}'

Response examples (404)

{
  "type": "string",
  "title": "string",
  "status": 42,
  "detail": "string",
  "instance": "string",
  "errors": [
    {
      "path": "string",
      "detail": "string",
      "pointer": "string",
      "location": "string"
    }
  ]
}

Response examples (200)

{
  "gpu-count": 42,
  "updated-at": "2026-05-04T09:42:00Z",
  "deployment-url": "string",
  "service-level": "string",
  "inference-engine-version": "0.17.0",
  "name": "string",
  "state": "ready",
  "gpu-type": "string",
  "id": "string",
  "replicas": 42,
  "state-details": "string",
  "created-at": "2026-05-04T09:42:00Z",
  "inference-engine-parameters": [
    "string"
  ],
  "model": {
    "name": "string",
    "id": "string"
  }
}