[BETA] List Deployments

View as markdown
GET /ai/deployment

Responses

  • 200 application/json

    200

    Hide response attribute Show response attribute object
    • deployments array[object]

      AI deployment

      Hide deployments attributes Show deployments attributes object
      • gpu-count integer(int64)

        Number of GPUs

        Minimum value is 0.

      • updated-at string(date-time)

        Update time

      • deployment-url string

        Deployment URL (nullable)

      • service-level string

        Service level

        Minimum length is 1.

      • name string

        Deployment name

        Minimum length is 1.

      • gpu-type string

        GPU type family

        Minimum length is 1.

      • status string

        Deployment status

        Values are ready, creating, error, or deploying.

      • id string(uuid)

        Deployment ID

      • replicas integer(int64)

        Number of replicas (>=0)

        Minimum value is 0.

      • created-at string(date-time)

        Creation time

      • model object
        Hide model attributes Show model attributes object
        • id string(uuid)

          Associated model ID

        • name string

          Associated model name

          Minimum length is 1.

GET /ai/deployment
curl \
 --request GET 'https://api-ch-gva-2.exoscale.com/v2/ai/deployment'
Response examples (200)
{
  "deployments": [
    {
      "gpu-count": 42,
      "updated-at": "2025-05-04T09:42:00Z",
      "deployment-url": "string",
      "service-level": "string",
      "name": "string",
      "gpu-type": "string",
      "status": "ready",
      "id": "string",
      "replicas": 42,
      "created-at": "2025-05-04T09:42:00Z",
      "model": {
        "id": "string",
        "name": "string"
      }
    }
  ]
}