What is dstack?

dstack is a unified control plane for GPU provisioning and orchestration that works with any GPU cloud, Kubernetes, or on-prem clusters.

It streamlines development, training, and inference, and is compatible with any hardware, open-source tools, and frameworks.

Accelerators

dstack supports NVIDIA, AMD, TPU, Intel Gaudi, and Tenstorrent accelerators out of the box.

How does it work?

1. Set up the server

Before using dstack, make sure you have either installed the server or signed up for dstack Sky.

2. Define configurations

dstack supports the following configurations:

  • Fleets — for managing cloud and on-prem clusters
  • Dev environments — for interactive development using a desktop IDE
  • Tasks — for scheduling jobs, incl. distributed ones (or running web apps)
  • Services — for deploying models (or web apps)
  • Volumes — for managing network volumes (to persist data)

Configurations are defined as YAML files within your repo.
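
As an illustration, a task configuration might look like the sketch below. The task name, the commands, the train.py script, and the GPU memory size are placeholder assumptions rather than values taken from this page; consult the configuration reference for the full set of supported properties.

```yaml
# Hypothetical task configuration, e.g. saved as train.dstack.yml in the repo
type: task
# Illustrative name; any identifier for the run would do
name: train-example

# Python version to pre-install in the run's environment (assumed value)
python: "3.11"

# Commands executed by the job; the files referenced here are placeholders
commands:
  - pip install -r requirements.txt
  - python train.py

# Requested resources; 24GB of GPU memory is an arbitrary example
resources:
  gpu: 24GB
```

The other configuration types follow the same pattern, with the type property selecting the kind of configuration. A file like this would then typically be passed to the CLI with dstack apply -f train.dstack.yml.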

3. Apply configurations

Apply the configuration via the dstack apply CLI command or through a programmatic API.

dstack automatically manages infrastructure provisioning and job scheduling, while also handling auto-scaling, port-forwarding, ingress, and more.

Where do I start?

  1. Proceed to installation
  2. See quickstart
  3. Browse examples
  4. Join Discord