Skip to navigationSkip to main contentSkip to footerScaleway Docs

How to scale Managed Inference deployments

You can scale your Managed Inference deployment up or down to match it to the incoming load of your deployment.

Important

This feature is currently in Public Beta.

Before you start

To complete the actions presented below, you must have:

How to scale a Managed Inference deployment in size

  1. Click Managed Inference in the AI section of the Scaleway console side menu. A list of your deployments displays.
  2. Click a deployment name or more icon > More info to access the deployment dashboard.
  3. Click the Settings tab and navigate to the Scaling section.
  4. Click Update node count and adjust the number of nodes in your deployment.
    Note

    High availability is only guaranteed with two or more nodes.

  5. Click Update node count to update the number of nodes in your deployment.
    Note

    Your deployment will be unavailable for 15-30 minutes while the node update is in progress.

Still need help?

Create a support ticket
No Results