
How to scale Managed Inference deployments

Reviewed on 03 June 2025
Published on 03 June 2025

You can scale your Managed Inference deployment up or down to match its incoming load.

Important

This feature is currently in Public Beta.

Before you start

To complete the actions presented below, you must have:

  • A Scaleway account logged into the console
  • A Managed Inference deployment
  • Owner status or IAM permissions allowing you to perform actions in the intended Organization

How to scale a Managed Inference deployment in size

  1. Click Managed Inference in the AI section of the Scaleway console side menu. A list of your deployments displays.
  2. Click a deployment name or «See more Icon» > More info to access the deployment dashboard.
  3. Click the Settings tab and navigate to the Scaling section.
  4. Click Update node count and adjust the number of nodes in your deployment.
    Note

    High availability is only guaranteed with two or more nodes.

  5. Click Update node count to confirm the new number of nodes.
    Note

    Your deployment will be unavailable for 15-30 minutes while the node update is in progress.
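
The console steps above could also be scripted. The following is a minimal sketch, not a confirmed API call: the base path, the PATCH method, and the `min_size`/`max_size` field names are assumptions that should be checked against the Managed Inference API reference, and `build_scale_request` and `ha_guaranteed` are hypothetical helper names. The sketch only builds the request; it does not send it.

```python
import json
import urllib.request

# Assumed base path; verify against the Managed Inference API reference.
API_BASE = "https://api.scaleway.com/inference/v1"


def ha_guaranteed(node_count: int) -> bool:
    """High availability is only guaranteed with two or more nodes."""
    return node_count >= 2


def build_scale_request(region: str, deployment_id: str,
                        node_count: int, secret_key: str) -> urllib.request.Request:
    """Build (but do not send) a request that sets the node count."""
    # Assumed lower bound: a deployment needs at least one node.
    if node_count < 1:
        raise ValueError("node_count must be at least 1")
    url = f"{API_BASE}/regions/{region}/deployments/{deployment_id}"
    # Assumed field names for the node count.
    body = json.dumps({"min_size": node_count, "max_size": node_count}).encode()
    return urllib.request.Request(
        url,
        data=body,
        method="PATCH",
        headers={"X-Auth-Token": secret_key, "Content-Type": "application/json"},
    )


if __name__ == "__main__":
    # Placeholder identifiers, for illustration only.
    req = build_scale_request("fr-par", "my-deployment-id", 2, "my-secret-key")
    print(req.get_method(), req.full_url)
    if not ha_guaranteed(2):
        print("Warning: high availability is not guaranteed with one node.")
```

To send the request you would pass it to `urllib.request.urlopen`. As with the console flow, expect the deployment to be unavailable for 15-30 minutes while the node update is in progress.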

See also
  • How to monitor a deployment
  • How to manage allowed IP addresses
© 2023-2025 – Scaleway