How to scale dedicated Generative APIs deployments
You can scale your dedicated Generative APIs deployment up or down to match it to the incoming load of your deployment.
Before you start
To complete the actions presented below, you must have:
- A Scaleway account logged into the console
- A dedicated Generative APIs deployment
- Owner status or IAM permissions allowing you to perform actions in the intended Organization
How to scale a dedciated Generative APIs deployment in size
- Click Generative APIs in the AI section of the side menu in the Scaleway console to access the dashboard. The list of models displays.
- Select the Deployments tab.
- From the drop-down menu, select the geographical region you want to manage.
- Click a deployment name to access the deployment's dashboard.
- Click the Settings tab and navigate to the Scaling section.
- Click Update node count and adjust the number of nodes in your deployment.
- Click Update node count to update the number of nodes in your deployment.
See Also
Still need help?Create a support ticket