Generative APIs - Quickstart
Scaleway Generative APIs - Dedicated Deployment is the first European Managed Inference platform on the market. It is a scalable and secure inference engine for Large Language Models (LLMs).
Scaleway Generative APIs - Dedicated Deployment is a fully managed service that allows you to serve generative AI models in a production environment. With Scaleway Generative APIs - Dedicated Deployment, you can easily deploy, manage, and scale LLMs without worrying about the underlying infrastructure.
Here are some of the key features of Scaleway Generative APIs:
- Easy deployment: Deploy state-of-the-art open weights LLMs with just a few clicks. Scaleway Generative APIs provides a simple and intuitive interface for generating dedicated endpoints.
- Security: Scaleway provides a secure environment to run your models. Our platform is built on top of a secure architecture, and we use state-of-the-art cloud security.
- Complete data privacy: No storage or third-party access to your data (prompt or responses), to ensure it remains exclusively yours.
- Interoperability: Scaleway Generative APIs was designed as a drop-in replacement for the OpenAI APIs, for a seamless transition on your applications already using its libraries.
Before you start
To complete the actions presented below, you must have:
- A Scaleway account logged into the console
- Owner status or IAM permissions allowing you to perform actions in the intended Organization
How to create a dedicated deployment
- Click Generative APIs in the AI section of the side menu in the Scaleway console to access the dashboard. The list of models displays.
- Select the Deployments tab.
- Click Deploy a model to launch the model deployment wizard.
- Provide the necessary information:
- Select the desired model and the quantization to use for your deployment from the available options.
- Choose the geographical region for the deployment.
- Select a node type, the GPU Instance that will be used with your deployment.
- Choose the number of nodes for your deployment. Note that this feature is currently in Public Beta.
- Enter a name for the deployment, along with optional tags to aid in organization.
- Configure network settings for the deployment:
- Enable Private Network for secure communication and restricted availability within Private Networks. Choose an existing Private Network from the drop-down list, or create a new one.
- Enable Public Network to access resources via the public Internet. API key protection is enabled by default.
- Click Deploy model to launch the deployment process. Once the deployment is ready, it will be listed among your deployments.
How to access a Generative APIs deployment
Generative APIs deployments have authentication enabled by default. As such, your endpoints expect a secret key generated with Scaleway's Identity and Access Management service (IAM) for authentication.
- Click Generative APIs in the AI section of the side menu in the Scaleway console to access the dashboard. The list of models displays.
- Select the Deployments tab.
- From the drop-down menu, select the geographical region containing your deployment.
- Click the name of the deployment you wish to access. The deployment's Overview page displays.
- Scroll down to the Deployment authentication section and click the Generate key button. The token creation wizard displays.
- Fill in the required information for API key creation and click Generate API key.
- Copy and safely store your credentials before closing the window, as they will not be shown again.
How to interact with Generative APIs
- Click Generative APIs in the AI section of the side menu in the Scaleway console to access the dashboard. The list of models displays.
- Select the Deployments tab.
- From the drop-down menu, select the geographical region where your desired deployment was created.
- Click the name of the deployment you wish to edit. The deployment's Overview page displays.
- Click the Playground tab, then View code to see code examples in various environments. Copy and paste them into your code editor or terminal.
How to delete a deployment
- Click Generative APIs in the AI section of the side menu in the Scaleway console to access the dashboard. The list of models displays.
- Select the Deployments tab.
- From the drop-down menu, select the geographical region where your deployment was created.
- Choose the deployment you wish to delete.
- Navigate to the Settings tab.
- Click Delete deployment at the bottom of the page.
- Type DELETE to confirm and click Delete deployment.
Alternatively, from the Deployments listing, click the more icon icon next to the deployment name you no longer need, and click Delete. A pop-up appears. Type DELETE to confirm, then click Delete deployment.