Skip to navigationSkip to main contentSkip to footerScaleway DocsSparklesIconAsk our AI
SparklesIconAsk our AI

Generative APIs - Quickstart

Scaleway Generative APIs - Dedicated Deployment is the first European Managed Inference platform on the market. It is a scalable and secure inference engine for Large Language Models (LLMs).

Scaleway Generative APIs - Dedicated Deployment is a fully managed service that allows you to serve generative AI models in a production environment. With Scaleway Generative APIs - Dedicated Deployment, you can easily deploy, manage, and scale LLMs without worrying about the underlying infrastructure.

Here are some of the key features of Scaleway Generative APIs:

  • Easy deployment: Deploy state-of-the-art open weights LLMs with just a few clicks. Scaleway Generative APIs provides a simple and intuitive interface for generating dedicated endpoints.
  • Security: Scaleway provides a secure environment to run your models. Our platform is built on top of a secure architecture, and we use state-of-the-art cloud security.
  • Complete data privacy: No storage or third-party access to your data (prompt or responses), to ensure it remains exclusively yours.
  • Interoperability: Scaleway Generative APIs was designed as a drop-in replacement for the OpenAI APIs, for a seamless transition on your applications already using its libraries.

Before you start

To complete the actions presented below, you must have:

  • A Scaleway account logged into the console
  • Owner status or IAM permissions allowing you to perform actions in the intended Organization

How to create a dedicated deployment

  1. Click Generative APIs in the AI section of the side menu in the Scaleway console to access the dashboard. The list of models displays.
  2. Select the Deployments tab.
  3. Click Deploy a model to launch the model deployment wizard.
  4. Provide the necessary information:
    • Select the desired model and the quantization to use for your deployment from the available options.
      AlertCircleIcon
      Important

      Scaleway Generative APIs allows you to deploy various AI models, either from the Scaleway catalog or by importing a custom model. For detailed information about supported models, visit our Generative APIs model catalog.

      InformationOutlineIcon
      Note

      Some models may require acceptance of an end-user license agreement (EULA). If prompted, review the terms and conditions, and accept the license accordingly.

    • Choose the geographical region for the deployment.
    • Select a node type, the GPU Instance that will be used with your deployment.
    • Choose the number of nodes for your deployment. Note that this feature is currently in Public Beta.
      InformationOutlineIcon
      Note

      High availability is only guaranteed with two or more nodes.

  5. Enter a name for the deployment, along with optional tags to aid in organization.
  6. Configure network settings for the deployment:
    • Enable Private Network for secure communication and restricted availability within Private Networks. Choose an existing Private Network from the drop-down list, or create a new one.
    • Enable Public Network to access resources via the public Internet. API key protection is enabled by default.
    AlertCircleIcon
    Important
    • Enabling both private and public networks will result in two distinct endpoints (public and private) for your deployment.
    • Deployments must have at least one endpoint, either public or private.
  7. Click Deploy model to launch the deployment process. Once the deployment is ready, it will be listed among your deployments.

How to access a Generative APIs deployment

Generative APIs deployments have authentication enabled by default. As such, your endpoints expect a secret key generated with Scaleway's Identity and Access Management service (IAM) for authentication.

  1. Click Generative APIs in the AI section of the side menu in the Scaleway console to access the dashboard. The list of models displays.
  2. Select the Deployments tab.
  3. From the drop-down menu, select the geographical region containing your deployment.
  4. Click the name of the deployment you wish to access. The deployment's Overview page displays.
  5. Scroll down to the Deployment authentication section and click the Generate key button. The token creation wizard displays.
  6. Fill in the required information for API key creation and click Generate API key.
  7. Copy and safely store your credentials before closing the window, as they will not be shown again.
CheckCircleOutlineIcon
Tip

You have full control over authentication from the Security tab of your deployment. Authentication is enabled by default.

How to interact with Generative APIs

  1. Click Generative APIs in the AI section of the side menu in the Scaleway console to access the dashboard. The list of models displays.
  2. Select the Deployments tab.
  3. From the drop-down menu, select the geographical region where your desired deployment was created.
  4. Click the name of the deployment you wish to edit. The deployment's Overview page displays.
  5. Click the Playground tab, then View code to see code examples in various environments. Copy and paste them into your code editor or terminal.
InformationOutlineIcon
Note

Prompt structure may vary from one model to another. Refer to the specific instructions for use in our dedicated documentation.

How to delete a deployment

  1. Click Generative APIs in the AI section of the side menu in the Scaleway console to access the dashboard. The list of models displays.
  2. Select the Deployments tab.
  3. From the drop-down menu, select the geographical region where your deployment was created.
  4. Choose the deployment you wish to delete.
  5. Navigate to the Settings tab.
  6. Click Delete deployment at the bottom of the page.
  7. Type DELETE to confirm and click Delete deployment.

Alternatively, from the Deployments listing, click the more iconDotsHorizontalIcon icon next to the deployment name you no longer need, and click Delete. A pop-up appears. Type DELETE to confirm, then click Delete deployment.

AlertCircleIcon
Important

Deleting a deployment is a permanent action that erases all its associated data and resources.

SearchIcon
No Results