Skip to navigationSkip to main contentSkip to footerScaleway Docs HomepageAsk our AI
Ask our AI

Generative APIs

Generative APIs enable you to deploy, manage, and scale AI models through serverless endpoints or dedicated infrastructure, hosted in European data centers.

Generative APIs QuickstartArrowRightIcon

Getting Started

How-tos

Check our guides about creating, managing, and using Generative APIs endpoints.

View How-tosArrowRightIcon
Generative APIs Developer Reference - Serverless

Learn how to manage your Serverless endpoints through the API.

Go to Generative APIs - Serverless APIArrowRightIcon
Generative APIs Developer Reference - Dedicated Deployment

Learn how to manage your Dedicated Deployment endpoints through the API.

Go to Generative APIs - Dedicated Deployment APIArrowRightIcon

Changelog

  • Generative APIs

    Starting 1 May 2026, Managed Inference becomes Generative APIs - Dedicated Deployment. All APIs, existing resources, pricing, and SLAs remain unchanged. All Managed Inference related content (such as documentation or Cockpit dashboards) will gradually be renamed Generative APIs - Dedicated.

  • Generative APIs

  • Generative APIs

    Generative APIs billing is performed in slices of 1,000 tokens (instead of the previous slices of 1,000,000 tokens) starting April 20th. Prices do not change with this update. After this change, for a similar token usage, any bill will remain the same or may be slightly lower.

View the full changelog
Questions?

Visit our Help Center and find the answers to your most frequent questions.

Visit Help CenterArrowRightIcon
No Results