Managed Inference

Deploy AI models on sovereign infrastructure, and manage and scale inference with full data privacy. Get started with a simple interface for creating dedicated endpoints.

Managed Inference Quickstart

Getting Started

Quickstart

Learn how to create, connect to, and delete a Managed Inference endpoint in a few steps.

View Quickstart

Concepts

Core concepts that give you a better understanding of Scaleway Managed Inference.

View Concepts

How-tos

Check out our guides on creating and managing Managed Inference endpoints.

View How-tos

Additional content

Guides to help you choose a Managed Inference endpoint and understand pricing and advanced configuration.

View additional content

Managed Inference API

Learn how to create and manage your Scaleway Managed Inference endpoints through the API.

Go to Managed Inference API

Changelog

June 2025

Managed Inference
Added
New supported model - Devstral
Devstral Small 2505 is now available for deployment on Managed Inference.

Devstral is a fine-tune of Mistral Small 3.1, optimized for agentic software engineering tasks. Refer to the official documentation for more information and try Devstral within your IDE.

May 2025

Managed Inference
Added
Managed Inference is now in General Availability
Deploy the latest AI models from our model catalog to benefit from guaranteed performance and strong security within your VPC.

April 2025

Managed Inference
Added
Custom model deployment available in beta
Custom models can now be deployed on Managed Inference.
Try it now by providing the Hugging Face URL of a compatible model.

View the full changelog

Questions?

Visit our Help Center to find answers to the most frequently asked questions.

Visit Help Center