This is an optional step that seeks to simplify your usage of the APIs.
Generative APIs API
Introduction
Scaleway Generative APIs provides access to the latest AI models hosted on Scaleway infrastructure.
Generative APIs specification targets OpenAI API compatibility.
Concepts
Refer to our dedicated concepts page to find definitions of the different terms referring to Generative APIs.
Quickstart
-
Configure your environment variables.
Noteexport SCW_ACCESS_KEY="<API access key>"export SCW_SECRET_KEY="<API secret key>"export SCW_REGION="<Scaleway region>" -
Generate content from a model by running the following command.
curl https://api.scaleway.ai/v1/chat/completions \-H "Content-Type: application/json" \-H "Authorization: Bearer $SCW_SECRET_KEY" \-d '{"model": "llama-3.3-70b-instruct","messages": [{"role": "system","content": "You are a helpful assistant."},{"role": "user","content": "Hello!"}]}'
See How to use Generative APIs
for quickstart and snippets using REST
request or libraries such as openai
python client.
To perform the following steps, you must first ensure that:
- you have an account and are logged into the Scaleway console
- you have created an API key and that the API key has sufficient IAM permissions to perform the actions described on this page.
- you have installed
curl
Technical Information
Regions
Scaleway's infrastructure is spread across different regions and Availability Zones.
Generative APIs is available in the Paris region, which is represented by the following path parameters (optional while there is only one region):
fr-par
Supported endpoints and features
Supported endpoints are:
/v1/chat/completions
/v1/embeddings
/v1/models
The /v1/chat/completions
endpoint:
- Supports many features such as:
- Structured outputs (JSON response format)
- Tool calling (ie. compatibility with workflows using MCP servers)
- Sending and analyzing images
- Does not yet support the following parameters:
user
,system_fingerprint
,reasoning_effort
,modalities
,web_search_options
,service_tier
,store
,audio
,prediction
,metadata
.
Third party tool integration
For full details of direct integration into third party tooling, see Integrating Scaleway Generative APIs with popular AI tools. If your tool is not listed, you can still specify the Scaleway URL and API key in most OpenAI-like plugins, as compatibility largely depends on the above APIs.
Technical Limitations
When choosing a model, select the ones compatible with the APIs endpoints you want to use
in our model catalog.
For example, /v1/embeddings
is only available for embeddings models.
Going Further
For more information about Generative APIs, you can check out the following pages:
- Generative APIs Documentation
- Generative APIs FAQ
- Scaleway Slack Community - AI Channel join the #ai channel
- Contact our support team
Troubleshoooting
See Troubleshooting Generative APIs for advanced APIs behaviour descriptions and common issues solutions.
Chat Completions
A chat completion is a model response for a given conversation. It represents the functionality of generating a response in a chat context
POST
/v1/{project_id}/chat/completions
Embeddings
A vector representation of an input. Similar vectors corresponds to semantically similar inputs.
See How to query embedding models
for code snippets using openai
Python client.
POST
/v1/{project_id}/embeddings
Models
A model refers to a system that has been trained to generate content such as text, images, or other data types based on input prompts or instructions
GET
/v1/models