Skip to navigationSkip to main contentSkip to footerScaleway DocsAsk our AI
Ask our AI

Supported models

Our API supports the most popular models for Chat, Vision, Audio and Embeddings.

Multimodal models

Chat and Vision models

ProviderModel stringContext window (Tokens)Maximum output (Tokens)License *Model card
Google (Preview)gemma-3-27b-it40k8192GemmaHF
Mistralmistral-small-3.2-24b-instruct-2506128k8192Apache-2.0HF
Hholo2-30b-a3b22k8192CC-BY-NC-4.0HF

*Licences which are not open-weight and may restrict commercial usage (such as CC-BY-NC-4.0), do not apply to usage through Scaleway Products due to existing partnerships between Scaleway and the corresponding providers. Original licences are provided for transparency only.

Chat and Audio models

ProviderModel stringContext window (Tokens)Maximum output (Tokens)LicenseModel card
Mistralvoxtral-small-24b-250732k8192Apache-2.0HF

Audio transcription models

ProviderModel stringMaximum audio duration (Minutes)Chunk size (Seconds)Maximum file size (MB)LicenseModel card
Mistralvoxtral-small-24b-2507303025Apache-2.0HF
OpenAIwhisper-large-v3-3025Apache-2.0HF

Chat models

ProviderModel stringContext window (Tokens)Maximum output (Tokens)LicenseModel card
OpenAIgpt-oss-120b128k8192Apache-2.0HF
Metallama-3.3-70b-instruct100k4096Llama 3.3 CommunityHF
Metallama-3.1-8b-instruct128k16384Llama 3.1 CommunityHF
Mistralmistral-nemo-instruct-2407128k8192Apache-2.0HF
Mistraldevstral-2-123b-instruct-2512200k8192Modified MITHF
Qwenqwen3-235b-a22b-instruct-2507250k8192Apache-2.0HF
Qwenqwen3-coder-30b-a3b-instruct128k8192Apache-2.0HF
DeepSeekdeepseek-r1-distill-llama-70b32k4096MITHF
Tip

If you are unsure which chat model to use, we currently recommend Mistral Small 3.2 24B Instruct (mistral-small-3.2-24b-instruct-2506) to get started.

Vision models

ProviderModel stringContext window (Tokens)Maximum output (Tokens)LicenseModel card
Mistralpixtral-12b-2409128k4096Apache-2.0HF
Note

Image sizes are limited to 32 million pixels (e.g., a resolution of about 8096 x 4048). Images with a resolution higher than 1024 x 1024 are supported, but automatically downscaled to fit these limitations (image ratio and proportions will be preserved).

Embedding models

Our Embeddings API provides built-in support for the following models, hosted in Scaleway data centers, available via serverless endpoints.

ProviderModel stringEmbedding dimension (Maximum)Embedding dimensions (Minimum)Context windowLicenseModel card
Qwenqwen3-embedding-8b40963232 000Apache-2.0HF
BAAIbge-multilingual-gemma2358435848192GemmaHF

Request a model

Do not see a model you want to use? Tell us or vote for what you would like to add here.

Deprecated models

These models can still be accessed in Generative APIs, but their End of Life (EOL) is planned according to our model lifecycle policy. Deprecated models should not be queried anymore. We recommend to use newer models available in Generative APIs or to deploy these models in dedicated Managed Inference deployments.

ProviderModel stringDeprecation dateEnd of Life (EOL) dateAfter End of Life date, requests routed to model
Deepseekdeepseek-r1-distill-llama-70b16th January, 202616th April, 2026llama-3.3-70b-instruct
Mistralmistral-nemo-instruct-240716th January, 202616th April, 2026mistral-small-3.2-24b-instruct-2506
Metallama-3.1-8b-instruct16th January, 202616th April, 2026mistral-small-3.2-24b-instruct-2506

End of Life (EOL) models

These models are not accessible anymore from Generative APIs. They can still however be deployed on dedicated Managed Inference deployments.

ProviderModel stringEOL dateRequests routed to model
Mistralmistral-small-3.1-24b-instruct-250314th November, 2025mistral-small-3.2-24b-instruct-2506
Mistraldevstral-small-250514th November, 2025qwen3-coder-30b-a3b-instruct
Qwenqwen2.5-coder-32b-instruct14th November, 2025qwen3-coder-30b-a3b-instruct
Metallama-3.1-70b-instruct25th May, 2025llama-3.3-70b-instruct
SBERTsentence-t5-xxl26 February, 2025None
Still need help?

Create a support ticket
No Results