ScalewaySkip to loginSkip to main contentSkip to footer section

Model-as-a-service

Serve Generative AI models and pay for a dedicated infrastructure or for millions of tokens

Generative APIs

Serve the latest AI models via API, pay by million token

Generative API
qwen3-235b-a22b-instruct-2507QwenChat€0.75 /million tokens€2.25 /million tokensTry
gpt-oss-120bOpenAIChat€0.15 /million tokens€0.60 /million tokensTry
gemma-3-27b-itGoogleChat and Vision€0.25 /million tokens€0.50 /million tokensTry
whisper-large-v3OpenAIAudio transcription€0.003 /Audio minuteFreeTry
voxtral-small-24b-2507MistralAudio transcription and Chat€0.15 /million tokens€0.35 /million tokensTry
mistral-small-3.2-24b-instruct-2506MistralChat and Vision€0.15 /million tokens€0.35 /million tokensTry
llama-3.3-70b-instructMetaChat€0.90 /million tokens€0.90 /million tokensTry
deepseek-r1-distill-llama-70bDeepseekChat€0.90 /million tokens€0.90 /million tokensTry
bge-multilingual-gemma2BAAIEmbeddings€0.10 /million tokensFreeTry
qwen3-coder-30b-a3b-instructQwenChat€0.20 /million tokens€0.80 /million tokensTry
pixtral-12b-2409MistralChat and Vision€0.20 /million tokens€0.20 /million tokensTry
mistral-nemo-instruct-2407MistralChat€0.20 /million tokens€0.20 /million tokensTry
llama-3.1-8b-instructMetaChat€0.20 /million tokens€0.20 /million tokensTry
mistral-small-3.1-24b-instruct-2503MistralChat and Vision€0.15 /million tokens€0.35 /million tokensTry
qwen2.5-coder-32b-instructQwenChat€0.90 /million tokens€0.90 /million tokensTry
llama-3.1-70b-instructMetaChat€0.90 /million tokens€0.90 /million tokensTry
devstral-small-2505MistralChat€0.15 /million tokens€0.35 /million tokensTry
Legal notice

Prices before tax.
You benefit from a free tier on the first 1,000,000 tokens. You'll be charged from token number 1,000,001.

Managed Inference

Deploy your managed AI infrastructure with dedicated GPUs and optimized models. You are charged for usage of the GPU type you choose. Billing only starts once the model is deployed

ModelGPUPriceApprox. per month
llama-3.1-8b-instructL4-1-24G€0.93/hour~€679/month
L40S-1-48G€1.72/hour~€1256/month
H100-1-80G€3.40/hour~€2482/month
H100-2-80G€6.68/hour~€4876/month
llama-3.3-70b-instructH100-2-80G€6.68/hour~€4876/month
llama-3.1-70b-instructH100-1-80G€3.40/hour~€2482/month
H100-2-80G€6.68/hour~€4876/month
llama-3.1-nemotron-70b-instructH100-1-80G€3.40/hour~€2482/month
H100-2-80G€6.68/hour~€4876/month
mistral-7b-instruct-v0.3L4-1-24G€0.93/hour~€679/month
L40S-1-48G€1.72/hour~€1256/month
H100-1-80G€3.40/hour~€2482/month
H100-2-80G€6.68/hour~€4876/month
mixtral-8x7b-instruct-v0.1H100-1-80G€3.40/hour~€2482/month
H100-2-80G€6.68/hour~€4876/month
mistral-nemo-instruct-2407L40S-1-48G€1.72/hour~€1256/month
H100-1-80G€3.40/hour~€2482/month
H100-2-80G€6.68/hour~€4876/month
pixtral-12b-2409L40S-1-48G€1.72/hour~€1256/month
H100-1-80G€3.40/hour~€2482/month
H100-2-80G€6.68/hour~€4876/month
molmo-72b-0924H100-2-80G€6.68/hour~€4876/month
qwen2.5-coder-32b-instructH100-1-80G€3.40/hour~€2482/month
H100-2-80G€6.68/hour~€4876/month
bge-multilingual-gemma2L4-1-24G€0.93/hour~€679/month
L40S-1-48G€1.72/hour~€1256/month
sentence-t5-xxlL4-1-24G€0.93/hour~€679/month
Legal notice

Prices before tax