Tip
These limits only apply if you created a Scaleway Account and registered a valid payment method. Otherwise, stricter limits apply to ensure usage stays within Free Tier only.
Any model served through Scaleway Generative APIs gets limited by:
These limits only apply if you created a Scaleway Account and registered a valid payment method. Otherwise, stricter limits apply to ensure usage stays within Free Tier only.
Model string | Requests per minute | Total tokens per minute |
---|---|---|
llama-3.1-8b-instruct | 300 | 100K |
llama-3.1-70b-instruct | 300 | 100K |
mistral-nemo-instruct-2407 | 300 | 100K |
pixtral-12b-2409 | 300 | 100K |
qwen2.5-32b-instruct | 300 | 100K |
Model string | Requests per minute | Input tokens per minute |
---|---|---|
sentence-t5-xxl | 100 | 200K |
bge-multilingual-gemma2 | 100 | 200K |
These limits safeguard against abuse or misuse of Scaleway Generative APIs, helping to ensure fair access to the API with consistent performance.
We actively monitor usage and will improve rates based on feedback. If you need to increase your rate limits, contact us via the support team, providing details on the model used and specific use case.
Your opinion helps us make a better documentation.