scaleway.inference.v1.Deployment
idUnique identifier. (UUID format)
nameName of the deployment.
project_idProject ID. (UUID format)
statusStatus of the deployment.
tagsList of tags applied to the deployment.
node_type_nameNode type of the deployment.
List of endpoints.
sizeCurrent size of the pool.
min_sizeDefines the minimum size of the pool.
max_sizeDefines the maximum size of the pool. Currently, autoscaling is not yet supported, and this value must be equal to min_size.
error_messageDisplays information if your deployment is in error state.
model_idID of the model used for the deployment. (UUID format)
Quantization parameters for this deployment.
model_nameName of the deployed model.
created_atCreation date of the deployment. (RFC 3339 format)
updated_atLast modification date of the deployment. (RFC 3339 format)
regionRegion of the deployment.
scaleway.inference.v1.Endpoint
idUnique identifier. (UUID format)
urlURL of the endpoint.
For private endpoints, the URL will be accessible only from the Private Network.
In addition, private endpoints will expose a CA certificate that can be used to verify the server's identity.
This CA certificate can be retrieved using the GetDeploymentCertificate API call.
public_networkDefines whether the endpoint is public.
Details of the Private Network.
disable_authDefines whether the authentication is disabled.
scaleway.inference.v1.EndpointSpec
public_networkSet the endpoint as public.
Set the endpoint as private. Private endpoints are only accessible from the Private Network.
disable_authDisable the authentication on the endpoint. By default, deployments are protected by IAM authentication. When setting this field to true, the authentication will be disabled.
scaleway.inference.v1.ListDeploymentsResponse
List of deployments on the current page.
total_countTotal number of deployments.
scaleway.inference.v1.ListModelsResponse
List of models on the current page.
total_countTotal number of models.
scaleway.inference.v1.ListNodeTypesResponse
List of node types.
total_countTotal number of node types.
scaleway.inference.v1.Model
idUnique identifier. (UUID format)
nameUnique Name identifier.
project_idProject ID. (UUID format)
tagsList of tags applied to the model.
statusStatus of the model.
descriptionPurpose of the model.
error_messageDisplays information if your model is in error state.
has_eulaDefines whether the model has an end user license agreement.
created_atCreation date of the model. (RFC 3339 format)
updated_atLast modification date of the model. (RFC 3339 format)
regionRegion of the model.
Supported nodes types with quantization options and context lengths.
parameter_size_bitsSize, in bits, of the model parameters.
size_bytesTotal size, in bytes, of the model files.
scaleway.inference.v1.ModelSupportedNode
node_type_nameSupported node type.
Supported quantizations.
scaleway.inference.v1.ModelSupportedQuantization
quantization_bitsNumber of bits for this supported quantization.
allowedTells whether this quantization is allowed for this node type.
max_context_sizeMaximum inference context size available for this node type and quantization.
scaleway.inference.v1.NodeType
nameName of the node type.
stock_statusCurrent stock status for the node type.
descriptionCurrent specs of the offer.
vcpusNumber of virtual CPUs.
memoryQuantity of RAM. (in bytes)
vramQuantity of GPU RAM. (in bytes)
disabledThe node type is currently disabled.
betaThe node type is currently in beta.
created_atCreation date of the node type. (RFC 3339 format)
updated_atLast modification date of the node type. (RFC 3339 format)
gpusNumber of GPUs.
regionRegion of the node type.