A model represents a pre-trained machine learning model that can be deployed on the Generative APIs - Dedicated Deployment service.
They are used to define the inference model, its source, and its compatibility with the available nodes. Some models may be available in multiple parameters sizes, which will affect the performance and the accuracy of the model.
List models
List all available models.
path Parameters
regionThe region you want to target
query Parameters
order_byOrder in which to return results.
pagePage number to return.
page_sizeMaximum number of models to return per page.
project_idFilter by Project ID. (UUID format)
organization_idFilter by Organization ID. (UUID format)
nameFilter by model name.
tagsFilter by tags.
List models › Responses
List of models on the current page.
total_countTotal number of models.
Import a model
Import a new model to your model library.
path Parameters
regionThe region you want to target
Import a model › Request Body
nameName of the model.
project_idID of the Project to import the model in. (UUID format)
Where to import the model from.
Import a model › Responses
idUnique identifier. (UUID format)
nameUnique Name identifier.
project_idProject ID. (UUID format)
tagsList of tags applied to the model.
statusStatus of the model.
descriptionPurpose of the model.
error_messageDisplays information if your model is in error state.
has_eulaDefines whether the model has an end user license agreement.
created_atCreation date of the model. (RFC 3339 format)
updated_atLast modification date of the model. (RFC 3339 format)
regionRegion of the model.
Supported nodes types with quantization options and context lengths.
parameter_size_bitsSize, in bits, of the model parameters.
size_bytesTotal size, in bytes, of the model files.
Get a model
Get the model for the given ID.
path Parameters
regionThe region you want to target
model_idID of the model to get. (UUID format)
Get a model › Responses
idUnique identifier. (UUID format)
nameUnique Name identifier.
project_idProject ID. (UUID format)
tagsList of tags applied to the model.
statusStatus of the model.
descriptionPurpose of the model.
error_messageDisplays information if your model is in error state.
has_eulaDefines whether the model has an end user license agreement.
created_atCreation date of the model. (RFC 3339 format)
updated_atLast modification date of the model. (RFC 3339 format)
regionRegion of the model.
Supported nodes types with quantization options and context lengths.
parameter_size_bitsSize, in bits, of the model parameters.
size_bytesTotal size, in bytes, of the model files.
Get a model EULA
Get the EULA for the given model ID.
path Parameters
regionThe region you want to target
model_idID of the model to get the Eula for. (UUID format)
Get a model EULA › Responses
contentContent of the end user license agreement.