For Autoscaling Model Deployments
Troubleshoot autoscaling model deployments.
To help troubleshoot autoscaling issues, see Debugging a Model Deployment Failure. Autoscaling operations use the update work request type, so when an error occurs, examine the failed update work requests. Any errors are listed under Error Messages. Review the details of the error and take appropriate action.
Service Timed Out during Operation
When creating, updating, or activating a model deployment with an autoscaling policy, the operation fails with a Service Timed Out error.
The system checks the customer tenancy for an IAM policy that lets the autoscaling service retrieve metrics. If that policy is missing, the service might time out.
Invalid TQL query
- Failed to provision compute resources due to an invalid parameter in the request. Invalid TQL query.
The query in the autoscaling policy is incorrect or invalid.
Scaling isn't Triggered or Takes too Long to Complete
Scaling isn't triggered or takes too long to complete.
A likely cause is the cool-down period. The cool-down period is applied after the model deployment is created or updated, and between scaling events. It also resets after every update operation performed by a user.
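To reason about why a scaling event hasn't started yet, the cool-down behavior described above can be modeled as a timer that restarts on every creation, update, or scaling event. The following is a minimal sketch under that assumption, not the service's implementation. The function names, the event list, and the 10-minute cool-down value are all hypothetical and chosen only for illustration.

```python
from datetime import datetime, timedelta
from typing import Iterable


def cooldown_remaining(
    reset_events: Iterable[datetime],
    cooldown: timedelta,
    now: datetime,
) -> timedelta:
    """Return the time left before scaling may trigger (zero if elapsed).

    reset_events holds the timestamps of everything assumed to restart the
    cool-down: creation, user-performed updates, and earlier scaling events.
    """
    # The most recent event is the one the cool-down is measured from.
    last_reset = max(reset_events)
    elapsed = now - last_reset
    return max(cooldown - elapsed, timedelta(0))


def can_scale(
    reset_events: Iterable[datetime],
    cooldown: timedelta,
    now: datetime,
) -> bool:
    """True when the cool-down has fully elapsed since the last reset."""
    return cooldown_remaining(reset_events, cooldown, now) == timedelta(0)


# Hypothetical timeline: created at 12:00, updated by a user at 12:08.
created = datetime(2024, 1, 1, 12, 0)
user_update = datetime(2024, 1, 1, 12, 8)
cooldown = timedelta(minutes=10)  # assumed value, for illustration only

# At 12:12 the update has restarted the timer, so scaling is still blocked.
blocked = not can_scale([created, user_update], cooldown, datetime(2024, 1, 1, 12, 12))
```

In this model, a user who updates the deployment repeatedly keeps pushing the earliest possible scaling time forward, which matches the symptom of scaling that appears never to trigger.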