Class ScalingConfig
The autoscaling configuration for the Hosted Application. Defines the minimum and maximum number of replicas. When unspecified, the service applies its own default scaling values.
Namespace: Oci.GenerativeaiService.Models
Assembly: OCI.DotNetSDK.Generativeai.dll
Syntax
public class ScalingConfig
Properties
MaxReplica
Declaration
[JsonProperty(PropertyName = "maxReplica")]
public int? MaxReplica { get; set; }
Property Value
| Type | Description |
|---|---|
| int? | Maximum number of replicas allowed. |
MinReplica
Declaration
[JsonProperty(PropertyName = "minReplica")]
public int? MinReplica { get; set; }
Property Value
| Type | Description |
|---|---|
| int? | Minimum number of replicas to keep running. |
ScalingType
Declaration
[Required(ErrorMessage = "ScalingType is required.")]
[JsonProperty(PropertyName = "scalingType")]
[JsonConverter(typeof(ResponseEnumConverter))]
public ScalingConfig.ScalingTypeEnum? ScalingType { get; set; }
Property Value
| Type | Description |
|---|---|
| ScalingConfig.ScalingTypeEnum? | The scaling type for the application. |
Remarks
Required
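The members of ScalingConfig.ScalingTypeEnum are not listed in this section. As a minimal sketch, assuming the OCI .NET SDK assembly named above is referenced by the project, the available names can be printed at runtime with standard reflection instead of being guessed. The class name ListScalingTypes is hypothetical.

```csharp
using System;
using Oci.GenerativeaiService.Models;

// Hypothetical helper class, not part of the SDK.
public static class ListScalingTypes
{
    public static void Main()
    {
        // Enum.GetNames returns every member name declared on the enum,
        // so no member name has to be assumed here.
        foreach (string name in Enum.GetNames(typeof(ScalingConfig.ScalingTypeEnum)))
        {
            Console.WriteLine(name);
        }
    }
}
```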
TargetConcurrencyThreshold
Declaration
[JsonProperty(PropertyName = "targetConcurrencyThreshold")]
public int? TargetConcurrencyThreshold { get; set; }
Property Value
| Type | Description |
|---|---|
| int? | The number of simultaneous requests that each replica can process. |
TargetCpuThreshold
Declaration
[JsonProperty(PropertyName = "targetCpuThreshold")]
public int? TargetCpuThreshold { get; set; }
Property Value
| Type | Description |
|---|---|
| int? | Scale up if average CPU utilization exceeds this threshold. |
TargetMemoryThreshold
Declaration
[JsonProperty(PropertyName = "targetMemoryThreshold")]
public int? TargetMemoryThreshold { get; set; }
Property Value
| Type | Description |
|---|---|
| int? | Scale up if average memory utilization exceeds this threshold. |
TargetRpsThreshold
Declaration
[JsonProperty(PropertyName = "targetRpsThreshold")]
public int? TargetRpsThreshold { get; set; }
Property Value
| Type | Description |
|---|---|
| int? | The target number of requests per second (RPS) for each replica of the application. |
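To show how the properties above fit together, the following sketch builds a ScalingConfig with an object initializer and prints the JSON produced from the JsonProperty names in the declarations. It assumes the OCI .NET SDK and Newtonsoft.Json are referenced. The numeric values are illustrative only, and the unit of TargetCpuThreshold is an assumption, since this section does not state it. ScalingType is left unset because its enum members are not listed here; a real request would need it, as the property is marked Required. The class name ScalingConfigExample is hypothetical.

```csharp
using System;
using Newtonsoft.Json;
using Oci.GenerativeaiService.Models;

// Hypothetical example class, not part of the SDK.
public static class ScalingConfigExample
{
    public static void Main()
    {
        var config = new ScalingConfig
        {
            // ScalingType is required by the service; set it to a member of
            // ScalingConfig.ScalingTypeEnum before sending a real request.
            MinReplica = 1,          // illustrative value
            MaxReplica = 4,          // illustrative value
            TargetCpuThreshold = 70  // illustrative value; unit is an assumption
        };

        // Skip null properties so only the values set above are written.
        var settings = new JsonSerializerSettings
        {
            NullValueHandling = NullValueHandling.Ignore
        };

        Console.WriteLine(JsonConvert.SerializeObject(config, Formatting.Indented, settings));
    }
}
```

Because every property is a nullable type, any property left unset stays null, which matches the class description: values that are not specified fall back to the service defaults.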