Creating a Deployment

Create a deployment for an application in the OCI Generative AI service where you can deploy an agent container to the service to serve requests from clients.

Create a deployment by selecting a container image and image tag. You can activate the deployment to make it the running version for the application. This task documents the steps for creating a deployment by using the Oracle Cloud Console.

Tip

After you create a deployment, you can view its details and perform other tasks, such as activating it, copying its OCID, or deleting it. Use the Actions menu (three dots) in the Console to access these tasks. For more information, see Listing deployments.

To create a deployment, follow these steps:

  1. On the Applications list page, select the application that you want to create a deployment for. If you need help finding the list page, see Listing applications.
  2. In the application's detail page, select the Deployments tab.
  3. Select Create deployment.
  4. Select the following information for the container image.
    • repository compartment
    • repository
    • version tag
  5. (Optional) To make this deployment the running version for the application, enable Deploy and activate.

    Only one deployment can be active at a time. When you activate a deployment, the deployment that was active before (if any) becomes inactive.

  6. Select Create deployment.
    Note

    Before you use the deployment, wait until the Status displays Active and the container is up and ready to serve.