Overview of Deploying Agents in OCI Generative AI
You can deploy agents by using OCI Generative AI Applications, which provide a managed runtime for containerized agent workloads.
To deploy an agent, package it as a container image, upload it to Oracle Cloud Infrastructure Registry (OCIR), and deploy it by using the OCI Console, API, or CLI.
During deployment, configure:
- Scaling
- Storage
- Networking
- Authentication
After deployment, the service provisions an endpoint (for example, an HTTP URL) that clients or other agents can use to invoke the agent.
How it Works
After developing an agent locally (for example, by using LangGraph or similar frameworks), you create a Generative AI application to define the runtime configuration.
You then create a deployment by selecting a container image. The active deployment serves requests through the application endpoint. After the deployment is provisioned, the endpoint becomes available for invoking the agent.