Overview of the Generative AI Service
OCI Generative AI is a fully managed Oracle Cloud Infrastructure service for building, deploying, and operating generative AI applications at enterprise scale.
Core Features
OCI Generative AI supports these core generative AI use cases:
- Chat for conversational experiences such as question answering and virtual assistants
- Embeddings for semantic search, recommendation, classification, and clustering
- Rerank for ordering documents by relevance to a query
- OpenAI-compatible APIs for integrating with existing tools and SDKs
Model Usage Options
You can use OCI Generative AI in these ways:
- Use pretrained hosted models through the playground, API, or CLI.
- Import, fine-tune, and host custom models on dedicated AI clusters.
- Move from experimentation to production with enterprise controls.
Enterprise AI features
OCI Generative AI includes Enterprise AI features for building production-grade agentic applications.
These features include:
- OCI Responses API
- OpenAI Responses-compatible API for model interaction and agentic workflows
- Supports orchestration, reasoning, tool use, memory, and multi-model routing
- Tools
- File Search
- Code Interpreter
- Function Calling for local tools
- MCP Calling for remote MCP servers
- Containers API
- Vector Stores API
- Files API
- Memory
- Conversations API
- Long-term memory
- Short-term memory context compaction
- Projects
- Organize agent workloads by project
- Isolate conversations, files, containers, and memory
- Configure data retention and memory settings
- Applications
- Fully managed hosting for agentic applications
- Support for applications built with open source frameworks or MCP servers
- Built-in security controls
- Public and private endpoint support
- Vector Stores
- Managed vector storage
- File ingestion
- Semantic search
- Metadata filtering
- Support for RAG and NL2SQL use cases
- NL2SQL
- Ingests customer schema information
- Enriches schema data into a semantic vector store
- Accepts natural language queries and produces SQL
- Runs in a permission-controlled manner without moving or copying database content
- Enterprise AI API Keys
- OCI-specific API keys for Enterprise AI services
- Automatic rotation
Platform Benefits
- Build production-ready AI applications faster
- Reduce operational complexity
- Apply enterprise governance and security controls
- Use a unified platform for generative AI, retrieval, memory, tools, and managed hosting
Related Topics
- For services that call into Generative AI, see Generative AI Regions.
- For regions with Generative AI, see Generative AI Regions
- For models available by region, see Generative AI Models by Region.
- For models and regional supported for Enterprise AI tasks, see Generative AI Models and Regions for Agentic API.
- For accessing Generative AI in the Console, see Accessing Generative AI in the Console.