ChatOCIModelDeployment
This will help you getting started with OCIModelDeployment chat models. For detailed documentation of all ChatOCIModelDeployment features and configurations head to the API reference.
OCI Data Science is a fully managed and serverless platform for data science teams to build, train, and manage machine learning models in the Oracle Cloud Infrastructure. You can use AI Quick Actions to easily deploy LLMs on OCI Data Science Model Deployment Service. You may choose to deploy the model with popular inference frameworks such as vLLM or TGI. By default, the model deployment endpoint mimics the OpenAI API protocol.
For the latest updates, examples and experimental features, please see ADS LangChain Integration.
Overview
Integration details
Class | Package | Local | Serializable | JS support | Package downloads | Package latest |
---|---|---|---|---|---|---|
ChatOCIModelDeployment | langchain-community | ❌ | beta | ❌ |
Model features
Tool calling | Structured output | JSON mode | Image input | Audio input | Video input | Token-level streaming | Native async | Token usage | Logprobs |
---|---|---|---|---|---|---|---|---|---|
depends | depends | depends | depends | depends | depends | ✅ | ✅ | ✅ | ✅ |
Some model features, including tool calling, structured output, JSON mode and multi-modal inputs, are depending on deployed model.
Setup
To use ChatOCIModelDeployment you'll need to deploy a chat model with chat completion endpoint and install the langchain-community
, langchain-openai
and oracle-ads
integration packages.
You can easily deploy foundation models using the AI Quick Actions on OCI Data Science Model deployment. For additional deployment examples, please visit the Oracle GitHub samples repository.