# Replicate

```json
{
  "name": "Replicate",
  "slug": "replicate",
  "url": "https://composio.dev/toolkits/replicate",
  "markdown_url": "https://composio.dev/toolkits/replicate.md",
  "logo_url": "https://logos.composio.dev/api/replicate",
  "categories": [
    "ai & machine learning"
  ],
  "is_composio_managed": false,
  "updated_at": "2026-05-12T10:23:52.496Z"
}
```

![Replicate logo](https://logos.composio.dev/api/replicate)

## Description

Securely connect your AI agents and chatbots (Claude, ChatGPT, Cursor, etc.) with Replicate through MCP or the direct API to run models, manage jobs, fetch results, and monitor AI pipelines through natural language.

## Summary

Replicate is a cloud API platform for running AI models without managing infrastructure. Get scalable, production-ready access to the latest machine learning models in minutes.

## Categories

- ai & machine learning

## Toolkit Details

- Tools: 31

## Images

- Logo: https://logos.composio.dev/api/replicate

## Authentication

- **API Key**
  - Type: `api_key`
  - Description: API key authentication for Replicate.
  - Setup:
    - Configure API key credentials for Replicate.
    - Use the credentials when creating an auth config in Composio.

## Suggested Prompts

- Run Stable Diffusion to generate an image
- List all my uploaded files on Replicate
- Get README documentation for a model
- Show example predictions for a specific model

## Supported Tools

| Tool slug | Name | Description |
|---|---|---|
| `REPLICATE_ACCOUNT_GET` | Get Account Information | Tool to get authenticated account information. Use when you need to retrieve details about the account associated with the API token. |
| `REPLICATE_CANCEL_PREDICTION` | Cancel Prediction | Tool to cancel a prediction that is still running. Use when you need to stop an in-progress prediction to free up resources or halt execution. |
| `REPLICATE_COLLECTIONS_GET` | Get model collection | Tool to get a specific collection of models by its slug. Use when you need detailed information about a collection and its models. |
| `REPLICATE_COLLECTIONS_LIST` | List model collections | Tool to list all collections of models. Use when you need to retrieve available model collections. Collections are curated groupings of related models. Response includes only collection metadata (name, slug, description), not individual models within each collection; use REPLICATE_MODELS_GET for per-model details. Response may include a non-null `next` field indicating additional pages; follow it to enumerate all collections. |
| `REPLICATE_CREATE_MODEL` | Create Model | Tool to create a new Replicate model with specified owner, name, visibility, and hardware. Use when you need to create a destination model before launching LoRA/fine-tune training. |
| `REPLICATE_CREATE_PREDICTION` | Create Prediction | Tool to create a prediction for a Replicate Deployment. IMPORTANT: This action ONLY works with Replicate Deployments (persistent instances you create and manage), NOT public models. Deployments are created via REPLICATE_DEPLOYMENTS_CREATE. To run public models (e.g., 'meta/llama-2-70b-chat', 'stability-ai/sdxl'), use REPLICATE_MODELS_PREDICTIONS_CREATE instead. Use 'wait_for' to wait until the prediction completes. |
| `REPLICATE_DEPLOYMENTS_CREATE` | Create Deployment | Tool to create a new deployment with specified model, version, hardware, and scaling parameters. Use when you need to deploy a model for production use with auto-scaling. |
| `REPLICATE_DEPLOYMENTS_DELETE` | Delete Deployment | Tool to delete a deployment from your account. Use when you need to remove a deployment. Deployments must be offline and unused for at least 15 minutes before deletion. |
| `REPLICATE_DEPLOYMENTS_GET` | Get Deployment Details | Tool to get deployment details by owner and name. Use when you need information about a specific deployment including its release configuration and hardware settings. |
| `REPLICATE_DEPLOYMENTS_LIST` | List deployments | Tool to list all deployments associated with the account. Use when you need to retrieve deployment configurations and their latest releases. |
| `REPLICATE_CREATE_FILE` | Create File | Tool to create or upload a file to Replicate. Use when you need to upload file content with optional metadata. |
| `REPLICATE_FILES_DELETE` | Delete File | Tool to delete a file by its ID. Use when you need to remove a file from storage. Returns 204 No Content on success. |
| `REPLICATE_FILES_GET` | Get File Details | Tool to get details of a file by its ID. Use when you need to inspect uploaded file information before further operations. Returned URLs may be short-lived; download or persist needed files promptly after retrieval. |
| `REPLICATE_FILES_LIST` | List Files | Tool to retrieve a paginated list of uploaded files. Use to view all files created by the authenticated user or organization. Files are sorted with most recent first. Pagination is cursor-based: follow the next cursor until empty to retrieve all files. Limit requests to 1–2/second to avoid 429 Too Many Requests errors. Use to validate current file_ids before passing to prediction tools, as stale file_ids cause runtime errors. |
| `REPLICATE_GET_PREDICTION` | Get Prediction | Tool to get the status and output of a prediction by its ID. Use when you need to check on a running prediction or retrieve the results of a completed prediction. |
| `REPLICATE_HARDWARE_LIST` | List Available Hardware | Tool to list available hardware SKUs for models and deployments. Use when you need to see what hardware options are available on the Replicate platform. |
| `REPLICATE_MODELS_EXAMPLES_LIST` | List model examples | Tool to list example predictions for a specific model. Use when you want to retrieve author-provided illustrative examples after identifying the model. Returned examples are minimal working payloads; cross-reference with REPLICATE_MODELS_README_GET before calling REPLICATE_CREATE_PREDICTION to satisfy strict input validation. |
| `REPLICATE_MODELS_GET` | Get Model Details | Tool to get details of a specific model by owner and name. Consult the returned input schema before constructing any prediction request — each model defines its own required/optional fields (e.g., `prompt`, `aspect_ratio`, `version`); missing or unknown keys cause validation errors. Model schemas and available versions may change over time; recheck before production use. |
| `REPLICATE_MODELS_LIST` | List Public Models | Tool to list public models with pagination and sorting. Use when you need to browse available models or find models sorted by creation date. |
| `REPLICATE_MODELS_PREDICTIONS_CREATE` | Create Model Prediction | Tool to create a prediction using an official Replicate model. Use when you need to run inference with a specific model using its owner and name. Supports synchronous waiting (up to 60 seconds) and webhooks for async notifications. |
| `REPLICATE_MODELS_README_GET` | Get Model README | Tool to get the README content for a model in Markdown format. Consult alongside REPLICATE_MODELS_EXAMPLES_LIST before calling REPLICATE_CREATE_PREDICTION — Replicate enforces strict JSON schemas on model inputs and returns 422 errors for incorrect keys or types. Use after retrieving model details when you want to view its documentation. |
| `REPLICATE_MODELS_VERSIONS_GET` | Get Model Version | Tool to get a specific version of a model. Use when you need details about a particular model version including its schema and metadata. |
| `REPLICATE_MODELS_VERSIONS_LIST` | List Model Versions | Tool to list all versions of a specific model. Use when you need to see all available versions of a model, sorted by newest first. |
| `REPLICATE_CREATE_PREDICTION` | Create Prediction | Tool to create a prediction to run a model by version ID. Use when you have a specific model version identifier and need to run inference with provided inputs. Supports synchronous waiting and webhook notifications. |
| `REPLICATE_PREDICTIONS_LIST` | List All Predictions | Tool to list all predictions for the authenticated user or organization with pagination. Use when you need to retrieve prediction history or filter predictions by creation date. |
| `REPLICATE_SEARCH` | Search Models and Collections | Tool to search for models, collections, and docs using text queries (beta). Use when you need to find relevant models or collections based on keywords or descriptions. |
| `REPLICATE_TRAININGS_CANCEL` | Cancel Training | Tool to cancel an ongoing training operation in Replicate. Use when you need to stop a training job that is in progress. |
| `REPLICATE_TRAININGS_CREATE` | Create Training Job | Tool to create a training job for a specific model version. Use when you need to fine-tune a model with custom training data. Supports webhook notifications for training status updates. |
| `REPLICATE_TRAININGS_LIST` | List Training Jobs | Tool to list all training jobs for the authenticated user or organization with pagination. Use when you need to retrieve training history or check the status of training jobs. |
| `REPLICATE_UPDATE_MODELS` | Update Model Metadata | Tool to update metadata for a model including description, URLs, and README. Use when you need to modify a model's visibility, documentation, or associated links. |
| `REPLICATE_WEBHOOKS_SECRET_GET` | Get Webhook Signing Secret | Tool to get the signing secret for the default webhook. Use when you need to retrieve the secret key used to verify webhook authenticity. |
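
Several of the list tools above return paginated responses with a `next` cursor, and `REPLICATE_FILES_LIST` recommends staying at 1–2 requests per second. The following is a minimal sketch of following that cursor with a simple throttle. It assumes each page is a dictionary with a `results` list alongside the `next` field; `iter_all_results`, `fetch_page`, and the in-memory `fake_fetch` are hypothetical names used for illustration and are not part of the Composio or Replicate SDKs.

```python
import time
from typing import Any, Callable, Dict, Iterator, Optional

Page = Dict[str, Any]


def iter_all_results(
    fetch_page: Callable[[Optional[str]], Page],
    min_interval: float = 0.5,
    sleep: Callable[[float], None] = time.sleep,
) -> Iterator[Any]:
    """Yield every item across pages by following the `next` cursor.

    `fetch_page` is a hypothetical callable that takes a cursor (None for
    the first page) and returns one page of a list tool's response.
    """
    cursor: Optional[str] = None
    first_request = True
    while True:
        if not first_request:
            # Pause between requests; 0.5 s keeps us near 2 requests/second.
            sleep(min_interval)
        first_request = False

        page = fetch_page(cursor)
        # Assumption: items live under a `results` key.
        yield from page.get("results", [])

        cursor = page.get("next")
        if not cursor:  # empty or missing cursor means the last page
            break


# Hypothetical in-memory stand-in for a paginated tool response.
_FAKE_PAGES = {
    None: {"results": ["file-a", "file-b"], "next": "cursor-2"},
    "cursor-2": {"results": ["file-c"], "next": None},
}


def fake_fetch(cursor: Optional[str]) -> Page:
    return _FAKE_PAGES[cursor]


all_files = list(iter_all_results(fake_fetch, min_interval=0.0))
print(all_files)
```

In real use, `fetch_page` would wrap whichever mechanism you use to execute the list tool; the generator itself only depends on the shape of the page.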

## Supported Triggers

None listed.

## Installation and MCP Setup

### Path 1: SDK Installation

#### Path 1, Step 1: Install Composio

Install the Composio SDK:
```bash
# Python
pip install composio_openai
```

```bash
# TypeScript
npm install @composio/openai
```

#### Path 1, Step 2: Initialize Composio and Create Tool Router Session

Import and initialize the Composio client, then create a Tool Router session:
```python
from openai import OpenAI
from composio import Composio
from composio_openai import OpenAIResponsesProvider

composio = Composio(provider=OpenAIResponsesProvider())
openai = OpenAI()
session = composio.create(user_id='your-user-id')
```

```typescript
import OpenAI from 'openai';
import { Composio } from '@composio/core';
import { OpenAIResponsesProvider } from '@composio/openai';

const composio = new Composio({
  provider: new OpenAIResponsesProvider(),
});
const openai = new OpenAI({});
const session = await composio.create('your-user-id');
```

#### Path 1, Step 3: Execute Replicate Tools via Tool Router with Your Agent

Get the tools from the Tool Router session and execute Replicate actions with your agent:
```python
tools = session.tools
response = openai.responses.create(
  model='gpt-4.1',
  tools=tools,
  input=[{
    'role': 'user',
    'content': 'Run an image-to-text model on my latest uploaded file'
  }]
)
result = composio.provider.handle_tool_calls(
  response=response,
  user_id='your-user-id'
)
print(result)
```

```typescript
const tools = session.tools;
const response = await openai.responses.create({
  model: 'gpt-4.1',
  tools: tools,
  input: [{
    role: 'user',
    content: 'Run an image-to-text model on my latest uploaded file'
  }],
});
const result = await composio.provider.handleToolCalls(
  'your-user-id',
  response.output
);
console.log(result);
```
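
Predictions can outlive a single request: synchronous waiting is capped (up to 60 seconds for `REPLICATE_MODELS_PREDICTIONS_CREATE`), and `REPLICATE_GET_PREDICTION` exists to check on a running prediction later. The sketch below shows one way to poll until a prediction reaches a terminal state. It assumes the prediction is a dictionary with a `status` field whose terminal values are `succeeded`, `failed`, and `canceled`; `wait_for_prediction`, `get_prediction`, and the fake status sequence are hypothetical and stand in for however you invoke the tool.

```python
import time
from typing import Any, Callable, Dict

# Assumption: these are the statuses after which a prediction no longer changes.
TERMINAL_STATUSES = {"succeeded", "failed", "canceled"}


def wait_for_prediction(
    get_prediction: Callable[[str], Dict[str, Any]],
    prediction_id: str,
    poll_interval: float = 2.0,
    timeout: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Dict[str, Any]:
    """Poll `get_prediction` until the prediction is terminal or time runs out."""
    deadline = clock() + timeout
    while True:
        prediction = get_prediction(prediction_id)
        if prediction.get("status") in TERMINAL_STATUSES:
            return prediction
        if clock() >= deadline:
            raise TimeoutError(
                f"prediction {prediction_id} still running after {timeout} seconds"
            )
        sleep(poll_interval)


# Hypothetical stand-in that walks through a typical status sequence.
_statuses = iter(["starting", "processing", "succeeded"])


def fake_get_prediction(prediction_id: str) -> Dict[str, Any]:
    status = next(_statuses)
    return {
        "id": prediction_id,
        "status": status,
        "output": "done" if status == "succeeded" else None,
    }


result = wait_for_prediction(fake_get_prediction, "pred-123", poll_interval=0.0)
print(result["status"])
```

A caller would still need to inspect the returned status, since `failed` and `canceled` are returned rather than raised in this sketch.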

### Path 2: MCP Server Setup

#### Path 2, Step 1: Install Composio

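Install the Composio SDK together with an agent SDK: the Claude Agent SDK for Python, or the Vercel AI SDK packages for TypeScript.
```bash
# Python
pip install composio claude-agent-sdk
```

```bash
# TypeScript
npm install @composio/core ai @ai-sdk/openai @ai-sdk/mcp
```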

#### Path 2, Step 2: Create Tool Router Session

Initialize the Composio client and create a Tool Router session:
```python
from composio import Composio
from claude_agent_sdk import ClaudeSDKClient, ClaudeAgentOptions

composio = Composio(api_key='your-composio-api-key')
session = composio.create(user_id='your-user-id')
url = session.mcp.url
```

```typescript
import { Composio } from '@composio/core';

const composio = new Composio({ apiKey: 'your-composio-api-key' });

console.log("Creating Tool Router session...");
const { mcp } = await composio.create('your-user-id');
console.log(`Tool Router session created: ${mcp.url}`);
```
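
The snippets above pass the Composio API key as a string literal for brevity. A minimal sketch of reading it from the environment instead is shown below, so the key stays out of source code. The variable name `COMPOSIO_API_KEY` and the helper `load_composio_api_key` are assumptions made for this example; use whatever name your deployment defines.

```python
import os
from typing import Mapping


def load_composio_api_key(
    env: Mapping[str, str] = os.environ,
    var_name: str = "COMPOSIO_API_KEY",  # assumed variable name
) -> str:
    """Return the API key from the environment, failing loudly if it is unset."""
    key = env.get(var_name, "").strip()
    if not key:
        raise RuntimeError(f"{var_name} is not set; export it before starting the agent")
    return key


# Demonstration with an explicit mapping instead of the real environment.
demo_key = load_composio_api_key({"COMPOSIO_API_KEY": "  demo-key  "})
print(demo_key)
```

The returned value can then be passed wherever the examples use `'your-composio-api-key'`, for instance as the `api_key` argument or the `x-api-key` header.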

#### Path 2, Step 3: Connect to AI Agent

Use the MCP server with your AI agent:
```python
import asyncio

options = ClaudeAgentOptions(
    permission_mode='bypassPermissions',
    mcp_servers={
        'tool_router': {
            'type': 'http',
            'url': url,
            'headers': {
                'x-api-key': 'your-composio-api-key'
            }
        }
    },
    system_prompt='You are a helpful assistant with access to Replicate tools.',
    max_turns=10
)

async def main():
    async with ClaudeSDKClient(options=options) as client:
        await client.query('Run stable-diffusion to generate a cat image')
        async for message in client.receive_response():
            if hasattr(message, 'content'):
                for block in message.content:
                    if hasattr(block, 'text'):
                        print(block.text)

asyncio.run(main())
```

```typescript
import { openai } from '@ai-sdk/openai';
import { experimental_createMCPClient as createMCPClient } from '@ai-sdk/mcp';
import { generateText, stepCountIs } from 'ai';

const client = await createMCPClient({
  transport: {
    type: 'http',
    url: mcp.url,
    headers: { 'x-api-key': 'your-composio-api-key' }
  }
});

const tools = await client.tools();

const { text } = await generateText({
  model: openai('gpt-4o'),
  tools,
  messages: [{ role: 'user', content: 'Run stable-diffusion to generate a cat image' }],
  stopWhen: stepCountIs(5)
});

console.log(`Agent: ${text}`);
```

## Why Use Composio?

### 1. AI Native Replicate Integration

- Supports both Replicate MCP and direct API-based integrations
- Structured, LLM-friendly schemas for reliable tool execution
- Rich coverage for running models, tracking jobs, and retrieving predictions

### 2. Managed Auth

- Securely store and manage API keys for Replicate
- Central dashboard for managing and revoking credentials
- Scoped, per-user access without sharing secrets in code

### 3. Agent Optimized Design

- Tools tuned for high reliability and clear error reporting
- Automatic logging of all model runs and fetches per user and agent

### 4. Enterprise Grade Security

- RBAC support to control which agents can run which models
- Least privilege access for sensitive AI model endpoints
- Comprehensive audit logs for compliance and oversight

## Use Replicate with any AI Agent Framework

Choose the framework you want to connect Replicate to:

- [ChatGPT](https://composio.dev/toolkits/replicate/framework/chatgpt)
- [OpenAI Agents SDK](https://composio.dev/toolkits/replicate/framework/open-ai-agents-sdk)
- [Claude Agent SDK](https://composio.dev/toolkits/replicate/framework/claude-agents-sdk)
- [Claude Code](https://composio.dev/toolkits/replicate/framework/claude-code)
- [Claude Cowork](https://composio.dev/toolkits/replicate/framework/claude-cowork)
- [Codex](https://composio.dev/toolkits/replicate/framework/codex)
- [Cursor](https://composio.dev/toolkits/replicate/framework/cursor)
- [VS Code](https://composio.dev/toolkits/replicate/framework/vscode)
- [OpenCode](https://composio.dev/toolkits/replicate/framework/opencode)
- [OpenClaw](https://composio.dev/toolkits/replicate/framework/openclaw)
- [Hermes](https://composio.dev/toolkits/replicate/framework/hermes-agent)
- [Google ADK](https://composio.dev/toolkits/replicate/framework/google-adk)
- [LangChain](https://composio.dev/toolkits/replicate/framework/langchain)
- [Vercel AI SDK](https://composio.dev/toolkits/replicate/framework/ai-sdk)
- [Mastra AI](https://composio.dev/toolkits/replicate/framework/mastra-ai)
- [LlamaIndex](https://composio.dev/toolkits/replicate/framework/llama-index)
- [CrewAI](https://composio.dev/toolkits/replicate/framework/crew-ai)
- [Pydantic AI](https://composio.dev/toolkits/replicate/framework/pydantic-ai)
- [AutoGen](https://composio.dev/toolkits/replicate/framework/autogen)

## Related Toolkits

- [Composio](https://composio.dev/toolkits/composio) - Composio is an integration platform that connects AI agents with hundreds of business tools. It streamlines authentication and lets you trigger actions across services—no custom code needed.
- [Composio search](https://composio.dev/toolkits/composio_search) - Composio search is a unified web search toolkit spanning travel, e-commerce, news, financial markets, images, and more. It lets you and your apps tap into up-to-date web data from a single, easy-to-integrate service.
- [Perplexityai](https://composio.dev/toolkits/perplexityai) - Perplexityai delivers natural, conversational AI models for generating human-like text. Instantly get context-aware, high-quality responses for chat, search, or complex workflows.
- [Browser tool](https://composio.dev/toolkits/browser_tool) - Browser tool is a virtual browser integration that lets AI agents interact with the web programmatically. It enables automated browsing, scraping, and action-taking from any AI workflow.
- [Ai ml api](https://composio.dev/toolkits/ai_ml_api) - Ai ml api is a suite of AI/ML models for natural language and image tasks. It provides fast, scalable access to advanced AI capabilities for your apps and workflows.
- [Aivoov](https://composio.dev/toolkits/aivoov) - Aivoov is an AI-powered text-to-speech platform offering 1,000+ voices in over 150 languages. Instantly turn written content into natural, human-like audio for any application.
- [All images ai](https://composio.dev/toolkits/all_images_ai) - All-Images.ai is an AI-powered image generation and management platform. It helps you create, search, and organize images effortlessly with advanced AI capabilities.
- [Anthropic administrator](https://composio.dev/toolkits/anthropic_administrator) - Anthropic administrator is an API for managing Anthropic organizational resources like members, workspaces, and API keys. It helps you automate admin tasks and streamline resource management across your Anthropic organization.
- [Api labz](https://composio.dev/toolkits/api_labz) - Api labz is a platform offering a suite of AI-driven APIs and workflow tools. It helps developers automate tasks and build smarter, more efficient applications.
- [Apipie ai](https://composio.dev/toolkits/apipie_ai) - Apipie ai is an AI model aggregator offering a single API for accessing top AI models from multiple providers. It helps developers build cost-efficient, latency-optimized AI solutions without juggling multiple integrations.
- [Astica ai](https://composio.dev/toolkits/astica_ai) - Astica ai provides APIs for computer vision, NLP, and voice synthesis. Integrate advanced AI features into your app with a single API key.
- [Bigml](https://composio.dev/toolkits/bigml) - BigML is a machine learning platform that lets you build, train, and deploy predictive models from your data. Its intuitive interface and robust API make machine learning accessible and efficient.
- [Botbaba](https://composio.dev/toolkits/botbaba) - Botbaba is a platform for building, managing, and deploying conversational AI chatbots across messaging channels. It streamlines chatbot automation, making it easier to integrate AI into customer interactions.
- [Botpress](https://composio.dev/toolkits/botpress) - Botpress is an open-source platform for building, deploying, and managing chatbots. It helps teams automate conversations and deliver rich, interactive messaging experiences.
- [Chatbotkit](https://composio.dev/toolkits/chatbotkit) - Chatbotkit is a platform for building and managing AI-powered chatbots using robust APIs and SDKs. It lets you easily add conversational AI to your apps for better user engagement.
- [Cody](https://composio.dev/toolkits/cody) - Cody is an AI assistant built for businesses, trained on your company's knowledge and data. It delivers instant answers and insights, tailored for your team.
- [Context7 MCP](https://composio.dev/toolkits/context7_mcp) - Context7 MCP delivers live, version-specific code docs and examples right from the source. It helps developers and AI agents instantly retrieve authoritative programming info—no more out-of-date docs.
- [Customgpt](https://composio.dev/toolkits/customgpt) - CustomGPT.ai lets you build and deploy chatbots tailored to your own data and business needs. Get precise and context-aware AI conversations without writing code.
- [Datarobot](https://composio.dev/toolkits/datarobot) - Datarobot is a machine learning platform that automates model development, deployment, and monitoring. It empowers organizations to quickly gain predictive insights from large datasets.
- [Deepgram](https://composio.dev/toolkits/deepgram) - Deepgram is an AI-powered speech recognition platform for accurate audio transcription and understanding. It enables fast, scalable speech-to-text with advanced audio intelligence features.

## Frequently Asked Questions

### Do I need my own developer credentials to use Replicate with Composio?

Yes, Replicate requires you to configure your own API key credentials. Once set up, Composio handles secure credential storage and API request handling for you.

### Can I use multiple toolkits together?

Yes! Composio's Tool Router enables agents to use multiple toolkits. [Learn more](https://docs.composio.dev/tool-router/overview).

### Is Composio secure?

Composio is SOC 2 and ISO 27001 compliant with all data encrypted in transit and at rest. [Learn more](https://trust.composio.dev).

### What if the API changes?

Composio maintains and updates all toolkit integrations automatically, so your agents always work with the latest API versions.

---
[See all toolkits](https://composio.dev/toolkits) · [Composio docs](https://docs.composio.dev/llms.txt)
