How to integrate Elevenlabs MCP with Vercel AI SDK v6

Connect Vercel AI SDK v6 to Elevenlabs MCP. Convert this chapter text to audio, create a custom project for your audiobook, and more using natural language, with authentication handled for you.


Elevenlabs

API Key

Elevenlabs is an advanced AI voice generation platform for lifelike, multilingual speech synthesis. Perfect for creating natural voices for videos, apps, and business content in seconds.

155 Tools

Managed auth

Connect Elevenlabs without auth hassles

We manage OAuth, API keys, token refresh, and scopes — you just build.


Introduction

This guide walks you through connecting Elevenlabs to Vercel AI SDK v6 using the Composio tool router. By the end, you'll have a working Elevenlabs agent that can convert chapter text to audio, create a custom project for an audiobook, and add a new pronunciation rule for a word, all through natural language commands.

This guide will help you understand how to give your Vercel AI SDK agent real control over an Elevenlabs account through Composio's Elevenlabs MCP server.

Before we dive in, let's take a quick look at the key ideas and tools involved.


TL;DR

Here's what you'll learn:

How to set up and configure a Vercel AI SDK agent with Elevenlabs integration
Using Composio's Tool Router to dynamically load and access Elevenlabs tools
Creating an MCP client connection using HTTP transport
Building an interactive CLI chat interface with conversation history management
Handling tool calls and results within the Vercel AI SDK framework

What is Vercel AI SDK?

The Vercel AI SDK is a TypeScript library for building AI-powered applications. It provides tools for creating agents that can use external services and maintain conversation state.

Key features include:

streamText: Core function for streaming responses with real-time tool support
MCP Client: Built-in support for Model Context Protocol via @ai-sdk/mcp
Step Counting: Control multi-step tool execution with stopWhen: stepCountIs()
OpenAI Provider: Native integration with OpenAI models

What is the Elevenlabs MCP server, and what's possible with it?

The Elevenlabs MCP server is an implementation of the Model Context Protocol that connects your AI agents and assistants, such as Claude and Cursor, directly to your Elevenlabs account. It provides structured and secure access to your voice synthesis projects and tools, so your agent can perform actions like generating audio from text, managing custom voices, organizing projects, and fine-tuning pronunciation on your behalf.

Project and chapter audio conversion: Instantly convert text content from chapters or entire projects into high-quality, natural-sounding audio files.
Custom voice creation and management: Guide your agent to add, finalize, or share custom voices—either by uploading new samples or assembling voices from existing data.
Pronunciation dictionary and rule management: Improve the accuracy of speech outputs by adding pronunciation dictionaries or custom pronunciation rules directly from files or specific aliases/phonemes.
Project organization and automation: Let your agent create new projects, add or remove chapters, and automate speech synthesis workflows for audiobooks, podcasts, or media production.
Embeddable audio player generation: Enable your agent to generate AudioNative projects, creating customizable and embeddable audio players from your content with just a prompt.

What is the Composio tool router, and how does it fit here?

What is Composio SDK?

The Composio SDK helps agents find the right tools for a task at runtime. You can plug in multiple toolkits (like Gmail, HubSpot, and GitHub), and the agent will identify the relevant app and action to complete multi-step workflows. This can reduce token usage and improve the reliability of tool calls. Read more here: Getting started with Composio SDK

The tool router generates a secure MCP URL that your agents can access to perform actions.

How the Composio SDK works

The Composio SDK follows a three-phase workflow:

Discovery: Searches for tools matching your task and returns relevant toolkits with their details.
Authentication: Checks for active connections. If missing, creates an auth config and returns a connection URL via Auth Link.
Execution: Executes the action using the authenticated connection.
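
The three phases can be pictured as a small pipeline. The sketch below is a toy in-memory model, not Composio's implementation: the `Toolkit` type, the `registry` and `connections` data, the placeholder connect URL, and the `discover`, `authenticate`, and `execute` functions are all hypothetical names chosen for illustration.

```typescript
// Toy model of the discovery -> authentication -> execution flow.
// All names and data here are hypothetical; this is not Composio's API.

type Toolkit = { slug: string; actions: string[] };

type AuthResult =
  | { status: "connected" }
  | { status: "needs_auth"; connectUrl: string };

const registry: Toolkit[] = [
  { slug: "elevenlabs", actions: ["text_to_speech", "get_voices"] },
  { slug: "gmail", actions: ["send_email"] },
];

// Users that already have an active connection, keyed by toolkit slug.
const connections = new Map<string, Set<string>>([
  ["elevenlabs", new Set(["user_1"])],
]);

// Phase 1: find toolkits exposing an action that matches the task keyword.
function discover(keyword: string): Toolkit[] {
  return registry.filter((kit) =>
    kit.actions.some((action) => action.includes(keyword)),
  );
}

// Phase 2: reuse an active connection, or hand back a link to create one.
function authenticate(userId: string, kit: Toolkit): AuthResult {
  if (connections.get(kit.slug)?.has(userId)) {
    return { status: "connected" };
  }
  return {
    status: "needs_auth",
    connectUrl: `https://example.invalid/connect/${kit.slug}?user=${userId}`,
  };
}

// Phase 3: run the action only when the user is connected.
function execute(userId: string, kit: Toolkit, action: string): string {
  const auth = authenticate(userId, kit);
  if (auth.status !== "connected") {
    return `auth required: ${auth.connectUrl}`;
  }
  return `${kit.slug}.${action} executed for ${userId}`;
}

const [kit] = discover("speech");
console.log(execute("user_1", kit, "text_to_speech"));
console.log(execute("user_2", kit, "text_to_speech"));
```

In this model a user without a connection gets a link back instead of a result, which mirrors the Auth Link step described above.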

Step-by-step Guide


Prerequisites

Before you begin, make sure you have:

Node.js and npm installed
A Composio account with API key
An OpenAI API key

Getting API Keys for OpenAI and Composio

OpenAI API Key

Go to the OpenAI dashboard and create an API key. You'll need credits to use the models, or you can connect to another model provider.
Keep the API key safe.

Composio API Key

Log in to the Composio dashboard.
Navigate to your API settings and generate a new API key.
Store this key securely as you'll need it for authentication.

Install required dependencies

bash

npm install @ai-sdk/openai @ai-sdk/mcp @composio/core ai dotenv

First, install the necessary packages for your project.

What you're installing:

@ai-sdk/openai: Vercel AI SDK's OpenAI provider
@ai-sdk/mcp: MCP client for Vercel AI SDK
@composio/core: Composio SDK for tool integration
ai: Core Vercel AI SDK
dotenv: Environment variable management

Set up environment variables

bash

OPENAI_API_KEY=your_openai_api_key_here
COMPOSIO_API_KEY=your_composio_api_key_here
COMPOSIO_USER_ID=your_user_id_here

Create a .env file in your project root.

What's needed:

OPENAI_API_KEY: Your OpenAI API key for GPT model access
COMPOSIO_API_KEY: Your Composio API key for tool access
COMPOSIO_USER_ID: A unique identifier for the user session
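
One way to fail fast on a missing variable is to check all three names in a single pass before creating any clients. The helper below is a sketch: `requireEnv` is a hypothetical name, not part of dotenv or Composio, and it takes the environment as a parameter so it can be exercised without touching `process.env`.

```typescript
// Hypothetical helper: collect every missing variable and report them together.
type Env = Record<string, string | undefined>;

function requireEnv<K extends string>(
  names: readonly K[],
  env: Env,
): Record<K, string> {
  const missing = names.filter((name) => !env[name]);
  if (missing.length > 0) {
    throw new Error(`Missing environment variables: ${missing.join(", ")}`);
  }
  const values = {} as Record<K, string>;
  for (const name of names) {
    values[name] = env[name] as string;
  }
  return values;
}

const REQUIRED = [
  "OPENAI_API_KEY",
  "COMPOSIO_API_KEY",
  "COMPOSIO_USER_ID",
] as const;

// Sample values stand in for a loaded .env file.
const config = requireEnv(REQUIRED, {
  OPENAI_API_KEY: "sk-example",
  COMPOSIO_API_KEY: "composio-example",
  COMPOSIO_USER_ID: "user_1",
});
console.log(Object.keys(config).join(", "));
```

In a real script you would pass `process.env` as the second argument after the `dotenv/config` import has run.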

Import required modules and validate environment

typescript

import "dotenv/config";
import { openai } from "@ai-sdk/openai";
import { Composio } from "@composio/core";
import * as readline from "readline";
import { streamText, type ModelMessage, stepCountIs } from "ai";
import { createMCPClient } from "@ai-sdk/mcp";

const composioAPIKey = process.env.COMPOSIO_API_KEY;
const composioUserID = process.env.COMPOSIO_USER_ID;

if (!process.env.OPENAI_API_KEY) throw new Error("OPENAI_API_KEY is not set");
if (!composioAPIKey) throw new Error("COMPOSIO_API_KEY is not set");
if (!composioUserID) throw new Error("COMPOSIO_USER_ID is not set");

const composio = new Composio({
  apiKey: composioAPIKey,
});

What's happening:

We're importing all necessary libraries including Vercel AI SDK's OpenAI provider and Composio
The dotenv/config import automatically loads environment variables
The MCP client import enables connection to Composio's tool server

Create Tool Router session and initialize MCP client

typescript

async function main() {
  // Create a tool router session for the user
  const session = await composio.create(composioUserID!, {
    toolkits: ["elevenlabs"],
  });

  const mcpUrl = session.mcp.url;

What's happening:

We're creating a Tool Router session that gives your agent access to Elevenlabs tools
The create method takes the user ID and specifies which toolkits should be available
The returned mcp object contains the URL and authentication headers needed to connect to the MCP server
This session provides access to all Elevenlabs-related tools through the MCP protocol
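
To show how those two fields feed the next step, the sketch below turns a session-like object into an HTTP transport config. The `McpInfo` and `HttpTransport` types and the `toHttpTransport` helper are hypothetical, and the https check is an assumption added for illustration rather than a documented Composio requirement. The URL and header values are placeholders.

```typescript
// Hypothetical shapes mirroring the fields the guide reads from `session.mcp`.
type McpInfo = { url: string; headers?: Record<string, string> };
type HttpTransport = {
  type: "http";
  url: string;
  headers: Record<string, string>;
};

// Validate the URL and copy the headers into a transport config.
function toHttpTransport(mcp: McpInfo): HttpTransport {
  const parsed = new URL(mcp.url); // throws a TypeError on a malformed URL
  if (parsed.protocol !== "https:") {
    // Assumption for this sketch: refuse to send credentials over plain http.
    throw new Error(`Expected an https MCP URL, got "${parsed.protocol}"`);
  }
  return { type: "http", url: mcp.url, headers: { ...(mcp.headers ?? {}) } };
}

// Placeholder values; a real session supplies its own URL and headers.
const transport = toHttpTransport({
  url: "https://mcp.example.invalid/session/abc",
  headers: { "x-api-key": "placeholder" },
});
console.log(transport.type, transport.url);
```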

Connect to MCP server and retrieve tools

typescript

const mcpClient = await createMCPClient({
  transport: {
    type: "http",
    url: mcpUrl,
    headers: session.mcp.headers, // Authentication headers for the Composio MCP server
  },
});

const tools = await mcpClient.tools();

What's happening:

We're creating an MCP client that connects to our Composio Tool Router session via HTTP
The mcp.url provides the endpoint, and mcp.headers contains authentication credentials
The type: "http" is important - Composio requires HTTP transport
tools() retrieves all available Elevenlabs tools that the agent can use
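
Before handing the tools to the model, it can help to see what was loaded. The sketch below assumes that `tools()` resolves to an object keyed by tool name whose values may carry a `description`. `describeTools`, the `ToolLike` type, and the sample entries are hypothetical and stand in for the real MCP response.

```typescript
// Hypothetical shape: only the field this sketch reads.
type ToolLike = { description?: string };

// Build one summary line per tool, sorted by name, truncating long descriptions.
function describeTools(
  tools: Record<string, ToolLike>,
  maxLen = 60,
): string[] {
  return Object.entries(tools)
    .sort(([a], [b]) => a.localeCompare(b))
    .map(([name, tool]) => {
      const text = tool.description ?? "(no description)";
      const short =
        text.length > maxLen ? `${text.slice(0, maxLen - 3)}...` : text;
      return `${name}: ${short}`;
    });
}

// Sample data standing in for the result of `await mcpClient.tools()`.
const sampleTools: Record<string, ToolLike> = {
  TOOL_B: {
    description: "Converts text to speech using a specified voice and model.",
  },
  TOOL_A: {},
};

for (const line of describeTools(sampleTools)) {
  console.log(line);
}
```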

Initialize conversation and CLI interface

typescript

let messages: ModelMessage[] = [];

console.log("Chat started! Type 'exit' or 'quit' to end the conversation.\n");
console.log(
  "Ask anything related to Elevenlabs, like convert this chapter text to audio or create a custom project for my audiobook.\n",
);

const rl = readline.createInterface({
  input: process.stdin,
  output: process.stdout,
  prompt: "> ",
});

rl.prompt();

What's happening:

We initialize an empty messages array to maintain conversation history
A readline interface is created to accept user input from the command line
Instructions are displayed to guide the user on how to interact with the agent
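
Because the `messages` array grows with every turn, a long session will eventually send a large history to the model. One possible mitigation, which is not part of the original script, is to keep only the most recent turns. The sketch below uses a simplified `ChatMessage` type in place of the SDK's `ModelMessage`, and `trimHistory` is a hypothetical helper. It cuts only at a user message so that a tool result is never separated from the call that produced it.

```typescript
// Simplified stand-in for the SDK's message type (an assumption for this sketch).
type ChatMessage = {
  role: "system" | "user" | "assistant" | "tool";
  content: string;
};

// Keep the last `maxTurns` turns, where a turn starts at a user message and
// includes everything up to the next user message.
function trimHistory(messages: ChatMessage[], maxTurns: number): ChatMessage[] {
  if (maxTurns <= 0) return [];
  let turnsSeen = 0;
  for (let i = messages.length - 1; i >= 0; i--) {
    if (messages[i].role === "user") {
      turnsSeen++;
      if (turnsSeen === maxTurns) {
        return messages.slice(i);
      }
    }
  }
  // Fewer than `maxTurns` turns so far: return a copy with nothing dropped.
  return messages.slice();
}

const chatHistory: ChatMessage[] = [
  { role: "user", content: "List my voices" },
  { role: "assistant", content: "(tool call)" },
  { role: "tool", content: "(tool result)" },
  { role: "assistant", content: "You have 3 voices." },
  { role: "user", content: "Convert chapter 1 to audio" },
  { role: "assistant", content: "Started the conversion." },
];

console.log(
  trimHistory(chatHistory, 1)
    .map((m) => m.role)
    .join(" > "),
);
```

Trimming trades context for cost, so an agent that needs to recall earlier project or voice IDs may do better with a larger `maxTurns`.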

Handle user input and stream responses with real-time tool feedback

typescript

rl.on("line", async (userInput: string) => {
  const trimmedInput = userInput.trim();

  if (["exit", "quit", "bye"].includes(trimmedInput.toLowerCase())) {
    console.log("\nGoodbye!");
    // Closing readline fires the "close" handler, which shuts down the MCP client and exits.
    rl.close();
    return;
  }

  if (!trimmedInput) {
    rl.prompt();
    return;
  }

  messages.push({ role: "user", content: trimmedInput });
  console.log("\nAgent is thinking...\n");

  try {
    const stream = streamText({
      model: openai("gpt-5"),
      messages,
      tools,
      toolChoice: "auto",
      stopWhen: stepCountIs(10),
      onStepFinish: (step) => {
        for (const toolCall of step.toolCalls) {
          console.log(`[Using tool: ${toolCall.toolName}]`);
        }
        if (step.toolCalls.length > 0) {
          console.log(""); // Add space after tool calls
        }
      },
    });

    for await (const chunk of stream.textStream) {
      process.stdout.write(chunk);
    }

    console.log("\n\n---\n");

    // Get final result for message history
    const response = await stream.response;
    if (response?.messages?.length) {
      messages.push(...response.messages);
    }
  } catch (error) {
    console.error("\nAn error occurred while talking to the agent:");
    console.error(error);
    console.log(
      "\nYou can try again or restart the app if it keeps happening.\n",
    );
  } finally {
    rl.prompt();
  }
});

rl.on("close", async () => {
  await mcpClient.close();
  console.log("\n👋 Session ended.");
  process.exit(0);
});
}

main().catch((err) => {
  console.error("Fatal error:", err);
  process.exit(1);
});

What's happening:

We use streamText instead of generateText to stream responses in real-time
toolChoice: "auto" allows the model to decide when to use Elevenlabs tools
stopWhen: stepCountIs(10) allows up to 10 steps for complex multi-tool operations
onStepFinish callback displays which tools are being used in real-time
We iterate through the text stream to create a typewriter effect as the agent responds
The complete response is added to conversation history to maintain context
Errors are caught and displayed with helpful retry suggestions
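
To make the step limit concrete, here is a toy loop in the spirit of `stopWhen: stepCountIs(10)`. It is not the SDK's implementation: `StepResult`, `runSteps`, and the scripted model are hypothetical. Each step either requests tools, in which case the loop continues, or returns text with no tool calls, in which case it stops. The limit caps how many steps can run even if the model keeps asking for tools.

```typescript
// Toy model of a multi-step tool loop with a step cap (not the SDK's code).
type StepResult = { text: string; toolCalls: string[] };

function runSteps(
  nextStep: (stepIndex: number) => StepResult,
  maxSteps: number,
  onStepFinish?: (step: StepResult) => void,
): StepResult[] {
  const steps: StepResult[] = [];
  while (steps.length < maxSteps) {
    const step = nextStep(steps.length);
    steps.push(step);
    onStepFinish?.(step);
    // A step without tool calls is the final answer, so the loop ends early.
    if (step.toolCalls.length === 0) break;
  }
  return steps;
}

// Scripted "model": two tool-using steps, then a plain text answer.
const script: StepResult[] = [
  { text: "", toolCalls: ["find_project"] },
  { text: "", toolCalls: ["convert_chapter"] },
  { text: "Chapter converted.", toolCalls: [] },
];

const finished = runSteps(
  (i) => script[Math.min(i, script.length - 1)],
  10,
  (step) =>
    step.toolCalls.forEach((name) => console.log(`[Using tool: ${name}]`)),
);
console.log(finished[finished.length - 1].text);
```

A lower limit bounds cost and latency, while a higher one leaves room for workflows that chain several tools, such as finding a project before converting one of its chapters.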

Complete Code

Here's the complete code to get you started with Elevenlabs and Vercel AI SDK:

typescript

import "dotenv/config";
import { openai } from "@ai-sdk/openai";
import { Composio } from "@composio/core";
import * as readline from "readline";
import { streamText, type ModelMessage, stepCountIs } from "ai";
import { createMCPClient } from "@ai-sdk/mcp";

const composioAPIKey = process.env.COMPOSIO_API_KEY;
const composioUserID = process.env.COMPOSIO_USER_ID;

if (!process.env.OPENAI_API_KEY) throw new Error("OPENAI_API_KEY is not set");
if (!composioAPIKey) throw new Error("COMPOSIO_API_KEY is not set");
if (!composioUserID) throw new Error("COMPOSIO_USER_ID is not set");

const composio = new Composio({
  apiKey: composioAPIKey,
});

async function main() {
  // Create a tool router session for the user
  const session = await composio.create(composioUserID!, {
    toolkits: ["elevenlabs"],
  });

  const mcpUrl = session.mcp.url;

  const mcpClient = await createMCPClient({
    transport: {
      type: "http",
      url: mcpUrl,
      headers: session.mcp.headers, // Authentication headers for the Composio MCP server
    },
  });

  const tools = await mcpClient.tools();

  let messages: ModelMessage[] = [];

  console.log("Chat started! Type 'exit' or 'quit' to end the conversation.\n");
  console.log(
    "Ask anything related to Elevenlabs, like convert this chapter text to audio or create a custom project for my audiobook.\n",
  );

  const rl = readline.createInterface({
    input: process.stdin,
    output: process.stdout,
    prompt: "> ",
  });

  rl.prompt();

  rl.on("line", async (userInput: string) => {
    const trimmedInput = userInput.trim();

    if (["exit", "quit", "bye"].includes(trimmedInput.toLowerCase())) {
      console.log("\nGoodbye!");
      // Closing readline fires the "close" handler, which shuts down the MCP client and exits.
      rl.close();
      return;
    }

    if (!trimmedInput) {
      rl.prompt();
      return;
    }

    messages.push({ role: "user", content: trimmedInput });
    console.log("\nAgent is thinking...\n");

    try {
      const stream = streamText({
        model: openai("gpt-5"),
        messages,
        tools,
        toolChoice: "auto",
        stopWhen: stepCountIs(10),
        onStepFinish: (step) => {
          for (const toolCall of step.toolCalls) {
            console.log(`[Using tool: ${toolCall.toolName}]`);
          }
          if (step.toolCalls.length > 0) {
            console.log(""); // Add space after tool calls
          }
        },
      });

      for await (const chunk of stream.textStream) {
        process.stdout.write(chunk);
      }

      console.log("\n\n---\n");

      // Get final result for message history
      const response = await stream.response;
      if (response?.messages?.length) {
        messages.push(...response.messages);
      }
    } catch (error) {
      console.error("\nAn error occurred while talking to the agent:");
      console.error(error);
      console.log(
        "\nYou can try again or restart the app if it keeps happening.\n",
      );
    } finally {
      rl.prompt();
    }
  });

  rl.on("close", async () => {
    await mcpClient.close();
    console.log("\n👋 Session ended.");
    process.exit(0);
  });
}

main().catch((err) => {
  console.error("Fatal error:", err);
  process.exit(1);
});

Conclusion

You've successfully built an Elevenlabs agent using the Vercel AI SDK with streaming capabilities! This implementation provides a foundation for building AI applications with natural language interfaces and real-time feedback.

Key features of this implementation:

Real-time streaming responses for a better user experience with typewriter effect
Live tool execution feedback showing which tools are being used as the agent works
Dynamic tool loading through Composio's Tool Router with secure authentication
Multi-step tool execution with configurable step limits (up to 10 steps)
Basic error handling that reports failures and keeps the chat loop running
Conversation history maintenance for context-aware responses

You can extend this further by adding custom error handling, implementing specific business logic, or integrating additional Composio toolkits to create multi-app workflows.

TOOLS

Supported Tools

Every Elevenlabs action and event your agent gets out of the box.

Add a pronunciation dictionary from file

Adds a new pronunciation dictionary from a lexicon file to improve speech synthesis accuracy.

Add outbound phone number

Tool to import/register a Twilio phone number or SIP trunk into ElevenLabs Agents Platform.

Add new project with attributes

Use to create a new ElevenLabs project for text-to-speech synthesis.

Add pronunciation dictionary from rules

Tool to create a new pronunciation dictionary from provided rules for ElevenLabs text-to-speech.

Add rules to the pronunciation dictionary

Adds one or more custom pronunciation rules (alias or phoneme) to an existing pronunciation dictionary.

Add sharing voice

Adds an existing, shareable voice to a specified user's ElevenLabs account library under a new custom name, requiring the user's public ID and the voice ID.

Add a voice

Adds a custom voice by uploading audio samples for voice cloning.

Attach phone number to agent

Tool to assign or unassign an existing imported phone number to an agent by updating the phone number's assigned agent.

Calculate ConvAI Agent LLM Usage

Tool to calculate expected number of LLM tokens needed for a conversational AI agent.

Calculate ConvAI LLM Usage

Tool to calculate expected LLM usage costs for conversational AI agents.

Cancel Batch Call

Tool to cancel a running batch call and set all recipients to cancelled status.

Convert chapter to audio

Converts the textual content of a chapter, identified by `chapter_id` within a `project_id`, into audio format.

Convert a project

Converts an existing ElevenLabs Studio project, including all its chapters and using its configured settings and voices, into speech.

Create a previously generated voice

Finalizes the creation of a voice using its `generated_voice_id` from a previous generation step by assigning a name, description, and optional labels.

Create Conversational AI Agent Test

Tool to create a new ElevenLabs Conversational AI agent response test.

Add to ConvAI Knowledge Base

Tool to add documentation to ElevenLabs Conversational AI knowledge base by uploading a file or referencing a webpage URL.

Create ConvAI Knowledge Base File

Tool to create a knowledge base document from an uploaded file for ElevenLabs Conversational AI agents.

Create ConvAI Knowledge Base Folder

Tool to create a folder in the ElevenLabs ConvAI knowledge base for organizing documents.

Create ConvAI Knowledge Base RAG Index

Tool to compute or retrieve RAG indexes for multiple knowledge base documents in batch.

Create Knowledge Base Text Document

Tool to create a knowledge base document with text content in ElevenLabs Conversational AI.

Create Knowledge Base URL Document

Tool to create a knowledge base document by scraping the given webpage.

Create ConvAI Workspace Secret

Tool to create a new secret for the ElevenLabs ConvAI workspace.

Create Conversational AI Tool

Tool to create a new conversational AI tool in ElevenLabs workspace.

Create Conversational AI Agent

Tool to create a new ElevenLabs Conversational AI agent with specified configuration.

Generate Music Composition Plan

Tool to generate a music composition plan from a text prompt using ElevenLabs Music API.

Create an AudioNative enabled project

Creates an ElevenLabs AudioNative project, generating an embeddable audio player from a provided content file using text-to-speech, allowing customization of player appearance, audio settings, and conversion options.

Get similar library voices

Returns a list of shared voices similar to the provided audio sample.

Create Single Use Token

Tool to generate a time-limited single-use token with embedded authentication for frontend clients.

Create Workspace Webhook

Tool to create a new webhook for the workspace with specified authentication type.

Delete chapter from project

Irreversibly deletes a specific, existing chapter from an existing project, typically to remove unwanted or obsolete content.

Delete Conversational AI Agent

Tool to permanently delete a Conversational AI agent by its unique identifier.

Delete agent response test

Tool to delete an agent response test by ID.

Delete batch call

Tool to permanently delete a batch call and all associated recipient records.

Delete conversation by ID

Tool to delete a particular Conversational AI conversation by ID.

Delete Knowledge Base Document or Folder

Tool to delete a document or folder from the knowledge base.

Delete ConvAI Knowledge Base RAG Index

Tool to delete RAG index for a knowledge base document.

Delete workspace secret

Tool to delete a workspace secret if it's not in use.

Delete conversational AI tool

Tool to delete a conversational AI tool from the workspace by ID.

Delete a dubbing project

Permanently deletes a dubbing project by its ID; this action is irreversible and the project cannot be recovered.

Delete history item

Permanently deletes a specific history item (including its audio file and metadata) using its `history_item_id`; this operation is irreversible and should be used with caution.

Delete MCP server

Tool to delete a specific MCP server configuration from the workspace.

Delete phone number by id

Tool to delete an imported phone number from the ElevenLabs workspace by ID.

Delete project by id

Use to irreversibly delete a specific project by its `project_id`; the project must exist and be accessible, and this action cannot be undone.

Delete voice sample

Permanently deletes a specific voice sample for a given voice ID; this action is irreversible.

Delete voice by id

Permanently and irreversibly deletes a specific custom voice using its `voice_id`; the voice must exist and the authenticated user must have permission to delete it.

Delete workspace webhook

Tool to delete a specified workspace webhook by its ID.

Download history items

Downloads audio clips from history by ID(s), returning a single file or a ZIP archive, with an optional output format.

Dub a video or an audio file

Dub a video or audio file into a specified target language, requiring 'file' or 'source_url', 'target_lang', and 'csv_file' if 'mode' is 'manual'.

Duplicate Conversational AI Agent

Tool to create a new agent by duplicating an existing one.

Edit voice

Updates the name, audio files, description, or labels for an existing voice model.

Edit voice settings

Edits key voice settings.

Generate a random voice

Generates a unique, random ElevenLabs text-to-speech voice based on input text and specified voice characteristics.

Get agent details

Tool to retrieve available Conversational AI agents and outbound-capable Twilio phone numbers.

Get Agent Link

Tool to get the current shareable link for a Conversational AI agent.

Get user profile

Retrieves the profile information for the authenticated ElevenLabs user (identified by API key).

Get audio from history item

Retrieves the audio content for a specific history item from ElevenLabs, using a `history_item_id` that must correspond to a previously generated audio.

Get sample audio

Retrieves the audio for a given `sample_id` that must belong to the specified `voice_id`.

Get audio native settings

Tool to retrieve player settings for a specific Audio Native project.

Get chapter by ID

Fetches comprehensive details for a specific chapter within a given project, including its metadata (name, ID), conversion status, progress, download availability, and content statistics.

Get chapters by project id

Retrieves a list of all chapters, their details, and conversion status for a project, useful for managing content or tracking progress.

Get chapter snapshots

Retrieves all saved version snapshots for a specific chapter within a given project, enabling review of its history or reversion to prior states.

Get Conversational AI Agent

Tool to retrieve the complete configuration for a specific Conversational AI agent by ID.

Get Agent Knowledge Base Size

Tool to retrieve the number of pages in a conversational AI agent's knowledge base.

Get ConvAI Agents Summaries

Tool to retrieve summaries for specified Conversational AI agents.

Get Agent Widget Config

Tool to retrieve the widget configuration for a Conversational AI agent.

Get Agent Response Test By ID

Tool to retrieve an ElevenLabs Conversational AI agent response test by its ID.

Get conversational AI analytics live count

Tool to retrieve the live count of active ongoing Conversational AI conversations.

Get batch call details

Tool to get detailed information about a batch call including all recipients.

Get Batch Calls for Workspace

Tool to retrieve all batch calls for the current workspace.

Get Conversational AI Conversations

Tool to retrieve all conversations of agents that user owns.

Get ConvAI Knowledge Base

Tool to retrieve a list of available knowledge base documents.

Get Knowledge Base Document Content

Tool to retrieve the entire content of a document from the knowledge base.

Get Knowledge Base Dependent Agents

Tool to retrieve a list of agents depending on a specific knowledge base document.

Get Knowledge Base Documentation

Tool to get details about a specific documentation making up the agent's knowledge base.

Get Knowledge Base RAG Index Overview

Tool to retrieve RAG index overview including total size and usage information.

Get ConvAI Knowledge Base RAG Indexes

Tool to retrieve all RAG indexes for a specified knowledge base document.

Get Knowledge Base Source File URL

Tool to get a signed URL to download the original source file of a file-type document from the knowledge base.

Get Knowledge Base Summaries

Tool to retrieve knowledge base document summaries by their IDs.

Get ConvAI MCP Server

Tool to retrieve a specific MCP server configuration from the workspace.

Get ConvAI MCP Servers

Tool to retrieve all MCP (Model Context Protocol) server configurations available in the workspace.

Get ConvAI MCP Server Tools

Tool to retrieve all tools available for a specific MCP server configuration.

Get Phone Number by ID

Tool to retrieve detailed configuration for a specific phone number by ID.

Get ConvAI Workspace Secrets

Tool to retrieve all workspace secrets for the user.

Get Convai Settings

Tool to retrieve Convai settings for the workspace.

Get Convai Dashboard Settings

Tool to retrieve Convai dashboard settings for the workspace.

List Test Invocations

Tool to list all test invocations for a specific conversational AI agent with pagination support.

Get Conversational AI Tool

Tool to retrieve the complete configuration for a specific conversational AI tool by ID.

Get ConvAI tools

Tool to retrieve all available tools in the workspace.

Get Tool Dependent Agents

Tool to retrieve a list of agents depending on a specific tool.

Get conversation by ID

Tool to fetch full details for a single Conversational AI conversation by ID.

Get Conversation Signed URL

Tool to get a signed URL to start a conversation with an agent that requires authorization.

Get default voice settings

Retrieves the ElevenLabs text-to-speech service's default voice settings (stability, similarity boost, style, speaker boost) that are applied when no voice-specific or request-specific settings are provided.

Get dubbed audio for a language

Retrieves an existing dubbed audio file for a specific `dubbing_id` and `language_code`.

Get dubbing project metadata

Retrieves metadata and status for a specific dubbing project by its ID.

Get dubbing transcript in specific format

Retrieves the transcript for a specific dubbing project and language in the requested format (SRT, WebVTT, or JSON).

Get generated items

Retrieves metadata for a list of generated audio items from history, supporting pagination and optional filtering by voice ID.

Get history item by id

Retrieves detailed information (excluding the audio file) for a specific audio generation history item from ElevenLabs, using its unique ID.

Get MCP Tool Configuration

Tool to retrieve configuration overrides for a specific MCP tool within an MCP server.

Get pronunciation dictionary metadata

Retrieves metadata for a specific, existing pronunciation dictionary from ElevenLabs using its ID.

Get models

Retrieves a detailed list of all available ElevenLabs text-to-speech (TTS) models and their capabilities.

Get project by ID

Use to retrieve all details for a specific project, including its chapters and their conversion statuses, by providing the project's unique ID.

Get projects

Fetches a list of all projects and their details associated with the user's ElevenLabs account; this is a read-only operation.

Get project snapshots

Retrieves all available snapshots (saved states or versions) for an existing project, enabling history tracking, version comparison, or accessing specific states for playback/processing, particularly in text-to-speech workflows.

Get pronunciation dictionaries

Retrieves a paginated list of pronunciation dictionaries, used to customize how specific words or phrases are pronounced by the text-to-speech (TTS) engine.

Get pronunciation dictionary version

Downloads the Pronunciation Lexicon Specification (PLS) file for an existing version of a pronunciation dictionary from ElevenLabs, used to customize TTS pronunciation.

Get Service Accounts

Tool to list all service accounts in the workspace.

Get shared voices

Retrieves a paginated and filterable list of shared voices from the ElevenLabs Voice Library.

Get sso provider admin

Retrieves the SSO provider configuration for a specified workspace, typically for review purposes, and will indicate if no configuration exists.

Get Agent Response Test Summaries

Tool to retrieve multiple agent response test summaries by their IDs.

Get dubbing transcript by language

Retrieves the textual transcript for a specified dubbing project and language, if one exists for that language in the project.

Get Usage Character Stats

Tool to retrieve usage metrics for the current user or entire workspace.

Get user info

Retrieves detailed information about the authenticated ElevenLabs user's account, including subscription, usage, API key, and status.

Get user subscription info

Retrieves detailed subscription information for the currently authenticated ElevenLabs user.

Get voice

Retrieves comprehensive details for a specific, existing voice by its `voice_id`, optionally including its settings.

Get voices list

Retrieves a list of all available voices along with their detailed attributes and settings.

Get voice settings

Retrieves the stability, similarity, style, and speaker boost settings for a specific, existing ElevenLabs voice using its `voice_id`.

Get workspace resource metadata

Tool to get metadata of a workspace resource by ID and type.

Get Workspace Webhooks

Tool to list all webhooks configured for the workspace.

List Conversational AI Agent Tests

Tool to list all agent response tests with pagination support and optional search filtering.

List Dubs

Tool to list dubbing projects you have access to.

List Phone Numbers

Tool to list all imported phone numbers in the workspace.

List WhatsApp Accounts

Tool to list all WhatsApp accounts in the workspace.

Move Bulk Knowledge Base Items

Tool to move multiple documents or folders from one folder to another in the knowledge base.

Move ConvAI Knowledge Base Entity

Tool to move a knowledge base document or folder to a different folder.

Outbound call

Tool to place an outbound call via SIP trunk.

Get API documentation

Retrieves the content of the official ElevenLabs API documentation page hosted on Mintlify.

Tool to register a Twilio call and return TwiML to connect the call to an ElevenLabs Conversational AI agent.

Remove rules from pronunciation dictionary

Permanently removes exact-match pronunciation rules from a specified ElevenLabs pronunciation dictionary using a list of rule strings; non-matching rule strings are ignored and this action cannot add or modify rules.

Resubmit Test Invocations

Tool to resubmit specific test runs from a test invocation for a conversational AI agent.

Retry Batch Call

Tool to retry a batch call, calling failed and no-response recipients again.

Run Agent Tests

Tool to run selected tests on a conversational AI agent with optional configuration overrides.

Set Agent Avatar

Tool to set or update the avatar image for a Conversational AI agent displayed in the widget.

Simulate Conversational AI Agent Conversation

Tool to run a simulated conversation between an agent and an AI user.

Speech to speech

Converts an input audio file to speech using a specified voice; if a `model_id` is provided, it must support speech-to-speech conversion.

Speech to speech streaming

Converts an input audio stream to a different voice output stream in real-time, using a specified speech-to-speech model.

Stream audio isolation

Tool to remove background noise from audio and stream the isolated result.

Stream chapter audio

Streams the audio for a specified chapter snapshot from an ElevenLabs project, optionally converting the output to MPEG format.

Stream ConvAI agent simulate conversation

Tool to run a simulated conversation between an agent and a simulated user, streaming back the response.

Stream project audio

Streams audio from a specific project snapshot, optionally converting it to MPEG format.

Archive project snapshot

Archives an existing project snapshot by its ID, creating a permanent, immutable, and typically irreversible copy of its state.

Submit Batch Call

Tool to submit a batch call.

Text to speech

Converts text to speech using a specified ElevenLabs voice and model, returning a downloadable audio file (use ELEVENLABS_TEXT_TO_SPEECH_STREAM for streaming instead).

Text to speech stream

Converts text to a spoken audio stream (no saved file or history entry); use the non-streaming text-to-speech tool when a persistent audio URL is needed.

Update Audio Native project content

Tool to update content for an Audio Native project by uploading a text or HTML file.

Update Conversational AI Agent

Tool to update an existing ElevenLabs Conversational AI agent's settings.

Update Agent Response Test

Tool to update an existing ElevenLabs Conversational AI agent response test by ID.

Update Knowledge Base Document

Tool to update the name of a knowledge base document in ElevenLabs Conversational AI.

Update ConvAI Workspace Secret

Tool to update an existing secret in the ElevenLabs ConvAI workspace.

Update Convai Settings

Tool to update Convai settings for the workspace.

Update Convai Dashboard Settings

Tool to update Convai dashboard settings for the workspace.

Update Conversational AI Tool

Tool to update an existing conversational AI tool in ElevenLabs workspace.

Update project pronunciation dictionaries

Updates a project's pronunciation dictionaries on ElevenLabs to improve text-to-speech accuracy for specialized terms; note that while multiple dictionaries can be applied, the UI only displays the first.

Update pronunciation dictionary

Partially updates a pronunciation dictionary's metadata (name or archived status) without changing its version.

Update Workspace Webhook

Tool to update a specified workspace webhook by its ID.

Voice generation parameters retrieval

Fetches configurable parameters for ElevenLabs voice generation, used to determine available settings (e.

FRAMEWORKS

How to build Elevenlabs MCP Agent with another framework

ChatGPT Work

Use Elevenlabs MCP with ChatGPT Work

Antigravity

Use Elevenlabs MCP with Antigravity

OpenAI Agents SDK

Use Elevenlabs MCP with OpenAI Agents SDK

Claude Agent SDK

Use Elevenlabs MCP with Claude Agent SDK

Claude Code

Use Elevenlabs MCP with Claude Code

Claude Cowork

Use Elevenlabs MCP with Claude Cowork

Codex

Use Elevenlabs MCP with Codex

Kimi Code

Use Elevenlabs MCP with Kimi Code

Grok Build

Use Elevenlabs MCP with Grok Build

Cursor

Use Elevenlabs MCP with Cursor

VS Code

Use Elevenlabs MCP with VS Code

OpenCode

Use Elevenlabs MCP with OpenCode

OpenClaw

Use Elevenlabs MCP with OpenClaw

Hermes

Use Elevenlabs MCP with Hermes

CLI

Use Elevenlabs MCP with CLI

Google ADK

Use Elevenlabs MCP with Google ADK

LangChain

Use Elevenlabs MCP with LangChain

Mastra AI

Use Elevenlabs MCP with Mastra AI

LlamaIndex

Use Elevenlabs MCP with LlamaIndex

CrewAI

Use Elevenlabs MCP with CrewAI

MORE TOOLKITS

Explore Other Toolkits

Toolkit marketplace

Youtube

Oauth2

YouTube is a leading video-sharing platform for uploading, streaming, and discovering content. It empowers creators and businesses to reach global audiences and monetize their work.

Amara

Api Key

Amara is a collaborative platform for creating and managing subtitles and captions for videos. It helps make content accessible and multilingual for global audiences.

Cats

Api Key

Cats is an API with a huge library of cat images, breed data, and cat facts. It makes finding adorable cat photos and trivia effortless for your apps and users.

Chatfai

Api Key

Chatfai is an AI platform that lets users talk to AI versions of fictional characters from books, movies, and games. It offers an engaging, interactive experience for fans to chat, roleplay, and explore creative dialogues.

FAQ

Frequently asked questions

What are the differences between Tool Router MCP and Elevenlabs MCP?

With a standalone Elevenlabs MCP server, agents and LLMs can only access the fixed set of Elevenlabs tools tied to that server. With the Composio Tool Router, agents can dynamically load tools from Elevenlabs and many other apps based on the task at hand, all through a single MCP endpoint.

Can I use Tool Router MCP with Vercel AI SDK v6?

Yes. Vercel AI SDK v6 fully supports MCP integration. You get structured tool calling, message history handling, and model orchestration, while Tool Router takes care of discovering and serving the right Elevenlabs tools.

Can I manage the permissions and scopes for Elevenlabs while using Tool Router?

Yes. You can configure which Elevenlabs scopes and actions are allowed when connecting your account to Composio. You can also bring your own OAuth credentials or API configuration, so you keep full control over what the agent can do.

How safe is my data with Composio Tool Router?

All sensitive data, such as tokens, keys, and configuration, is fully encrypted at rest and in transit. Composio is SOC 2 Type 2 compliant and follows strict security practices so your Elevenlabs data and credentials are handled as safely as possible.

Start with Elevenlabs. It takes 30 seconds.

Managed auth, hosted MCP servers, and every Elevenlabs tool your agent needs. Free to start.

Start building
