How to integrate Elevenlabs MCP with LlamaIndex

Connect LlamaIndex to Elevenlabs MCP. Convert this chapter text to audio, create a custom project for your audiobook, and more using natural language, with authentication handled for you.

Get started for free Get a demo

Elevenlabs

Api Key

Elevenlabs is an advanced AI voice generation platform for lifelike, multilingual speech synthesis. Perfect for creating natural voices for videos, apps, and business content in seconds.

155 Tools

Managed auth

Connect Elevenlabs without auth hassles

We manage OAuth, API keys, token refresh, and scopes — you just build.

Try for free

Introduction

This guide walks you through connecting Elevenlabs to LlamaIndex using the Composio tool router. By the end, you'll have a working Elevenlabs agent that can convert this chapter text to audio, create a custom project for your audiobook, add a new pronunciation rule for this word through natural language commands.

This guide will help you understand how to give your LlamaIndex agent real control over a Elevenlabs account through Composio's Elevenlabs MCP server.

Before we dive in, let's take a quick look at the key ideas and tools involved.

Also integrate Elevenlabs with

ChatGPT Work Antigravity OpenAI Agents SDK Claude Agent SDK Claude Code Claude Cowork Codex Kimi Code Grok Build OpenCode Cursor VS Code OpenClaw Hermes CLI Google ADK LangChain Vercel AI SDK Mastra AI CrewAI

TL;DR

Here's what you'll learn:

Set your OpenAI and Composio API keys
Install LlamaIndex and Composio packages
Create a Composio Tool Router session for Elevenlabs
Connect LlamaIndex to the Elevenlabs MCP server
Build a Elevenlabs-powered agent using LlamaIndex
Interact with Elevenlabs through natural language

What is LlamaIndex?

LlamaIndex is a data framework for building LLM applications. It provides tools for connecting LLMs to external data sources and services through agents and tools.

Key features include:

ReAct Agent: Reasoning and acting pattern for tool-using agents
MCP Tools: Native support for Model Context Protocol
Context Management: Maintain conversation context across interactions
Async Support: Built for async/await patterns

What is the Elevenlabs MCP server, and what's possible with it?

The Elevenlabs MCP server is an implementation of the Model Context Protocol that connects your AI agent and assistants like Claude, Cursor, etc directly to your Elevenlabs account. It provides structured and secure access to your voice synthesis projects and tools, so your agent can perform actions like generating audio from text, managing custom voices, organizing projects, and fine-tuning pronunciation on your behalf.

Project and chapter audio conversion: Instantly convert text content from chapters or entire projects into high-quality, natural-sounding audio files.
Custom voice creation and management: Guide your agent to add, finalize, or share custom voices—either by uploading new samples or assembling voices from existing data.
Pronunciation dictionary and rule management: Improve the accuracy of speech outputs by adding pronunciation dictionaries or custom pronunciation rules directly from files or specific aliases/phonemes.
Project organization and automation: Let your agent create new projects, add or remove chapters, and automate speech synthesis workflows for audiobooks, podcasts, or media production.
Embeddable audio player generation: Enable your agent to generate AudioNative projects, creating customizable and embeddable audio players from your content with just a prompt.

What is the Composio tool router, and how does it fit here?

What is Composio SDK?

Composio's Composio SDK helps agents find the right tools for a task at runtime. You can plug in multiple toolkits (like Gmail, HubSpot, and GitHub), and the agent will identify the relevant app and action to complete multi-step workflows. This can reduce token usage and improve the reliability of tool calls. Read more here: Getting started with Composio SDK

The tool router generates a secure MCP URL that your agents can access to perform actions.

How the Composio SDK works

The Composio SDK follows a three-phase workflow:

Discovery: Searches for tools matching your task and returns relevant toolkits with their details.
Authentication: Checks for active connections. If missing, creates an auth config and returns a connection URL via Auth Link.
Execution: Executes the action using the authenticated connection.

Step-by-step Guide

Step by step10 STEPS

Prerequisites

Before you begin, make sure you have:

Python 3.8/Node 16 or higher installed
A Composio account with the API key
An OpenAI API key
A Elevenlabs account and project
Basic familiarity with async Python/Typescript

Getting API Keys for OpenAI, Composio, and Elevenlabs

OpenAI API key (OPENAI_API_KEY)

Go to the OpenAI dashboard
Create an API key if you don't have one
Assign it to OPENAI_API_KEY in .env

Composio API key and user ID

Log into the Composio dashboard
Copy your API key from Settings
- Use this as COMPOSIO_API_KEY
Pick a stable user identifier (email or ID)
- Use this as COMPOSIO_USER_ID

Installing dependencies

npm install @composio/llamaindex @llamaindex/openai @llamaindex/tools @llamaindex/workflow dotenv

Create a new Typescript project and install the necessary dependencies:

@composio/llamaindex: Composio's LlamaIndex integration
@llamaindex/openai: OpenAI LLM integration
@llamaindex/tools: MCP client for LlamaIndex
@llamaindex/workflow: Workflow framework for LlamaIndex
dotenv: Environment variable management

Set environment variables

bash

OPENAI_API_KEY=your-openai-api-key
COMPOSIO_API_KEY=your-composio-api-key
COMPOSIO_USER_ID=your-user-id

Create a .env file in your project root:

These credentials will be used to:

Authenticate with OpenAI's GPT-5 model
Connect to Composio's Tool Router
Identify your Composio user session for Elevenlabs access

Import modules

import "dotenv/config";
import readline from "node:readline/promises";
import { stdin as input, stdout as output } from "node:process";

import { Composio } from "@composio/core";

import { mcp } from "@llamaindex/tools";
import { agent as createAgent } from "@llamaindex/workflow";
import { openai } from "@llamaindex/openai";

dotenv.config();

Create a new file called elevenlabs_llamaindex_agent.ts and import the required modules:

Key imports:

dotenv.config loads .env at runtime
readline gives us a simple CLI chat loop
Composio is the main Composio SDK client
mcp connects to an MCP endpoint
createAgent builds a LlamaIndex agent
openai configures the LLM backend

Load environment variables and initialize Composio

const OPENAI_API_KEY = process.env.OPENAI_API_KEY;
const COMPOSIO_API_KEY = process.env.COMPOSIO_API_KEY;
const COMPOSIO_USER_ID = process.env.COMPOSIO_USER_ID;

if (!OPENAI_API_KEY) throw new Error("OPENAI_API_KEY is not set");
if (!COMPOSIO_API_KEY) throw new Error("COMPOSIO_API_KEY is not set");
if (!COMPOSIO_USER_ID) throw new Error("COMPOSIO_USER_ID is not set");

What's happening:

This ensures missing credentials cause early, clear errors before the agent attempts to initialise.

Create a Tool Router session and build the agent function

async function buildAgent() {

  console.log(`Initializing Composio client...${COMPOSIO_USER_ID!}...`);
  console.log(`COMPOSIO_USER_ID: ${COMPOSIO_USER_ID!}...`);

  const composio = new Composio({
    apiKey: COMPOSIO_API_KEY,
    provider: new LlamaindexProvider(),
  });

  const session = await composio.create(
    COMPOSIO_USER_ID!,
    {
      toolkits: ["elevenlabs"],
    },
  );

  const mcpUrl = session.mcp.url;
  console.log(`Composio Tool Router MCP URL: ${mcpUrl}`);

  const server = mcp({
    url: mcpUrl,
    clientName: "composio_tool_router_with_llamaindex",
    requestInit: {
      headers: {
        "x-api-key": COMPOSIO_API_KEY!,
      },
    },
    // verbose: true,
  });

  const tools = await server.tools();

  const llm = openai({ apiKey: OPENAI_API_KEY, model: "gpt-5" });

  const agent = createAgent({
    name: "composio_tool_router_with_llamaindex",
        description : "An agent that uses Composio Tool Router MCP tools to perform actions.",
    systemPrompt:
      "You are a helpful assistant connected to Composio Tool Router."+
"Use the available tools to answer user queries and perform Elevenlabs actions." ,
    llm,
    tools,
  });

  return agent;
}

What's happening here:

We create a Composio client using your API key and configure it with the LlamaIndex provider
We then create a tool router MCP session for your user, specifying the toolkits we want to use (in this case, elevenlabs)
The session returns an MCP HTTP endpoint URL that acts as a gateway to all your configured tools
LlamaIndex will connect to this endpoint to dynamically discover and use the available Elevenlabs tools.
The MCP tools are mapped to LlamaIndex-compatible tools and plug them into the Agent.

Create an interactive chat loop

async function chatLoop(agent: ReturnType<typeof createAgent>) {
  const rl = readline.createInterface({ input, output });

  console.log("Type 'quit' or 'exit' to stop.");

  while (true) {
    let userInput: string;

    try {
      userInput = (await rl.question("\nYou: ")).trim();
    } catch {
      console.log("\nAgent: Bye!");
      break;
    }

    if (!userInput) {
      continue;
    }

    const lower = userInput.toLowerCase();
    if (lower === "quit" || lower === "exit") {
      console.log("Agent: Bye!");
      break;
    }

    try {
      process.stdout.write("Agent: ");

      const stream = agent.runStream(userInput);
      let finalResult: any = null;

      for await (const event of stream) {
        // The event.data contains the streamed content
        const data: any = event.data;

        // Check for streaming delta content
        if (data?.delta) {
          process.stdout.write(data.delta);
        }

        // Store final result for fallback
        if (data?.result || data?.message) {
          finalResult = data;
        }
      }

      // If no streaming happened, show the final result
      if (finalResult) {
        const answer =
          finalResult.result ??
          finalResult.message?.content ??
          finalResult.message ??
          "";
        if (answer && typeof answer === "string" && !answer.includes("[object")) {
          process.stdout.write(answer);
        }
      }

      console.log(); // New line after streaming completes
    } catch (err: any) {
      console.error("\nAgent error:", err?.message ?? err);
    }
  }

  rl.close();
}

What's happening:

We're creating a direct terminal interface to chat with Elevenlabs
The LLM's responses are streamed to the CLI for faster interaction.
The agent uses context to maintain conversation history
The agent processes the request, selects appropriate Elevenlabs tools, and returns a result
We extract the answer from the result data structure and display it to the user
You can type 'quit' or 'exit' to stop the chat loop gracefully
Agent responses and any errors are streamed in a clear, readable format

Define the main entry point

async function main() {
  try {
    const agent = await buildAgent();
    await chatLoop(agent);
  } catch (err) {
    console.error("Failed to start agent:", err);
    process.exit(1);
  }
}

main();

What's happening here:

We're orchestrating the entire application flow
The agent gets built with proper error handling
Then we kick off the interactive chat loop so you can start talking to Elevenlabs

Run the agent

npx ts-node llamaindex-agent.ts

When prompted, authenticate and authorise your agent with Elevenlabs, then start asking questions.

Complete Code

Here's the complete code to get you started with Elevenlabs and LlamaIndex:

import "dotenv/config";
import readline from "node:readline/promises";
import { stdin as input, stdout as output } from "node:process";

import { Composio } from "@composio/core";
import { LlamaindexProvider } from "@composio/llamaindex";

import { mcp } from "@llamaindex/tools";
import { agent as createAgent } from "@llamaindex/workflow";
import { openai } from "@llamaindex/openai";

dotenv.config();

const OPENAI_API_KEY = process.env.OPENAI_API_KEY;
const COMPOSIO_API_KEY = process.env.COMPOSIO_API_KEY;
const COMPOSIO_USER_ID = process.env.COMPOSIO_USER_ID;

if (!OPENAI_API_KEY) {
    throw new Error("OPENAI_API_KEY is not set in the environment");
  }
if (!COMPOSIO_API_KEY) {
    throw new Error("COMPOSIO_API_KEY is not set in the environment");
  }
if (!COMPOSIO_USER_ID) {
    throw new Error("COMPOSIO_USER_ID is not set in the environment");
  }

async function buildAgent() {

  console.log(`Initializing Composio client...${COMPOSIO_USER_ID!}...`);
  console.log(`COMPOSIO_USER_ID: ${COMPOSIO_USER_ID!}...`);

  const composio = new Composio({
    apiKey: COMPOSIO_API_KEY,
    provider: new LlamaindexProvider(),
  });

  const session = await composio.create(
    COMPOSIO_USER_ID!,
    {
      toolkits: ["elevenlabs"],
    },
  );

  const mcpUrl = session.mcp.url;
  console.log(`Composio Tool Router MCP URL: ${mcpUrl}`);

  const server = mcp({
    url: mcpUrl,
    clientName: "composio_tool_router_with_llamaindex",
    requestInit: {
      headers: {
        "x-api-key": COMPOSIO_API_KEY!,
      },
    },
    // verbose: true,
  });

  const tools = await server.tools();

  const llm = openai({ apiKey: OPENAI_API_KEY, model: "gpt-5" });

  const agent = createAgent({
    name: "composio_tool_router_with_llamaindex",
    description:
      "An agent that uses Composio Tool Router MCP tools to perform actions.",
    systemPrompt:
      "You are a helpful assistant connected to Composio Tool Router."+
"Use the available tools to answer user queries and perform Elevenlabs actions." ,
    llm,
    tools,
  });

  return agent;
}

async function chatLoop(agent: ReturnType<typeof createAgent>) {
  const rl = readline.createInterface({ input, output });

  console.log("Type 'quit' or 'exit' to stop.");

  while (true) {
    let userInput: string;

    try {
      userInput = (await rl.question("\nYou: ")).trim();
    } catch {
      console.log("\nAgent: Bye!");
      break;
    }

    if (!userInput) {
      continue;
    }

    const lower = userInput.toLowerCase();
    if (lower === "quit" || lower === "exit") {
      console.log("Agent: Bye!");
      break;
    }

    try {
      process.stdout.write("Agent: ");

      const stream = agent.runStream(userInput);
      let finalResult: any = null;

      for await (const event of stream) {
        // The event.data contains the streamed content
        const data: any = event.data;

        // Check for streaming delta content
        if (data?.delta) {
          process.stdout.write(data.delta);
        }

        // Store final result for fallback
        if (data?.result || data?.message) {
          finalResult = data;
        }
      }

      // If no streaming happened, show the final result
      if (finalResult) {
        const answer =
          finalResult.result ??
          finalResult.message?.content ??
          finalResult.message ??
          "";
        if (answer && typeof answer === "string" && !answer.includes("[object")) {
          process.stdout.write(answer);
        }
      }

      console.log(); // New line after streaming completes
    } catch (err: any) {
      console.error("\nAgent error:", err?.message ?? err);
    }
  }

  rl.close();
}

async function main() {
  try {
    const agent = await buildAgent();
    await chatLoop(agent);
  } catch (err: any) {
    console.error("Failed to start agent:", err?.message ?? err);
    process.exit(1);
  }
}

main();

Conclusion

You've successfully connected Elevenlabs to LlamaIndex through Composio's Tool Router MCP layer. Key takeaways:

Tool Router dynamically exposes Elevenlabs tools through an MCP endpoint
LlamaIndex's ReActAgent handles reasoning and orchestration; Composio handles integrations
The agent becomes more capable without increasing prompt size
Async Python provides clean, efficient execution of agent workflows

You can easily extend this to other toolkits like Gmail, Notion, Stripe, GitHub, and more by adding them to the toolkits parameter.

TOOLS

Supported Tools

Every Elevenlabs action and event your agent gets out of the box.

Add a pronunciation dictionary from file

Adds a new pronunciation dictionary from a lexicon file to improve speech synthesis accuracy.

Add outbound phone number

Tool to import/register a Twilio phone number or SIP trunk into ElevenLabs Agents Platform.

Add new project with attributes

Use to create a new ElevenLabs project for text-to-speech synthesis (e.

Add pronunciation dictionary from rules

Tool to create a new pronunciation dictionary from provided rules for ElevenLabs text-to-speech.

Add rules to the pronunciation dictionary

Adds one or more custom pronunciation rules (alias or phoneme) to an existing pronunciation dictionary.

Add sharing voice

Adds an existing, shareable voice to a specified user's ElevenLabs account library under a new custom name, requiring the user's public ID and the voice ID.

Add a voice

Adds a custom voice by uploading audio samples for voice cloning.

Attach phone number to agent

Tool to assign or unassign an existing imported phone number to an agent by updating the phone number's assigned agent.

Calculate ConvAI Agent LLM Usage

Tool to calculate expected number of LLM tokens needed for a conversational AI agent.

Calculate ConvAI LLM Usage

Tool to calculate expected LLM usage costs for conversational AI agents.

Cancel Batch Call

Tool to cancel a running batch call and set all recipients to cancelled status.

Convert chapter to audio

Converts the textual content of a chapter, identified by `chapter_id` within a `project_id`, into audio format.

Convert a project

Converts an existing ElevenLabs Studio project, including all its chapters and using its configured settings and voices, into speech.

Create a previously generated voice

Finalizes the creation of a voice using its `generated_voice_id` from a previous generation step by assigning a name, description, and optional labels.

Create Conversational AI Agent Test

Tool to create a new ElevenLabs Conversational AI agent response test.

Add to ConvAI Knowledge Base

Tool to add documentation to ElevenLabs Conversational AI knowledge base by uploading a file or referencing a webpage URL.

Create ConvAI Knowledge Base File

Tool to create a knowledge base document from an uploaded file for ElevenLabs Conversational AI agents.

Create ConvAI Knowledge Base Folder

Tool to create a folder in the ElevenLabs ConvAI knowledge base for organizing documents.

Create ConvAI Knowledge Base RAG Index

Tool to compute or retrieve RAG indexes for multiple knowledge base documents in batch.

Create Knowledge Base Text Document

Tool to create a knowledge base document with text content in ElevenLabs Conversational AI.

Create Knowledge Base URL Document

Tool to create a knowledge base document by scraping the given webpage.

Create ConvAI Workspace Secret

Tool to create a new secret for the ElevenLabs ConvAI workspace.

Create Conversational AI Tool

Tool to create a new conversational AI tool in ElevenLabs workspace.

Create Conversational AI Agent

Tool to create a new ElevenLabs Conversational AI agent with specified configuration.

Generate Music Composition Plan

Tool to generate a music composition plan from a text prompt using ElevenLabs Music API.

Create an AudioNative enabled project

Creates an ElevenLabs AudioNative project, generating an embeddable audio player from a provided content file using text-to-speech, allowing customization of player appearance, audio settings, and conversion options.

Get similar library voices

Returns a list of shared voices similar to the provided audio sample.

Create Single Use Token

Tool to generate a time-limited single-use token with embedded authentication for frontend clients.

Create Workspace Webhook

Tool to create a new webhook for the workspace with specified authentication type.

Delete chapter from project

Irreversibly deletes a specific, existing chapter from an existing project, typically to remove unwanted or obsolete content.

Delete Conversational AI Agent

Tool to permanently delete a Conversational AI agent by its unique identifier.

Delete agent response test

Tool to delete an agent response test by ID.

Delete batch call

Tool to permanently delete a batch call and all associated recipient records.

Delete conversation by ID

Tool to delete a particular Conversational AI conversation by ID.

Delete Knowledge Base Document or Folder

Tool to delete a document or folder from the knowledge base.

Delete ConvAI Knowledge Base RAG Index

Tool to delete RAG index for a knowledge base document.

Delete workspace secret

Tool to delete a workspace secret if it's not in use.

Delete conversational AI tool

Tool to delete a conversational AI tool from the workspace by ID.

Delete a dubbing project

Permanently deletes a dubbing project by its ID; this action is irreversible and the project cannot be recovered.

Delete history item

Permanently deletes a specific history item (including its audio file and metadata) using its `history_item_id`; this operation is irreversible and should be used with caution.

Delete MCP server

Tool to delete a specific MCP server configuration from the workspace.

Delete phone number by id

Tool to delete an imported phone number from the ElevenLabs workspace by ID.

Delete project by id

Use to irreversibly delete a specific project by its `project_id`; the project must exist and be accessible, and this action cannot be undone.

Delete voice sample

Permanently deletes a specific voice sample for a given voice ID; this action is irreversible.

Delete voice by id

Permanently and irreversibly deletes a specific custom voice using its `voice_id`; the voice must exist and the authenticated user must have permission to delete it.

Delete workspace webhook

Tool to delete a specified workspace webhook by its ID.

Download history items

Downloads audio clips from history by ID(s), returning a single file or a ZIP archive, with an optional output format (e.

Dub a video or an audio file

Dub a video or audio file into a specified target language, requiring 'file' or 'source_url', 'target_lang', and 'csv_file' if 'mode' is 'manual'.

Duplicate Conversational AI Agent

Tool to create a new agent by duplicating an existing one.

Edit voice

Updates the name, audio files, description, or labels for an existing voice model.

Edit voice settings

Edits key voice settings (e.

Generate a random voice

Generates a unique, random ElevenLabs text-to-speech voice based on input text and specified voice characteristics.

Get agent details

Tool to retrieve available Conversational AI agents and outbound-capable Twilio phone numbers.

Get Agent Link

Tool to get the current shareable link for a Conversational AI agent.

Get user profile

Retrieves the profile information for the authenticated ElevenLabs user (identified by API key).

Get audio from history item

Retrieves the audio content for a specific history item from ElevenLabs, using a `history_item_id` that must correspond to a previously generated audio.

Get sample audio

Retrieves the audio for a given `sample_id` that must belong to the specified `voice_id`.

Get audio native settings

Tool to retrieve player settings for a specific Audio Native project.

Get chapter by ID

Fetches comprehensive details for a specific chapter within a given project, including its metadata (name, ID), conversion status, progress, download availability, and content statistics.

Get chapters by project id

Retrieves a list of all chapters, their details, and conversion status for a project, useful for managing content or tracking progress.

Get chapter snapshots

Retrieves all saved version snapshots for a specific chapter within a given project, enabling review of its history or reversion to prior states.

Get Conversational AI Agent

Tool to retrieve the complete configuration for a specific Conversational AI agent by ID.

Get Agent Knowledge Base Size

Tool to retrieve the number of pages in a conversational AI agent's knowledge base.

Get ConvAI Agents Summaries

Tool to retrieve summaries for specified Conversational AI agents.

Get Agent Widget Config

Tool to retrieve the widget configuration for a Conversational AI agent.

Get Agent Response Test By ID

Tool to retrieve an ElevenLabs Conversational AI agent response test by its ID.

Get conversational AI analytics live count

Tool to retrieve the live count of active ongoing Conversational AI conversations.

Get batch call details

Tool to get detailed information about a batch call including all recipients.

Get Batch Calls for Workspace

Tool to retrieve all batch calls for the current workspace.

Get Conversational AI Conversations

Tool to retrieve all conversations of agents that user owns.

Get ConvAI Knowledge Base

Tool to retrieve a list of available knowledge base documents.

Get Knowledge Base Document Content

Tool to retrieve the entire content of a document from the knowledge base.

Get Knowledge Base Dependent Agents

Tool to retrieve a list of agents depending on a specific knowledge base document.

Get Knowledge Base Documentation

Tool to get details about a specific documentation making up the agent's knowledge base.

Get Knowledge Base RAG Index Overview

Tool to retrieve RAG index overview including total size and usage information.

Get ConvAI Knowledge Base RAG Indexes

Tool to retrieve all RAG indexes for a specified knowledge base document.

Get Knowledge Base Source File URL

Tool to get a signed URL to download the original source file of a file-type document from the knowledge base.

Get Knowledge Base Summaries

Tool to retrieve knowledge base document summaries by their IDs.

Get ConvAI MCP Server

Tool to retrieve a specific MCP server configuration from the workspace.

Get ConvAI MCP Servers

Tool to retrieve all MCP (Model Context Protocol) server configurations available in the workspace.

Get ConvAI MCP Server Tools

Tool to retrieve all tools available for a specific MCP server configuration.

Get Phone Number by ID

Tool to retrieve detailed configuration for a specific phone number by ID.

Get ConvAI Workspace Secrets

Tool to retrieve all workspace secrets for the user.

Get Convai Settings

Tool to retrieve Convai settings for the workspace.

Get Convai Dashboard Settings

Tool to retrieve Convai dashboard settings for the workspace.

List Test Invocations

Tool to list all test invocations for a specific conversational AI agent with pagination support.

Get Conversational AI Tool

Tool to retrieve the complete configuration for a specific conversational AI tool by ID.

Get ConvAI tools

Tool to retrieve all available tools in the workspace.

Get Tool Dependent Agents

Tool to retrieve a list of agents depending on a specific tool.

Get conversation by ID

Tool to fetch full details for a single Conversational AI conversation by ID.

Get Conversation Signed URL

Tool to get a signed URL to start a conversation with an agent that requires authorization.

Get default voice settings

Retrieves the ElevenLabs text-to-speech service's default voice settings (stability, similarity boost, style, speaker boost) that are applied when no voice-specific or request-specific settings are provided.

Get dubbed audio for a language

Retrieves an existing dubbed audio file for a specific `dubbing_id` and `language_code`.

Get dubbing project metadata

Retrieves metadata and status for a specific dubbing project by its ID.

Get dubbing transcript in specific format

Retrieves the transcript for a specific dubbing project and language in the requested format (SRT, WebVTT, or JSON).

Get generated items

Retrieves metadata for a list of generated audio items from history, supporting pagination and optional filtering by voice ID.

Get history item by id

Retrieves detailed information (excluding the audio file) for a specific audio generation history item from ElevenLabs, using its unique ID.

Get MCP Tool Configuration

Tool to retrieve configuration overrides for a specific MCP tool within an MCP server.

Get pronunciation dictionary metadata

Retrieves metadata for a specific, existing pronunciation dictionary from ElevenLabs using its ID.

Get models

Retrieves a detailed list of all available ElevenLabs text-to-speech (TTS) models and their capabilities.

Get project by ID

Use to retrieve all details for a specific project, including its chapters and their conversion statuses, by providing the project's unique ID.

Get projects

Fetches a list of all projects and their details associated with the user's ElevenLabs account; this is a read-only operation.

Get project snapshots

Retrieves all available snapshots (saved states or versions) for an existing project, enabling history tracking, version comparison, or accessing specific states for playback/processing, particularly in text-to-speech workflows.

Get pronunciation dictionaries

Retrieves a paginated list of pronunciation dictionaries, used to customize how specific words or phrases are pronounced by the text-to-speech (TTS) engine.

Get pronunciation dictionary version

Downloads the Pronunciation Lexicon Specification (PLS) file for an existing version of a pronunciation dictionary from ElevenLabs, used to customize TTS pronunciation.

Get Service Accounts

Tool to list all service accounts in the workspace.

Get shared voices

Retrieves a paginated and filterable list of shared voices from the ElevenLabs Voice Library.

Get sso provider admin

Retrieves the SSO provider configuration for a specified workspace, typically for review purposes, and will indicate if no configuration exists.

Get Agent Response Test Summaries

Tool to retrieve multiple agent response test summaries by their IDs.

Get dubbing transcript by language

Retrieves the textual transcript for a specified dubbing project and language, if one exists for that language in the project.

Get Usage Character Stats

Tool to retrieve usage metrics for the current user or entire workspace.

Get user info

Retrieves detailed information about the authenticated ElevenLabs user's account, including subscription, usage, API key, and status.

Get user subscription info

Retrieves detailed subscription information for the currently authenticated ElevenLabs user.

Get voice

Retrieves comprehensive details for a specific, existing voice by its `voice_id`, optionally including its settings.

Get voices list

Retrieves a list of all available voices along with their detailed attributes and settings.

Get voice settings

Retrieves the stability, similarity, style, and speaker boost settings for a specific, existing ElevenLabs voice using its `voice_id`.

Get workspace resource metadata

Tool to get metadata of a workspace resource by ID and type.

Get Workspace Webhooks

Tool to list all webhooks configured for the workspace.

List Conversational AI Agent Tests

Tool to list all agent response tests with pagination support and optional search filtering.

List Dubs

Tool to list dubbing projects you have access to.

List Phone Numbers

Tool to list all imported phone numbers in the workspace.

List WhatsApp Accounts

Tool to list all WhatsApp accounts in the workspace.

Move Bulk Knowledge Base Items

Tool to move multiple documents or folders from one folder to another in the knowledge base.

Move ConvAI Knowledge Base Entity

Tool to move a knowledge base document or folder to a different folder.

Outbound call

Tool to place an outbound call via SIP trunk.

Get API documentation

Retrieves the content of the official ElevenLabs API documentation page hosted on Mintlify.

Tool to register a Twilio call and return TwiML to connect the call to an ElevenLabs Conversational AI agent.

Remove rules from pronunciation dictionary

Permanently removes exact-match pronunciation rules from a specified ElevenLabs pronunciation dictionary using a list of rule strings; non-matching rule strings are ignored and this action cannot add or modify rules.

Resubmit Test Invocations

Tool to resubmit specific test runs from a test invocation for a conversational AI agent.

Retry Batch Call

Tool to retry a batch call, calling failed and no-response recipients again.

Run Agent Tests

Tool to run selected tests on a conversational AI agent with optional configuration overrides.

Set Agent Avatar

Tool to set or update the avatar image for a Conversational AI agent displayed in the widget.

Simulate Conversational AI Agent Conversation

Tool to run a simulated conversation between an agent and an AI user.

Speech to speech

Converts an input audio file to speech using a specified voice; if a `model_id` is provided, it must support speech-to-speech conversion.

Speech to speech streaming

Converts an input audio stream to a different voice output stream in real-time, using a specified speech-to-speech model.

Stream audio isolation

Tool to remove background noise from audio and stream the isolated result.

Stream chapter audio

Streams the audio for a specified chapter snapshot from an ElevenLabs project, optionally converting the output to MPEG format.

Stream ConvAI agent simulate conversation

Tool to run a simulated conversation between an agent and a simulated user, streaming back the response.

Stream project audio

Streams audio from a specific project snapshot, optionally converting it to MPEG format.

Archive project snapshot

Archives an existing project snapshot by its ID, creating a permanent, immutable, and typically irreversible copy of its state.

Submit Batch Call

Tool to submit a batch call.

Text to speech

Converts text to speech using a specified ElevenLabs voice and model, returning a downloadable audio file (use ELEVENLABS_TEXT_TO_SPEECH_STREAM for streaming instead).

Text to speech stream

Converts text to a spoken audio stream (no saved file or history entry); use the non-streaming text-to-speech tool when a persistent audio URL is needed.

Update Audio Native project content

Tool to update content for an Audio Native project by uploading a text or HTML file.

Update Conversational AI Agent

Tool to update an existing ElevenLabs Conversational AI agent's settings.

Update Agent Response Test

Tool to update an existing ElevenLabs Conversational AI agent response test by ID.

Update Knowledge Base Document

Tool to update the name of a knowledge base document in ElevenLabs Conversational AI.

Update ConvAI Workspace Secret

Tool to update an existing secret in the ElevenLabs ConvAI workspace.

Update Convai Settings

Tool to update Convai settings for the workspace.

Update Convai Dashboard Settings

Tool to update Convai dashboard settings for the workspace.

Update Conversational AI Tool

Tool to update an existing conversational AI tool in ElevenLabs workspace.

Update project pronunciation dictionaries

Updates a project's pronunciation dictionaries on ElevenLabs to improve text-to-speech accuracy for specialized terms; note that while multiple dictionaries can be applied, the UI only displays the first.

Update pronunciation dictionary

Partially updates a pronunciation dictionary's metadata (name or archived status) without changing its version.

Update Workspace Webhook

Tool to update a specified workspace webhook by its ID.

Voice generation parameters retrieval

Fetches configurable parameters for ElevenLabs voice generation, used to determine available settings (e.

FRAMEWORKS

How to build Elevenlabs MCP Agent with another framework

ChatGPT Work

Use Elevenlabs MCP with ChatGPT Work

Antigravity

Use Elevenlabs MCP with Antigravity

OpenAI Agents SDK

Use Elevenlabs MCP with OpenAI Agents SDK

Claude Agent SDK

Use Elevenlabs MCP with Claude Agent SDK

Claude Code

Use Elevenlabs MCP with Claude Code

Claude Cowork

Use Elevenlabs MCP with Claude Cowork

Codex

Use Elevenlabs MCP with Codex

Kimi Code

Use Elevenlabs MCP with Kimi Code

Grok Build

Use Elevenlabs MCP with Grok Build

Cursor

Use Elevenlabs MCP with Cursor

VS Code

Use Elevenlabs MCP with VS Code

OpenCode

Use Elevenlabs MCP with OpenCode

OpenClaw

Use Elevenlabs MCP with OpenClaw

Hermes

Use Elevenlabs MCP with Hermes

CLI

Use Elevenlabs MCP with CLI

Google ADK

Use Elevenlabs MCP with Google ADK

LangChain

Use Elevenlabs MCP with LangChain

Vercel AI SDK

Use Elevenlabs MCP with Vercel AI SDK

Mastra AI

Use Elevenlabs MCP with Mastra AI

CrewAI

Use Elevenlabs MCP with CrewAI

MORE TOOLKITS

Explore Other Toolkits

Toolkit marketplace

Youtube

Oauth2

YouTube is a leading video-sharing platform for uploading, streaming, and discovering content. It empowers creators and businesses to reach global audiences and monetize their work.

Amara

Api Key

Amara is a collaborative platform for creating and managing subtitles and captions for videos. It helps make content accessible and multilingual for global audiences.

Cats

Api Key

Cats is an API with a huge library of cat images, breed data, and cat facts. It makes finding adorable cat photos and trivia effortless for your apps and users.

Chatfai

Api Key

Chatfai is an AI platform that lets users talk to AI versions of fictional characters from books, movies, and games. It offers an engaging, interactive experience for fans to chat, roleplay, and explore creative dialogues.

FAQ

Frequently asked questions

With a standalone Elevenlabs MCP server, the agents and LLMs can only access a fixed set of Elevenlabs tools tied to that server. However, with the Composio Tool Router, agents can dynamically load tools from Elevenlabs and many other apps based on the task at hand, all through a single MCP endpoint.

Yes, you can. LlamaIndex fully supports MCP integration. You get structured tool calling, message history handling, and model orchestration while Tool Router takes care of discovering and serving the right Elevenlabs tools.

Yes, absolutely. You can configure which Elevenlabs scopes and actions are allowed when connecting your account to Composio. You can also bring your own OAuth credentials or API configuration so you keep full control over what the agent can do.

All sensitive data such as tokens, keys, and configuration is fully encrypted at rest and in transit. Composio is SOC 2 Type 2 compliant and follows strict security practices so your Elevenlabs data and credentials are handled as safely as possible.

Start with Elevenlabs.It takes 30 seconds.

Managed auth, hosted MCP servers, and every Elevenlabs tool your agent needs.Free to start.

Start building

How to integrate Elevenlabs MCP with LlamaIndex

Connect Elevenlabs without auth hassles

Introduction

Also integrate Elevenlabs with

TL;DR

What is LlamaIndex?

What is the Elevenlabs MCP server, and what's possible with it?

What is the Composio tool router, and how does it fit here?

What is Composio SDK?

How the Composio SDK works

Step-by-step Guide

Prerequisites

Getting API Keys for OpenAI, Composio, and Elevenlabs

Installing dependencies

Set environment variables

Import modules

Load environment variables and initialize Composio

Create a Tool Router session and build the agent function

Create an interactive chat loop

Define the main entry point

Run the agent

Complete Code

Conclusion

Supported Tools

How to build Elevenlabs MCP Agent with another framework

ChatGPT Work

Antigravity

OpenAI Agents SDK

Claude Agent SDK

Claude Code

Claude Cowork

Codex

Kimi Code

Grok Build

Cursor

VS Code

OpenCode

OpenClaw

Hermes

CLI

Google ADK

LangChain

Vercel AI SDK

Mastra AI

CrewAI

Explore Other Toolkits

Youtube

Amara

Cats

Chatfai

Frequently asked questions

What are the differences in Tool Router MCP and Elevenlabs MCP?+

Can I use Tool Router MCP with LlamaIndex?+

Can I manage the permissions and scopes for Elevenlabs while using Tool Router?+

How safe is my data with Composio Tool Router?+

Start with Elevenlabs.It takes 30 seconds.