# Elevenlabs CLI for AI Agents

```json
{
  "title": "Elevenlabs CLI for AI Agents",
  "toolkit": "Elevenlabs",
  "toolkit_slug": "elevenlabs",
  "framework": "CLI",
  "framework_slug": "cli",
  "url": "https://composio.dev/toolkits/elevenlabs/framework/cli",
  "markdown_url": "https://composio.dev/toolkits/elevenlabs/framework/cli.md",
  "updated_at": "2026-05-06T08:10:09.444Z"
}
```

## Introduction

CLIs are eating MCPs, and the industry is converging on the same idea. MCP servers, for all their merit, can be token-hungry, slow, and unreliable for complex tool chaining. Coding agents, meanwhile, have become very good at working with CLIs and are often more comfortable with CLI tools than with MCP.

With Composio's Universal CLI, your coding agents can talk to 1000+ SaaS applications. With Elevenlabs, agents can act on requests such as "convert this chapter text to audio", "create a custom project for your audiobook", or "add a new pronunciation rule for this word", all without worrying about authentication.

This guide walks you through the Composio Universal CLI and explains how to connect it with coding agents such as Claude Code, Codex, and OpenCode for end-to-end Elevenlabs automation.

## Also integrate Elevenlabs with

- [ChatGPT](https://composio.dev/toolkits/elevenlabs/framework/chatgpt)
- [OpenAI Agents SDK](https://composio.dev/toolkits/elevenlabs/framework/open-ai-agents-sdk)
- [Claude Agent SDK](https://composio.dev/toolkits/elevenlabs/framework/claude-agents-sdk)
- [Claude Code](https://composio.dev/toolkits/elevenlabs/framework/claude-code)
- [Claude Cowork](https://composio.dev/toolkits/elevenlabs/framework/claude-cowork)
- [Codex](https://composio.dev/toolkits/elevenlabs/framework/codex)
- [Cursor](https://composio.dev/toolkits/elevenlabs/framework/cursor)
- [VS Code](https://composio.dev/toolkits/elevenlabs/framework/vscode)
- [OpenCode](https://composio.dev/toolkits/elevenlabs/framework/opencode)
- [OpenClaw](https://composio.dev/toolkits/elevenlabs/framework/openclaw)
- [Hermes](https://composio.dev/toolkits/elevenlabs/framework/hermes-agent)
- [Google ADK](https://composio.dev/toolkits/elevenlabs/framework/google-adk)
- [LangChain](https://composio.dev/toolkits/elevenlabs/framework/langchain)
- [Vercel AI SDK](https://composio.dev/toolkits/elevenlabs/framework/ai-sdk)
- [Mastra AI](https://composio.dev/toolkits/elevenlabs/framework/mastra-ai)
- [LlamaIndex](https://composio.dev/toolkits/elevenlabs/framework/llama-index)
- [CrewAI](https://composio.dev/toolkits/elevenlabs/framework/crew-ai)

## TL;DR

The Universal CLI gives agents a single command interface to all your external applications. Here's what you get with it:
- Agent-friendly: Coding agents like Claude Code, Codex, and OpenCode can use CLI tools natively, with no MCP setup required.
- Authentication handled: Connect once via OAuth or an API key, and all CLI commands work with your credentials automatically.
- Tool discovery: Search, inspect, and execute 20,000+ tools across 1000+ apps from one interface.
- Trigger support: Use triggers to listen for events across your apps, powered by real-time webhooks or polling under the hood.
- Type generation: Generate typed schemas for autocomplete and type safety in your projects.

## Connect Elevenlabs to CLI

### Prerequisites
Install the Composio CLI and authenticate:

```bash
# Install the Composio CLI
curl -fsSL https://composio.dev/install | bash

# Authenticate with Composio
composio login
```
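
Before handing the CLI to an agent, it can help to confirm that the install step actually put `composio` on the `PATH`. The following is a minimal sketch using only POSIX shell built-ins; the helper name `require_cmd` is hypothetical and is not part of the Composio CLI.

```bash
#!/bin/sh
# require_cmd NAME: succeed if NAME resolves to a command on PATH.
# Hypothetical helper for illustration; not part of the Composio CLI.
require_cmd() {
  if command -v "$1" >/dev/null 2>&1; then
    return 0
  fi
  echo "error: '$1' not found on PATH; re-run the install step" >&2
  return 1
}

# Usage: only continue with agent setup when the CLI is available.
if require_cmd composio; then
  echo "composio CLI found"
fi
```

If the check fails right after installation, opening a new terminal session is a common fix, since installers often update `PATH` in the shell profile rather than in the current session.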

## What is the Elevenlabs MCP server, and what's possible with it?

The Elevenlabs MCP server is an implementation of the Model Context Protocol that connects AI agents and assistants such as Claude and Cursor directly to your Elevenlabs account. It provides structured, secure access to your voice synthesis projects and tools, so your agent can generate audio from text, manage custom voices, organize projects, and fine-tune pronunciation on your behalf.

- Project and chapter audio conversion: Convert text content from chapters or entire projects into natural-sounding audio files.
- Custom voice creation and management: Guide your agent to add, finalize, or share custom voices, either by uploading new samples or by assembling voices from existing data.
- Pronunciation dictionary and rule management: Improve the accuracy of speech output by adding pronunciation dictionaries from files or custom pronunciation rules defined as aliases or phonemes.
- Project organization and automation: Let your agent create new projects, add or remove chapters, and automate speech synthesis workflows for audiobooks, podcasts, or media production.
- Embeddable audio player generation: Enable your agent to generate AudioNative projects, creating customizable, embeddable audio players from your content with just a prompt.
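
As a concrete illustration of the chapter conversion capability, the sketch below builds the JSON arguments an agent might pass to `ELEVENLABS_CONVERT_CHAPTER`. The field names `project_id` and `chapter_id` are assumptions inferred from the tool description, the helper `build_convert_chapter_args` is hypothetical, and the IDs are placeholders; check the tool's actual schema before relying on any of them. The sketch deliberately stops short of invoking the CLI.

```bash
#!/bin/sh
# build_convert_chapter_args PROJECT_ID CHAPTER_ID
# Prints a JSON object intended for ELEVENLABS_CONVERT_CHAPTER.
# ASSUMPTION: the field names project_id and chapter_id are inferred from the
# tool description and may differ from the real schema.
build_convert_chapter_args() {
  for id in "$1" "$2"; do
    case "$id" in
      ''|*[!A-Za-z0-9_-]*)
        # Conservative check so the value can be embedded without JSON escaping.
        echo "error: IDs must be non-empty and use only letters, digits, '_' or '-'" >&2
        return 1
        ;;
    esac
  done
  printf '{"project_id":"%s","chapter_id":"%s"}\n' "$1" "$2"
}

# Example with placeholder IDs (not real resources).
build_convert_chapter_args "proj_123" "chap_456"
```

Restricting IDs to a small character set is a design shortcut: it avoids hand-written JSON escaping in shell, at the cost of rejecting IDs that contain other characters.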

## Supported Tools

| Tool slug | Name | Description |
|---|---|---|
| `ELEVENLABS_ADD_A_PRONUNCIATION_DICTIONARY` | Add a pronunciation dictionary from file | Adds a new pronunciation dictionary from a lexicon file to improve speech synthesis accuracy. |
| `ELEVENLABS_ADD_PROJECT` | Add new project with attributes | Creates a new Elevenlabs project for text-to-speech synthesis (e.g., audiobooks); the API requires a project `name`, and content can be initialized using `from url` or `from document`. |
| `ELEVENLABS_ADD_RULES_TO_THE_PRONUNCIATION_DICTIONARY` | Add rules to the pronunciation dictionary | Adds one or more custom pronunciation rules (alias or phoneme) to an existing pronunciation dictionary. |
| `ELEVENLABS_ADD_SHARING_VOICE` | Add sharing voice | Adds an existing, shareable voice to a specified user's Elevenlabs account library under a new custom name, requiring the user's public ID and the voice ID. |
| `ELEVENLABS_ADD_VOICE` | Add a voice | Adds a custom voice, requiring a `name` and a `files` list with at least one audio sample, to initiate cloning; returns `voice id` but voice is not immediately usable for synthesis. |
| `ELEVENLABS_CONVERT_CHAPTER` | Convert chapter to audio | Converts the textual content of a chapter, identified by `chapter id` within a `project id`, into audio format. |
| `ELEVENLABS_CONVERT_PROJECT` | Convert a project | Converts an existing Elevenlabs Studio project, including all its chapters and using its configured settings and voices, into speech. |
| `ELEVENLABS_CREATE_A_PREVIOUSLY_GENERATED_VOICE` | Create a previously generated voice | Finalizes the creation of a voice using its `generated voice id` from a previous generation step by assigning a name, description, and optional labels. |
| `ELEVENLABS_CREATES_AUDIONATIVE_ENABLED_PROJECT` | Create an AudioNative enabled project | Creates an Elevenlabs AudioNative project, generating an embeddable audio player from a provided content file using text-to-speech, allowing customization of player appearance, audio settings, and conversion options. |
| `ELEVENLABS_DELETE_CHAPTER` | Delete chapter from project | Irreversibly deletes a specific, existing chapter from an existing project, typically to remove unwanted or obsolete content. |
| `ELEVENLABS_DELETE_DUBBING_PROJECT` | Delete a dubbing project | Permanently deletes a dubbing project by its id; this action is irreversible and the project cannot be recovered. |
| `ELEVENLABS_DELETE_HISTORY_ITEM` | Delete history item | Permanently deletes a specific history item (including its audio file and metadata) using its `history item id`; this operation is irreversible and should be used with caution. |
| `ELEVENLABS_DELETE_PROJECT` | Delete project by id | Use to irreversibly delete a specific project by its `project id`; the project must exist and be accessible, and this action cannot be undone. |
| `ELEVENLABS_DELETE_SAMPLE` | Delete voice sample | Permanently deletes a specific voice sample for a given voice id; this action is irreversible. |
| `ELEVENLABS_DELETE_VOICE` | Delete voice by id | Permanently and irreversibly deletes a specific custom voice using its `voice id`; the voice must exist and the authenticated user must have permission to delete it. |
| `ELEVENLABS_DOWNLOAD_HISTORY_ITEMS` | Download history items | Downloads audio clips from history by id(s), returning a single file or a zip archive, with an optional output format (e.g., 'wav'); provides only audio content, no metadata. |
| `ELEVENLABS_DUB_A_VIDEO_OR_AN_AUDIO_FILE` | Dub a video or an audio file | Dub a video or audio file into a specified target language, requiring 'file' or 'source url', 'target lang', and 'csv file' if 'mode' is 'manual'. |
| `ELEVENLABS_EDIT_VOICE` | Edit voice | Updates the name, audio files, description, or labels for an existing voice model specified by `voice id`. |
| `ELEVENLABS_EDIT_VOICE_SETTINGS` | Edit voice settings | Edits key voice settings (e.g., stability, similarity enhancement, style exaggeration, speaker boost) for an existing voice, affecting all future audio generated with that voice id. |
| `ELEVENLABS_ELEVENLABS_TEXT_TO_SPEECH` | Text to speech | Converts text to speech using a specified Elevenlabs voice and model, returning a downloadable audio file. |
| `ELEVENLABS_ELEVENLABS_TEXT_TO_SPEECH_STREAM` | Text to speech stream | Converts text to a spoken audio stream, allowing latency optimization, specific output formats (some tier-dependent), and custom pronunciations; ensure the chosen model supports text-to-speech and text is preferably under 5000 characters. |
| `ELEVENLABS_GENERATE_A_RANDOM_VOICE` | Generate a random voice | Generates a unique, random Elevenlabs text-to-speech voice based on input text and specified voice characteristics. |
| `ELEVENLABS_GET_A_PROFILE_PAGE` | Get user profile by handle | Retrieves the public profile information for an existing Elevenlabs user based on their unique handle. |
| `ELEVENLABS_GET_AUDIO_FROM_HISTORY_ITEM` | Get audio from history item | Retrieves the audio content for a specific history item from Elevenlabs, using a `history item id` that must correspond to a previously generated audio clip. |
| `ELEVENLABS_GET_AUDIO_FROM_SAMPLE` | Get sample audio | Retrieves the audio for a given `sample id` that must belong to the specified `voice id`. |
| `ELEVENLABS_GET_CHAPTER_BY_ID` | Get chapter by ID | Fetches comprehensive details for a specific chapter within a given project, including its metadata (name, id), conversion status, progress, download availability, and content statistics. |
| `ELEVENLABS_GET_CHAPTERS` | Get chapters by project id | Retrieves a list of all chapters, their details, and conversion status for a project, useful for managing content or tracking progress. |
| `ELEVENLABS_GET_CHAPTER_SNAPSHOTS` | Get chapter snapshots | Retrieves all saved version snapshots for a specific chapter within a given project, enabling review of its history or reversion to prior states. |
| `ELEVENLABS_GET_DEFAULT_VOICE_SETTINGS` | Get default voice settings | Retrieves the Elevenlabs text-to-speech service's default voice settings (stability, similarity boost, style, speaker boost) that are applied when no voice-specific or request-specific settings are provided. |
| `ELEVENLABS_GET_DUBBED_FILE` | Get dubbed audio for a language | Retrieves an existing dubbed audio file for a specific `dubbing id` and `language code`. |
| `ELEVENLABS_GET_DUBBING_PROJECT_METADATA` | Get dubbing project metadata | Retrieves metadata and status for a specific dubbing project by its id. |
| `ELEVENLABS_GET_GENERATED_ITEMS` | Get generated items | Retrieves metadata for a list of generated audio items from history, supporting pagination and optional filtering by voice id. |
| `ELEVENLABS_GET_HISTORY_ITEM_BY_ID` | Get history item by ID | Retrieves detailed information (excluding the audio file) for a specific audio generation history item from Elevenlabs, using its unique ID. |
| `ELEVENLABS_GET_METADATA_FOR_A_PRONUNCIATION_DICTIONARY` | Get pronunciation dictionary metadata | Retrieves metadata for a specific, existing pronunciation dictionary from Elevenlabs using its ID. |
| `ELEVENLABS_GET_MODELS` | Get models | Retrieves a detailed list of all available Elevenlabs text-to-speech (TTS) models and their capabilities. |
| `ELEVENLABS_GET_PROJECT_BY_ID` | Get project by ID | Use to retrieve all details for a specific project, including its chapters and their conversion statuses, by providing the project's unique id. |
| `ELEVENLABS_GET_PROJECTS` | Get projects | Fetches a list of all projects and their details associated with the user's Elevenlabs account; this is a read-only operation. |
| `ELEVENLABS_GET_PROJECT_SNAPSHOTS` | Get project snapshots | Retrieves all available snapshots (saved states or versions) for an existing project, enabling history tracking, version comparison, or accessing specific states for playback/processing, particularly in text-to-speech workflows. |
| `ELEVENLABS_GET_PRONUNCIATION_DICTIONARIES` | Get pronunciation dictionaries | Retrieves a paginated list of pronunciation dictionaries, used to customize how specific words or phrases are pronounced by the text-to-speech (TTS) engine. |
| `ELEVENLABS_GET_PRONUNCIATION_DICT_VERSION` | Get pronunciation dictionary version | Downloads the Pronunciation Lexicon Specification (PLS) file for an existing version of a pronunciation dictionary from Elevenlabs, used to customize TTS pronunciation. |
| `ELEVENLABS_GET_SHARED_VOICES` | Get shared voices | Retrieves a paginated and filterable list of shared voices from the Elevenlabs voice library. |
| `ELEVENLABS_GET_SSO_PROVIDER_ADMIN` | Get SSO provider admin | Retrieves the SSO provider configuration for a specified workspace, typically for review purposes, and will indicate if no configuration exists. |
| `ELEVENLABS_GET_TRANSCRIPT_FOR_DUB` | Get dubbing transcript by language | Retrieves the textual transcript for a specified dubbing project and language, if one exists for that language in the project. |
| `ELEVENLABS_GET_USER_INFO` | Get user info | Retrieves detailed information about the authenticated Elevenlabs user's account, including subscription, usage, API key, and status. |
| `ELEVENLABS_GET_USER_SUBSCRIPTION_INFO` | Get user subscription info | Retrieves detailed subscription information for the currently authenticated Elevenlabs user. |
| `ELEVENLABS_GET_VOICE` | Get voice | Retrieves comprehensive details for a specific, existing voice by its `voice id`, optionally including its settings. |
| `ELEVENLABS_GET_VOICES` | Get voices list | Retrieves a list of all available voices along with their detailed attributes and settings. |
| `ELEVENLABS_GET_VOICE_SETTINGS` | Get voice settings | Retrieves the stability, similarity, style, and speaker boost settings for a specific, existing Elevenlabs voice using its `voice id`. |
| `ELEVENLABS_REDIRECT_TO_MINTLIFY` | Get API documentation | Retrieves the content of the official Elevenlabs API documentation page hosted on Mintlify. |
| `ELEVENLABS_REMOVE_RULES_FROM_THE_PRONUNCIATION_DICTIONARY` | Remove rules from pronunciation dictionary | Permanently removes exact-match pronunciation rules from a specified Elevenlabs pronunciation dictionary using a list of rule strings; non-matching rule strings are ignored and this action cannot add or modify rules. |
| `ELEVENLABS_SPEECH_TO_SPEECH` | Speech to speech | Converts an input audio file to speech using a specified voice; if a `model id` is provided, it must support speech-to-speech conversion. |
| `ELEVENLABS_SPEECH_TO_SPEECH_STREAMING` | Speech to speech streaming | Converts an input audio stream to a different voice output stream in real-time, using a specified speech-to-speech model. |
| `ELEVENLABS_STREAM_CHAPTER_AUDIO` | Stream chapter audio | Streams the audio for a specified chapter snapshot from an Elevenlabs project, optionally converting the output to MPEG format. |
| `ELEVENLABS_STREAM_PROJECT_AUDIO` | Stream project audio | Streams audio from a specific project snapshot, optionally converting it to MPEG format. |
| `ELEVENLABS_STREAMS_ARCHIVE_WITH_PROJECT_AUDIO` | Archive project snapshot | Archives an existing project snapshot by its id, creating a permanent, immutable, and typically irreversible copy of its state. |
| `ELEVENLABS_UPDATE_PRONUNCIATION_DICTIONARIES` | Update project pronunciation dictionaries | Updates a project's pronunciation dictionaries on Elevenlabs to improve text-to-speech accuracy for specialized terms; note that while multiple dictionaries can be applied, the UI only displays the first. |
| `ELEVENLABS_VOICE_GENERATION_PARAMETERS` | Voice generation parameters retrieval | Fetches configurable parameters for Elevenlabs voice generation, used to determine available settings (e.g., accents, character limits) prior to audio synthesis; returns metadata only, not audio. |
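
Several tools in the table are described as irreversible, and the streaming text-to-speech tool notes that text is preferably under 5000 characters. A wrapper script could encode both as pre-flight checks before an agent runs a tool. The sketch below is one way to do that in plain shell; the helper names `is_destructive` and `within_tts_limit` are hypothetical, and the prefix match is a heuristic based only on the slugs listed above (for example, it does not catch the archive tool, which is also described as typically irreversible).

```bash
#!/bin/sh
# is_destructive SLUG: succeed if the slug names a delete/remove tool.
# Hypothetical helper; the prefixes mirror the slugs in the table above.
is_destructive() {
  case "$1" in
    ELEVENLABS_DELETE_*|ELEVENLABS_REMOVE_*) return 0 ;;
    *) return 1 ;;
  esac
}

# within_tts_limit TEXT: succeed if TEXT is under 5000 characters,
# the preferred ceiling noted for the text-to-speech stream tool.
# Note: some shells count bytes rather than characters for ${#var}.
within_tts_limit() {
  [ "${#1}" -lt 5000 ]
}

# Usage: ask for confirmation before an irreversible call.
slug="ELEVENLABS_DELETE_VOICE"
if is_destructive "$slug"; then
  echo "confirm before running: $slug"
fi
```

Keeping these checks outside the agent's prompt means they apply even when the agent misreads a tool description.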

## Supported Triggers

None listed.

## Conclusion

- Try asking your coding agent to perform various Elevenlabs operations
- Explore cross-app workflows by connecting more toolkits
- Set up triggers for real-time automation
- Use `composio generate` for typed schemas in your projects

## Related Toolkits

- [Youtube](https://composio.dev/toolkits/youtube) - YouTube is a leading video-sharing platform for uploading, streaming, and discovering content. It empowers creators and businesses to reach global audiences and monetize their work.
- [Amara](https://composio.dev/toolkits/amara) - Amara is a collaborative platform for creating and managing subtitles and captions for videos. It helps make content accessible and multilingual for global audiences.
- [Cats](https://composio.dev/toolkits/cats) - Cats is an API with a huge library of cat images, breed data, and cat facts. It makes finding adorable cat photos and trivia effortless for your apps and users.
- [Chatfai](https://composio.dev/toolkits/chatfai) - Chatfai is an AI platform that lets users talk to AI versions of fictional characters from books, movies, and games. It offers an engaging, interactive experience for fans to chat, roleplay, and explore creative dialogues.
- [Cincopa](https://composio.dev/toolkits/cincopa) - Cincopa is a multimedia platform for uploading, managing, and customizing videos, images, and audio. It helps you deliver engaging media experiences with robust APIs and flexible integrations.
- [Dungeon fighter online](https://composio.dev/toolkits/dungeon_fighter_online) - Dungeon Fighter Online (DFO) is an arcade-style, side-scrolling action RPG packed with dynamic combat and progression. Play solo or with friends to battle monsters, complete quests, and upgrade your characters.
- [Elevenreader](https://composio.dev/toolkits/elevenreader) - Elevenreader is an AI-powered text-to-speech service by ElevenLabs that converts written content into lifelike audio. It enables fast, natural audio generation from any text.
- [Epic games](https://composio.dev/toolkits/epic_games) - Epic Games is a leading video game publisher and digital storefront, known for Fortnite and Unreal Engine. It lets gamers access, manage, and purchase games all in one place.
- [Fal.ai](https://composio.dev/toolkits/fal_ai) - Fal.ai is a generative media platform offering 600+ AI models for images, video, voice, and audio. Developers use Fal.ai for fast, scalable access to cutting-edge generative AI tools.
- [Giphy](https://composio.dev/toolkits/giphy) - Giphy is the largest online library for searching and sharing GIFs and stickers. Instantly add vibrant animated content to your apps, chats, and workflows.
- [Headout](https://composio.dev/toolkits/headout) - Headout is a global platform for booking travel experiences, tours, and entertainment. It helps users discover and secure activities at top destinations, all in one place.
- [Imagekit io](https://composio.dev/toolkits/imagekit_io) - ImageKit.io is a cloud-based media management platform for image and video delivery. Instantly optimize, transform, and deliver visuals globally via a lightning-fast CDN.
- [Listennotes](https://composio.dev/toolkits/listennotes) - Listennotes is a powerful podcast search engine with a massive global database. Discover, search, and curate podcasts from around the world in seconds.
- [News api](https://composio.dev/toolkits/news_api) - News api is a REST API for searching and retrieving live news articles from across the web. Instantly access headlines, coverage, and breaking stories from thousands of sources.
- [RAWG Video Games Database](https://composio.dev/toolkits/rawg_video_games_database) - RAWG Video Games Database is the largest video game discovery and info service. Instantly access comprehensive details, ratings, and release dates for thousands of games.
- [Seat geek](https://composio.dev/toolkits/seat_geek) - SeatGeek is a live event platform offering APIs for concerts, sports, and theater data. Instantly access events, venues, and performers info for smarter ticketing and discovery.
- [Shotstack](https://composio.dev/toolkits/shotstack) - Shotstack is a cloud platform for programmatically generating videos, images, and audio. Automate creative content production at scale with flexible RESTful APIs.
- [Spotify](https://composio.dev/toolkits/spotify) - Spotify is a streaming service for music and podcasts with millions of tracks from artists worldwide. Enjoy personalized playlists, recommendations, and seamless listening across all your devices.
- [Ticketmaster](https://composio.dev/toolkits/ticketmaster) - Ticketmaster is a global platform for event discovery, ticket sales, and live entertainment management. Get real-time access to events and streamline ticketing for fans and organizers.
- [Gmail](https://composio.dev/toolkits/gmail) - Gmail is Google's email service with powerful spam protection, search, and Google Workspace integration. It keeps your inbox organized and makes communication fast and reliable.

## Frequently Asked Questions

### What is the Composio Universal CLI?

The Composio Universal CLI is a single command-line interface that lets coding agents and developers interact with 1000+ SaaS applications. It handles authentication, tool discovery, action execution, and trigger setup, all from the terminal and without configuring MCP servers.

### Which coding agents work with the Composio CLI?

Any coding agent that can run shell commands works with the Composio CLI, including Claude Code, Codex, OpenCode, and OpenClaw. Once the CLI is installed, agents automatically discover and use the `composio` commands to interact with Elevenlabs and other connected apps.

### How is the CLI different from using an MCP server for Elevenlabs?

MCP servers require configuration and can be token-heavy for complex workflows. The CLI gives agents a direct, lightweight interface with no server setup. Agents simply call `composio` commands like any other shell tool. It's faster to set up, more reliable for multi-step tool chaining, and works natively with how coding agents already operate.

### How safe is my Elevenlabs data when using the Composio CLI?

All sensitive data such as tokens, keys, and configuration is fully encrypted at rest and in transit. Composio is SOC 2 Type 2 compliant and follows strict security practices so your Elevenlabs data and credentials are handled as safely as possible. You can also bring your own OAuth credentials for full control.

---
[See all toolkits](https://composio.dev/toolkits) · [Composio docs](https://docs.composio.dev/llms.txt)
