# Supadata

```json
{
  "name": "Supadata",
  "slug": "supadata",
  "url": "https://composio.dev/toolkits/supadata",
  "markdown_url": "https://composio.dev/toolkits/supadata.md",
  "logo_url": "https://logos.composio.dev/api/supadata",
  "categories": [
    "data & analytics"
  ],
  "is_composio_managed": false,
  "updated_at": "2026-05-12T10:27:29.162Z"
}
```

![Supadata logo](https://logos.composio.dev/api/supadata)

## Description

Securely connect your AI agents and chatbots (Claude, ChatGPT, Cursor, etc) with Supadata MCP or direct API to extract transcripts from video, parse structured data from web pages, summarize multimedia content, and automate content tagging through natural language.

## Summary

Supadata is a web and video-to-text API that extracts structured data from videos and websites. It enables fast, automated content parsing for AI training and analysis.

## Categories

- data & analytics

## Toolkit Details

- Tools: 11

## Images

- Logo: https://logos.composio.dev/api/supadata

## Authentication

- **Api Key**
  - Type: `api_key`
  - Description: Api Key authentication for Supadata.
  - Setup:
    - Configure Api Key credentials for Supadata.
    - Use the credentials when creating an auth config in Composio.

## Suggested Prompts

- Extract all URLs from a competitor website
- Scrape the main content from a blog post
- List videos uploaded by a YouTube channel
- Get detailed metadata for a specific YouTube video

## Supported Tools

| Tool slug | Name | Description |
|---|---|---|
| `SUPADATA_GET_ME` | Get Account Information | Retrieve organization details, plan information, and credit usage for the authenticated account. Use this to check account status, credit balance, and subscription plan. |
| `SUPADATA_GET_TRANSCRIPT` | Get Video Transcript | Get transcript from a video on supported platforms or from a file URL. Use this tool when you need to: - Retrieve transcripts from YouTube, TikTok, Twitter, Instagram, or Facebook videos - Extract transcript from video file URLs - Get transcripts in specific languages - Choose between native captions or AI-generated transcripts For large videos, the API returns a job ID. You'll need to poll the /transcript/{jobId} endpoint to get the final result. For smaller videos, the transcript is returned immediately. The response varies based on the 'text' parameter: - text=true: Returns plain text transcript as a string - text=false: Returns structured list of chunks with timestamps and text |
| `SUPADATA_GET_TRANSCRIPT_BY_JOB_ID` | Get Transcript by Job ID | Get results for a transcript job by job ID. Use this to retrieve results of asynchronous transcript processing. Check the status field to determine if the job is complete. |
| `SUPADATA_GET_WEB_MAP` | Website URL Map | Crawl a website and extract all URLs/links found on it. Use this to create a sitemap, discover all pages of a website, or prepare for content scraping across multiple pages. Returns a list of all URLs discovered during the crawl. |
| `SUPADATA_GET_WEB_SCRAPE` | Scrape Web Page | Extract content from any webpage and convert it to Markdown format. Use this tool when you need to: - Read and extract text content from a web page - Get metadata (title, description) from a webpage - Discover links on a webpage - Get character counts of page content The tool returns the page content in Markdown format, making it easy to process and understand. One API credit is consumed per scrape request. |
| `SUPADATA_GET_YOUTUBE_CHANNEL` | Get YouTube Channel Metadata | Fetches metadata for a YouTube channel including name, description, subscriber count, video count, view count, and channel images. Accepts channel IDs, handles, or full YouTube URLs. |
| `SUPADATA_GET_YOUTUBE_CHANNEL_VIDEOS` | Get YouTube Channel Videos | Tool to retrieve video IDs from a YouTube channel. Returns lists of regular video IDs, YouTube Shorts IDs, and live stream IDs. Videos are ordered latest first. Use this to get video IDs that can be passed to the Get YouTube Video endpoint for full metadata. |
| `SUPADATA_GET_YOUTUBE_PLAYLIST` | Get YouTube Playlist Metadata | Retrieve metadata for a YouTube playlist including title, description, video count, view count, last updated timestamp, and channel information. Use this when you need playlist-level information. For the list of videos in a playlist, use the Get YouTube Playlist Videos action instead. |
| `SUPADATA_GET_YOUTUBE_PLAYLIST_VIDEOS` | Get YouTube Playlist Videos | Tool to retrieve video IDs from a YouTube playlist. Returns lists of standard video IDs, YouTube Shorts IDs, and live stream IDs contained in the playlist. |
| `SUPADATA_GET_YOUTUBE_VIDEO` | Get YouTube Video Metadata | Retrieve comprehensive metadata for a YouTube video by its ID. Use this tool when you need to get details about a specific YouTube video including: - Title, description, and tags - Channel information - View and like counts - Upload date and duration - Live stream status - Available transcript languages The video_id is the 11-character identifier found in YouTube URLs (e.g., 'dQw4w9WgXcQ' from 'https://youtube.com/watch?v=dQw4w9WgXcQ'). |
| `SUPADATA_SEARCH_YOUTUBE` | Search YouTube | Search YouTube for videos, channels, or playlists by keyword. Returns matching content with metadata like titles, thumbnails, view counts, and channel information. |

## Supported Triggers

None listed.

## Installation and MCP Setup

### Path 1: SDK Installation

#### Path 1, Step 1: Install Composio

Install the Composio SDK
```python
pip install composio_openai
```

```typescript
npm install @composio/openai
```

#### Path 1, Step 2: Initialize Composio and Create Tool Router Session

Import and initialize Composio client, then create a Tool Router session
```python
from openai import OpenAI
from composio import Composio
from composio_openai import OpenAIResponsesProvider

composio = Composio(provider=OpenAIResponsesProvider())
openai = OpenAI()
session = composio.create(user_id='your-user-id')
```

```typescript
import OpenAI from 'openai';
import { Composio } from '@composio/core';
import { OpenAIResponsesProvider } from '@composio/openai';

const composio = new Composio({
  provider: new OpenAIResponsesProvider(),
});
const openai = new OpenAI({});
const session = await composio.create('your-user-id');
```

#### Path 1, Step 3: Execute Supadata Tools via Tool Router with Your Agent

Get tools from Tool Router session and execute Supadata actions with your Agent
```python
tools = session.tools
response = openai.responses.create(
  model='gpt-4.1',
  tools=tools,
  input=[{
    'role': 'user',
    'content': 'Extract all video titles from a YouTube channel about AI news.'
  }]
)
result = composio.provider.handle_tool_calls(
  response=response,
  user_id='your-user-id'
)
print(result)
```

```typescript
const tools = session.tools;
const response = await openai.responses.create({
  model: 'gpt-4.1',
  tools: tools,
  input: [{
    role: 'user',
    content: 'Extract all video titles from a YouTube channel about AI news.'
  }],
});
const result = await composio.provider.handleToolCalls(
  'your-user-id',
  response.output
);
console.log(result);
```

### Path 2: MCP Server Setup

#### Path 2, Step 1: Install Composio

Install the Composio SDK and Claude Agent SDK
```python
pip install composio claude-agent-sdk
```

```typescript
npm install @composio/core ai @ai-sdk/openai @ai-sdk/mcp
```

#### Path 2, Step 2: Create Tool Router Session

Initialize the Composio client and create a Tool Router session
```python
from composio import Composio
from claude_agent_sdk import ClaudeSDKClient, ClaudeAgentOptions

composio = Composio(api_key='your-composio-api-key')
session = composio.create(user_id='your-user-id')
url = session.mcp.url
```

```typescript
import { Composio } from '@composio/core';

const composio = new Composio({ apiKey: 'your-api-key' });

console.log("Creating Tool Router session...");
const { mcp } = await composio.create('your-user-id');
console.log(`Tool Router session created: ${mcp.url}`);
```

#### Path 2, Step 3: Connect to AI Agent

Use the MCP server with your AI agent
```python
import asyncio

options = ClaudeAgentOptions(
    permission_mode='bypassPermissions',
    mcp_servers={
        'tool_router': {
            'type': 'http',
            'url': url,
            'headers': {
                'x-api-key': 'your-composio-api-key'
            }
        }
    },
    system_prompt='You are a helpful assistant with access to Supadata tools.',
    max_turns=10
)

async def main():
    async with ClaudeSDKClient(options=options) as client:
        await client.query('Extract main article text from https://techcrunch.com')
        async for message in client.receive_response():
            if hasattr(message, 'content'):
                for block in message.content:
                    if hasattr(block, 'text'):
                        print(block.text)

asyncio.run(main())
```

```typescript
import { openai } from '@ai-sdk/openai';
import { experimental_createMCPClient as createMCPClient } from '@ai-sdk/mcp';
import { generateText, stepCountIs } from 'ai';

const client = await createMCPClient({
  transport: {
    type: 'http',
    url: mcp.url,
    headers: { 'x-api-key': 'your-composio-api-key' }
  }
});

const tools = await client.tools();

const { text } = await generateText({
  model: openai('gpt-4o'),
  tools,
  messages: [{ role: 'user', content: 'Extract main article text from https://techcrunch.com' }],
  stopWhen: stepCountIs( 5 )
});

console.log(`Agent: ${text}`);
```

## Why Use Composio?

### 1. AI Native Supadata Integration

- Supports both Supadata MCP and direct API based integrations
- Structured, LLM-friendly schemas for reliable tool execution
- Rich coverage for extracting, parsing, and summarizing multimedia and web data

### 2. Managed Auth

- Built-in API key handling with secure storage and rotation
- Central place to manage, scope, and revoke Supadata access
- Per user and per environment credentials instead of hard-coded keys

### 3. Agent Optimized Design

- Tools are tuned using real error and success rates to improve reliability over time
- Comprehensive execution logs so you always know what ran, when, and on whose behalf

### 4. Enterprise Grade Security

- Fine-grained RBAC so you control which agents and users can access Supadata
- Scoped, least privilege access to Supadata resources
- Full audit trail of agent actions to support review and compliance

## Use Supadata with any AI Agent Framework

Choose a framework you want to connect Supadata with:

- [ChatGPT](https://composio.dev/toolkits/supadata/framework/chatgpt)
- [OpenAI Agents SDK](https://composio.dev/toolkits/supadata/framework/open-ai-agents-sdk)
- [Claude Agent SDK](https://composio.dev/toolkits/supadata/framework/claude-agents-sdk)
- [Claude Code](https://composio.dev/toolkits/supadata/framework/claude-code)
- [Claude Cowork](https://composio.dev/toolkits/supadata/framework/claude-cowork)
- [Codex](https://composio.dev/toolkits/supadata/framework/codex)
- [Cursor](https://composio.dev/toolkits/supadata/framework/cursor)
- [VS Code](https://composio.dev/toolkits/supadata/framework/vscode)
- [OpenCode](https://composio.dev/toolkits/supadata/framework/opencode)
- [OpenClaw](https://composio.dev/toolkits/supadata/framework/openclaw)
- [Hermes](https://composio.dev/toolkits/supadata/framework/hermes-agent)
- [Google ADK](https://composio.dev/toolkits/supadata/framework/google-adk)
- [LangChain](https://composio.dev/toolkits/supadata/framework/langchain)
- [Vercel AI SDK](https://composio.dev/toolkits/supadata/framework/ai-sdk)
- [Mastra AI](https://composio.dev/toolkits/supadata/framework/mastra-ai)
- [LlamaIndex](https://composio.dev/toolkits/supadata/framework/llama-index)
- [CrewAI](https://composio.dev/toolkits/supadata/framework/crew-ai)
- [Pydantic AI](https://composio.dev/toolkits/supadata/framework/pydantic-ai)
- [AutoGen](https://composio.dev/toolkits/supadata/framework/autogen)

## Related Toolkits

- [Excel](https://composio.dev/toolkits/excel) - Microsoft Excel is a robust spreadsheet application for organizing, analyzing, and visualizing data. It's the go-to tool for calculations, reporting, and flexible data management.
- [21risk](https://composio.dev/toolkits/_21risk) - 21RISK is a web app built for easy checklist, audit, and compliance management. It streamlines risk processes so teams can focus on what matters.
- [Abstract](https://composio.dev/toolkits/abstract) - Abstract provides a suite of APIs for automating data validation and enrichment tasks. It helps developers streamline workflows and ensure data quality with minimal effort.
- [Addressfinder](https://composio.dev/toolkits/addressfinder) - Addressfinder is a data quality platform for verifying addresses, emails, and phone numbers. It helps you ensure accurate customer and contact data every time.
- [Agenty](https://composio.dev/toolkits/agenty) - Agenty is a web scraping and automation platform for extracting data and automating browser tasks—no coding needed. It streamlines data collection, monitoring, and repetitive online actions.
- [Ambee](https://composio.dev/toolkits/ambee) - Ambee is an environmental data platform providing real-time, hyperlocal APIs for air quality, weather, and pollen. Get precise environmental insights to power smarter decisions in your apps and workflows.
- [Ambient weather](https://composio.dev/toolkits/ambient_weather) - Ambient Weather is a platform for personal weather stations with a robust API for accessing local, real-time, and historical weather data. Get detailed environmental insights directly from your own sensors for smarter apps and automations.
- [Anonyflow](https://composio.dev/toolkits/anonyflow) - Anonyflow is a service for encryption-based data anonymization and secure data sharing. It helps organizations meet GDPR, CCPA, and HIPAA data privacy compliance requirements.
- [Api ninjas](https://composio.dev/toolkits/api_ninjas) - Api ninjas offers 120+ public APIs spanning categories like weather, finance, sports, and more. Developers use it to supercharge apps with real-time data and actionable endpoints.
- [Api sports](https://composio.dev/toolkits/api_sports) - Api sports is a comprehensive sports data platform covering 2,000+ competitions with live scores and 15+ years of stats. Instantly access up-to-date sports information for analysis, apps, or chatbots.
- [Apify](https://composio.dev/toolkits/apify) - Apify is a cloud platform for building, deploying, and managing web scraping and automation tools called Actors. It lets you automate data extraction and workflow tasks at scale—no infrastructure headaches.
- [Autom](https://composio.dev/toolkits/autom) - Autom is a lightning-fast search engine results data platform for Google, Bing, and Brave. Developers use it to access fresh, low-latency SERP data on demand.
- [Beaconchain](https://composio.dev/toolkits/beaconchain) - Beaconchain is a real-time analytics platform for Ethereum 2.0's Beacon Chain. It provides detailed insights into validators, blocks, and overall network performance.
- [Big data cloud](https://composio.dev/toolkits/big_data_cloud) - BigDataCloud provides APIs for geolocation, reverse geocoding, and address validation. Instantly access reliable location intelligence to enhance your applications and workflows.
- [Bigpicture io](https://composio.dev/toolkits/bigpicture_io) - BigPicture.io offers APIs for accessing detailed company and profile data. Instantly enrich your applications with up-to-date insights on 20M+ businesses.
- [Bitquery](https://composio.dev/toolkits/bitquery) - Bitquery is a blockchain data platform offering indexed, real-time, and historical data from 40+ blockchains via GraphQL APIs. Get unified, reliable access to complex on-chain data for analytics, trading, and research.
- [Brightdata](https://composio.dev/toolkits/brightdata) - Brightdata is a leading web data platform offering advanced scraping, SERP APIs, and anti-bot tools. It lets you collect public web data at scale, bypassing blocks and friction.
- [Builtwith](https://composio.dev/toolkits/builtwith) - BuiltWith is a web technology profiler that uncovers the technologies powering any website. Gain actionable insights into analytics, hosting, and content management stacks for smarter research and lead generation.
- [Byteforms](https://composio.dev/toolkits/byteforms) - Byteforms is an all-in-one platform for creating forms, managing submissions, and integrating data. It streamlines workflows by centralizing form data collection and automation.
- [Cabinpanda](https://composio.dev/toolkits/cabinpanda) - Cabinpanda is a data collection platform for building and managing online forms. It helps streamline how you gather, organize, and analyze responses.

## Frequently Asked Questions

### Do I need my own developer credentials to use Supadata with Composio?

Yes, Supadata requires you to configure your own API key credentials. Once set up, Composio handles secure credential storage and API request handling for you.

### Can I use multiple toolkits together?

Yes! Composio's Tool Router enables agents to use multiple toolkits. [Learn more](https://docs.composio.dev/tool-router/overview).

### Is Composio secure?

Composio is SOC 2 and ISO 27001 compliant with all data encrypted in transit and at rest. [Learn more](https://trust.composio.dev).

### What if the API changes?

Composio maintains and updates all toolkit integrations automatically, so your agents always work with the latest API versions.

---
[See all toolkits](https://composio.dev/toolkits) · [Composio docs](https://docs.composio.dev/llms.txt)
