# How to integrate Diffbot MCP with Hermes

```json
{
  "title": "How to integrate Diffbot MCP with Hermes",
  "toolkit": "Diffbot",
  "toolkit_slug": "diffbot",
  "framework": "Hermes",
  "framework_slug": "hermes-agent",
  "url": "https://composio.dev/toolkits/diffbot/framework/hermes-agent",
  "markdown_url": "https://composio.dev/toolkits/diffbot/framework/hermes-agent.md",
  "updated_at": "2026-05-06T08:08:39.262Z"
}
```

## Introduction

Hermes is a 24/7 autonomous agent that lives on your computer or server — it remembers what it learns and evolves as your usage grows.
This guide explains the easiest and most robust way to connect your Diffbot account to Hermes. You can do this through either Composio Connect CLI or Composio Connect MCP. For personal use we recommend the CLI, but you won't go wrong with MCP either.

## Also integrate Diffbot with

- [OpenAI Agents SDK](https://composio.dev/toolkits/diffbot/framework/open-ai-agents-sdk)
- [Claude Agent SDK](https://composio.dev/toolkits/diffbot/framework/claude-agents-sdk)
- [Claude Code](https://composio.dev/toolkits/diffbot/framework/claude-code)
- [Claude Cowork](https://composio.dev/toolkits/diffbot/framework/claude-cowork)
- [Codex](https://composio.dev/toolkits/diffbot/framework/codex)
- [OpenClaw](https://composio.dev/toolkits/diffbot/framework/openclaw)
- [CLI](https://composio.dev/toolkits/diffbot/framework/cli)
- [Google ADK](https://composio.dev/toolkits/diffbot/framework/google-adk)
- [LangChain](https://composio.dev/toolkits/diffbot/framework/langchain)
- [Vercel AI SDK](https://composio.dev/toolkits/diffbot/framework/ai-sdk)
- [Mastra AI](https://composio.dev/toolkits/diffbot/framework/mastra-ai)
- [LlamaIndex](https://composio.dev/toolkits/diffbot/framework/llama-index)
- [CrewAI](https://composio.dev/toolkits/diffbot/framework/crew-ai)

## TL;DR

### What is Composio Connect?
Composio Connect is a consumer offering that lets anyone plug 1,000+ applications directly into their agent harness — including Hermes. It can:
- Search and load tools from relevant toolkits on-demand, reducing context usage.
- Chain multiple tools to accomplish complex workflows via a remote workbench, without excessive back-and-forth with the LLM.
- Manage app authentication end-to-end with zero manual overhead.

## Connect Diffbot to Hermes

### Integrating Diffbot with Hermes
### Using Composio Connect CLI
1. Install the Composio CLI
Run the install script directly, or paste https://composio.dev/hermes into your Hermes chat box to have it installed for you.

```bash
curl -fsSL https://composio.dev/install | bash
```

## What is the Diffbot MCP server, and what's possible with it?

The Diffbot MCP server is an implementation of the Model Context Protocol that connects your AI agent and assistants like Claude, Cursor, etc directly to your Diffbot account. It provides structured and secure access to web data extraction and analysis, so your agent can extract structured data from web pages, analyze content types, retrieve product details, manage bulk jobs, and search extracted datasets on your behalf.
- Automatic content analysis and extraction: Let your agent analyze any web page and automatically extract structured data such as articles, products, events, images, or videos using AI-powered tools.
- Article and discussion thread extraction: Effortlessly pull detailed metadata, authors, publication dates, and full discussion threads from news sites, blogs, forums, and comment sections.
- Product and event data gathering: Instantly extract comprehensive product specifications, pricing, reviews, and event information including venues, dates, and descriptions from e-commerce or event pages.
- Bulk job management and search: Enable your agent to list, monitor, and search across large-scale crawl or extraction jobs, making it easy to work with massive web data collections.
- Account and usage insights: Retrieve your Diffbot account details, plan information, and usage statistics to stay on top of quotas and manage your web data operations efficiently.

## Supported Tools

| Tool slug | Name | Description |
|---|---|---|
| `DIFFBOT_DIFFBOT_SEARCH` | Diffbot Search | Tool to search data extracted by crawl or bulk jobs using dql queries. use after data extraction jobs complete to retrieve search results. |
| `DIFFBOT_GET_ACCOUNT` | Get Diffbot Account Details | Tool to retrieve account details, including plan information and usage statistics. use after authenticating to verify subscription and daily quota status. |
| `DIFFBOT_GET_ANALYZE` | Diffbot Analyze | Tool to automatically determine a page's content type and route it to the appropriate extraction api. use when you have only a url and need diffbot to choose the right extractor. |
| `DIFFBOT_GET_ARTICLE` | Get Article Data | Tool to extract information from articles, including authors, publication dates, and images. use when you need structured metadata from a web article url. |
| `DIFFBOT_GET_DISCUSSION` | Get Discussion Thread | Tool to extract threads of content from forums, comment sections, and review pages. use when you need structured discussion data from web pages after identifying the discussion url. |
| `DIFFBOT_GET_EVENT` | Diffbot Get Event | Tool to extract event details from web pages. use when you need structured event data such as venue, date, and description. |
| `DIFFBOT_GET_IMAGE` | Diffbot Get Image | Tool to extract detailed information about images, including dimensions and recognition data. use after confirming the image url is publicly accessible. |
| `DIFFBOT_GET_PRODUCT` | Diffbot Get Product | Tool to extract product information such as specifications, prices, availability, and reviews. use when you need structured product data including specs, pricing, and reviews. |
| `DIFFBOT_GET_VIDEO` | Get Video Data | Tool to extract information from videos, including titles, descriptions, and embedded html. use when you need structured video metadata from any web page. |
| `DIFFBOT_LIST_BULK_JOBS` | List Bulk Jobs | Tool to list all bulk jobs associated with a specific token. use after authenticating to retrieve statuses of all jobs for the account. |
| `DIFFBOT_RESOLVE_LOST_ID` | Resolve Lost ID | Tool to resolve lost ids in the knowledge graph. use when you need to map a lost identifier to its canonical counterpart for data consistency. |
| `DIFFBOT_START_BULK` | Start Bulk Job | Tool to start a bulk extract job. use when processing large numbers of urls asynchronously. |
| `DIFFBOT_START_CRAWL` | Start Crawl Job | Tool to spider a site for links and process them with the extract api into a single collection. use when you have seed urls and want to collect structured data across a site. requires a plus plan for crawl api access. |
| `DIFFBOT_STOP_BULK_JOB` | Stop Bulk Job | Tool to stop a running bulk job. use when you need to halt further processing of urls in a job in progress. invoke only after confirming the jobid to avoid accidental stoppage. |

## Supported Triggers

None listed.

## Creating MCP Server - Stand-alone vs Composio SDK

The Diffbot MCP server provides comprehensive access to Diffbot operations through Composio. Once connected, Hermes can perform all major Diffbot actions on your behalf using natural language commands.

## Complete Code

None listed.

## Conclusion

### Way Forward
With Diffbot connected, Hermes can now act on your behalf whenever it detects a relevant task or you ask it to.
From here, you can extend Hermes further:
- Connect more apps: Calendar, Slack, Notion, Linear, and hundreds of others are available through the same Composio Connect setup. Each new integration compounds what Hermes can do for you.
- Build workflows across tools: Once multiple apps are connected, Hermes can chain actions together — turn an email into a calendar invite, a Slack message into a Linear ticket, or a meeting note into a follow-up draft.
- Let it learn your patterns: The more you use Hermes, the better it gets at anticipating how you'd handle recurring tasks. Give it feedback on drafts and decisions, and it will adapt.
If you run into trouble or want to share what you've built, join the [community](https://discord.com/invite/composio) or check out the [Docs](https://docs.composio.dev?utm_source=toolkits&utm_medium=framework_template&utm_campaign=hermes&utm_content=docs) for deeper configuration options.

## How to build Diffbot MCP Agent with another framework

- [OpenAI Agents SDK](https://composio.dev/toolkits/diffbot/framework/open-ai-agents-sdk)
- [Claude Agent SDK](https://composio.dev/toolkits/diffbot/framework/claude-agents-sdk)
- [Claude Code](https://composio.dev/toolkits/diffbot/framework/claude-code)
- [Claude Cowork](https://composio.dev/toolkits/diffbot/framework/claude-cowork)
- [Codex](https://composio.dev/toolkits/diffbot/framework/codex)
- [OpenClaw](https://composio.dev/toolkits/diffbot/framework/openclaw)
- [CLI](https://composio.dev/toolkits/diffbot/framework/cli)
- [Google ADK](https://composio.dev/toolkits/diffbot/framework/google-adk)
- [LangChain](https://composio.dev/toolkits/diffbot/framework/langchain)
- [Vercel AI SDK](https://composio.dev/toolkits/diffbot/framework/ai-sdk)
- [Mastra AI](https://composio.dev/toolkits/diffbot/framework/mastra-ai)
- [LlamaIndex](https://composio.dev/toolkits/diffbot/framework/llama-index)
- [CrewAI](https://composio.dev/toolkits/diffbot/framework/crew-ai)

## Related Toolkits

- [Excel](https://composio.dev/toolkits/excel) - Microsoft Excel is a robust spreadsheet application for organizing, analyzing, and visualizing data. It's the go-to tool for calculations, reporting, and flexible data management.
- [21risk](https://composio.dev/toolkits/_21risk) - 21RISK is a web app built for easy checklist, audit, and compliance management. It streamlines risk processes so teams can focus on what matters.
- [Abstract](https://composio.dev/toolkits/abstract) - Abstract provides a suite of APIs for automating data validation and enrichment tasks. It helps developers streamline workflows and ensure data quality with minimal effort.
- [Addressfinder](https://composio.dev/toolkits/addressfinder) - Addressfinder is a data quality platform for verifying addresses, emails, and phone numbers. It helps you ensure accurate customer and contact data every time.
- [Agentql](https://composio.dev/toolkits/agentql) - Agentql is a toolkit that connects AI agents to the web using a specialized query language. It enables structured web interaction and data extraction for smarter automations.
- [Agenty](https://composio.dev/toolkits/agenty) - Agenty is a web scraping and automation platform for extracting data and automating browser tasks—no coding needed. It streamlines data collection, monitoring, and repetitive online actions.
- [Ambee](https://composio.dev/toolkits/ambee) - Ambee is an environmental data platform providing real-time, hyperlocal APIs for air quality, weather, and pollen. Get precise environmental insights to power smarter decisions in your apps and workflows.
- [Ambient weather](https://composio.dev/toolkits/ambient_weather) - Ambient Weather is a platform for personal weather stations with a robust API for accessing local, real-time, and historical weather data. Get detailed environmental insights directly from your own sensors for smarter apps and automations.
- [Anonyflow](https://composio.dev/toolkits/anonyflow) - Anonyflow is a service for encryption-based data anonymization and secure data sharing. It helps organizations meet GDPR, CCPA, and HIPAA data privacy compliance requirements.
- [Api ninjas](https://composio.dev/toolkits/api_ninjas) - Api ninjas offers 120+ public APIs spanning categories like weather, finance, sports, and more. Developers use it to supercharge apps with real-time data and actionable endpoints.
- [Api sports](https://composio.dev/toolkits/api_sports) - Api sports is a comprehensive sports data platform covering 2,000+ competitions with live scores and 15+ years of stats. Instantly access up-to-date sports information for analysis, apps, or chatbots.
- [Apify](https://composio.dev/toolkits/apify) - Apify is a cloud platform for building, deploying, and managing web scraping and automation tools called Actors. It lets you automate data extraction and workflow tasks at scale—no infrastructure headaches.
- [Autom](https://composio.dev/toolkits/autom) - Autom is a lightning-fast search engine results data platform for Google, Bing, and Brave. Developers use it to access fresh, low-latency SERP data on demand.
- [Beaconchain](https://composio.dev/toolkits/beaconchain) - Beaconchain is a real-time analytics platform for Ethereum 2.0's Beacon Chain. It provides detailed insights into validators, blocks, and overall network performance.
- [Big data cloud](https://composio.dev/toolkits/big_data_cloud) - BigDataCloud provides APIs for geolocation, reverse geocoding, and address validation. Instantly access reliable location intelligence to enhance your applications and workflows.
- [Bigpicture io](https://composio.dev/toolkits/bigpicture_io) - BigPicture.io offers APIs for accessing detailed company and profile data. Instantly enrich your applications with up-to-date insights on 20M+ businesses.
- [Bitquery](https://composio.dev/toolkits/bitquery) - Bitquery is a blockchain data platform offering indexed, real-time, and historical data from 40+ blockchains via GraphQL APIs. Get unified, reliable access to complex on-chain data for analytics, trading, and research.
- [Brightdata](https://composio.dev/toolkits/brightdata) - Brightdata is a leading web data platform offering advanced scraping, SERP APIs, and anti-bot tools. It lets you collect public web data at scale, bypassing blocks and friction.
- [Builtwith](https://composio.dev/toolkits/builtwith) - BuiltWith is a web technology profiler that uncovers the technologies powering any website. Gain actionable insights into analytics, hosting, and content management stacks for smarter research and lead generation.
- [Byteforms](https://composio.dev/toolkits/byteforms) - Byteforms is an all-in-one platform for creating forms, managing submissions, and integrating data. It streamlines workflows by centralizing form data collection and automation.

## Frequently Asked Questions

### What are the differences in Tool Router MCP and Diffbot MCP?

With a standalone Diffbot MCP server, the agents and LLMs can only access a fixed set of Diffbot tools tied to that server. However, with the Composio Tool Router, agents can dynamically load tools from Diffbot and many other apps based on the task at hand, all through a single MCP endpoint.

### Can I use Tool Router MCP with Hermes?

Yes, you can. Hermes fully supports MCP integration. You get structured tool calling, message history handling, and model orchestration while Tool Router takes care of discovering and serving the right Diffbot tools.

### Can I manage the permissions and scopes for Diffbot while using Tool Router?

Yes, absolutely. You can configure which Diffbot scopes and actions are allowed when connecting your account to Composio. You can also bring your own OAuth credentials or API configuration so you keep full control over what the agent can do.

### How safe is my data with Composio Tool Router?

All sensitive data such as tokens, keys, and configuration is fully encrypted at rest and in transit. Composio is SOC 2 Type 2 compliant and follows strict security practices so your Diffbot data and credentials are handled as safely as possible.

---
[See all toolkits](https://composio.dev/toolkits) · [Composio docs](https://docs.composio.dev/llms.txt)
