Bench MCP for AI Agents

Securely connect your AI agents and chatbots (Claude, ChatGPT, Cursor, etc.) to Bench through Bench MCP or the direct API to run benchmarks, fetch performance metrics, generate comparative reports, and track historical results through natural language.


Bench

No Auth

Bench is a benchmarking tool for automated performance measurement and analysis. It helps you quickly evaluate, compare, and track your systems or workflows.

1 Tool


TOOLS

Supported Tools

Every Bench action and event your agent gets out of the box.

Sleep
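The only tool listed is Sleep, and the prompts later on this page ask the agent to pause for 5 seconds. As a rough local analogue of that behaviour (not Composio's implementation), a pause can be sketched with a promise-based timer. The helper names `sleep` and `timed` are hypothetical and are not part of the Bench toolkit.

```typescript
// Hypothetical local analogue of a "sleep" action; this is not the Bench tool itself.
function sleep(ms: number): Promise<void> {
  if (!Number.isFinite(ms) || ms < 0) {
    return Promise.reject(new RangeError(`invalid duration: ${ms}`));
  }
  return new Promise<void>((resolve) => setTimeout(resolve, ms));
}

// Measure how long an async task takes, in milliseconds.
async function timed<T>(
  task: () => Promise<T>
): Promise<{ value: T; elapsedMs: number }> {
  const start = Date.now();
  const value = await task();
  return { value, elapsedMs: Date.now() - start };
}
```

For example, `timed(() => sleep(5000))` resolves after roughly five seconds and reports the elapsed time, which is the kind of measurement a sleep-based benchmark would record.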

SETUP GUIDE

Connect Bench MCP Tool with your Agent

Install Composio

bash

npm install @composio/core ai @ai-sdk/openai @ai-sdk/mcp

Install the Composio SDK and the AI SDK packages (ai, @ai-sdk/openai, @ai-sdk/mcp)

Create Tool Router Session

typescript

import { Composio } from '@composio/core';

const composio = new Composio({ apiKey: 'your-api-key' });

console.log("Creating Tool Router session...");
const { mcp } = await composio.create('your-user-id');
console.log(`Tool Router session created: ${mcp.url}`);

Initialize the Composio client and create a Tool Router session
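The snippet above hard-codes the API key as a placeholder string. One common alternative is to read it from the environment and fail early when it is missing. The sketch below assumes this pattern: `requireEnv` is a hypothetical helper, and the variable name `COMPOSIO_API_KEY` is an assumption rather than something this page specifies.

```typescript
// Hypothetical helper: fetch a required setting from an environment-like map.
type Env = Record<string, string | undefined>;

function requireEnv(env: Env, name: string): string {
  const value = env[name]?.trim();
  if (!value) {
    throw new Error(`Missing required environment variable: ${name}`);
  }
  return value;
}

// Usage sketch (the variable name is an assumption):
// const composio = new Composio({ apiKey: requireEnv(process.env, 'COMPOSIO_API_KEY') });
```

Taking the environment map as a parameter keeps the helper easy to test without touching the real process environment.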

Connect to AI Agent

typescript

import { openai } from '@ai-sdk/openai';
import { experimental_createMCPClient as createMCPClient } from '@ai-sdk/mcp';
import { generateText, stepCountIs } from 'ai';

// Connect to the Tool Router session created in the previous step (`mcp.url`).
const client = await createMCPClient({
  transport: {
    type: 'http',
    url: mcp.url,
    headers: { 'x-api-key': 'your-composio-api-key' }
  }
});

try {
  // Fetch the MCP tools in the format the AI SDK expects.
  const tools = await client.tools();

  const { text } = await generateText({
    model: openai('gpt-4o'),
    tools,
    messages: [{ role: 'user', content: 'Pause agent execution for 5 seconds' }],
    stopWhen: stepCountIs(5) // cap the tool-calling loop at 5 steps
  });

  console.log(`Agent: ${text}`);
} finally {
  // Release the MCP connection once the run is done.
  await client.close();
}

Use the MCP server with your AI agent
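The transport object passed to `createMCPClient` carries the session URL and an API key header. A small validation step before connecting can catch a malformed URL or an empty key early. The following is a minimal sketch under stated assumptions: `buildTransport` and `HttpTransportConfig` are hypothetical names, and the https-only check is an assumption, not a documented Composio requirement.

```typescript
// Shape of the HTTP transport object used in the snippet above.
interface HttpTransportConfig {
  type: 'http';
  url: string;
  headers: Record<string, string>;
}

// Hypothetical helper: validate inputs before building the transport config.
function buildTransport(sessionUrl: string, apiKey: string): HttpTransportConfig {
  const parsed = new URL(sessionUrl); // throws a TypeError on a malformed URL
  if (parsed.protocol !== 'https:') {
    // Assumption: the API key header should only travel over TLS.
    throw new Error(`Expected an https URL, got ${parsed.protocol}`);
  }
  if (apiKey.length === 0) {
    throw new Error('API key must not be empty');
  }
  return {
    type: 'http',
    url: parsed.toString(),
    headers: { 'x-api-key': apiKey }
  };
}
```

The result could then be passed as the `transport` option, for example `createMCPClient({ transport: buildTransport(mcp.url, apiKey) })`.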

SETUP GUIDE

Connect Bench API Tool with your Agent

Install Composio

bash

npm install @composio/core @composio/openai openai

Install the Composio SDK, its OpenAI provider, and the OpenAI client

Initialize Composio and Create Tool Router Session

typescript

import OpenAI from 'openai';
import { Composio } from '@composio/core';
import { OpenAIResponsesProvider } from '@composio/openai';

const composio = new Composio({
  provider: new OpenAIResponsesProvider(),
});
const openai = new OpenAI({});
const session = await composio.create('your-user-id');

Import and initialize Composio client, then create a Tool Router session

Execute Bench Tools via Tool Router with Your Agent

typescript

const tools = session.tools;
const response = await openai.responses.create({
  model: 'gpt-4.1',
  tools: tools,
  input: [{
    role: 'user',
    content: 'Run a benchmark sleep test for 5 seconds'
  }],
});
const result = await composio.provider.handleToolCalls(
  'your-user-id',
  response.output
);
console.log(result);

Get tools from Tool Router session and execute Bench actions with your Agent
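Before handing `response.output` to `handleToolCalls`, it can help to log which tool calls the model actually requested. In the OpenAI Responses API, tool requests appear as output items with `type: 'function_call'` and a JSON-encoded `arguments` string. The sketch below relies on that shape only. `listFunctionCalls` is a hypothetical helper, and the interfaces cover just the fields it reads.

```typescript
// Minimal shape of one Responses API output item; only the fields used here.
interface OutputItem {
  type: string;
  name?: string;
  arguments?: string; // JSON-encoded arguments on function_call items
  call_id?: string;
}

interface ParsedCall {
  callId: string;
  name: string;
  args: unknown;
}

// Hypothetical helper: list the tool calls a response asked for.
function listFunctionCalls(output: OutputItem[]): ParsedCall[] {
  const calls: ParsedCall[] = [];
  for (const item of output) {
    if (item.type !== 'function_call' || !item.name || !item.call_id) continue;
    let args: unknown;
    try {
      args = JSON.parse(item.arguments ?? '{}');
    } catch (_err) {
      args = { raw: item.arguments }; // keep unparseable arguments for debugging
    }
    calls.push({ callId: item.call_id, name: item.name, args });
  }
  return calls;
}
```

Calling `listFunctionCalls(response.output)` before `handleToolCalls` gives a simple record of what the agent is about to run.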

Why Use Composio?

AI Native Bench Integration

Supports both Bench MCP and direct API-based integrations
Structured, LLM-friendly schemas for reliable tool execution
Rich coverage for running, tracking, and analyzing your Bench benchmarks

Managed Auth

No credentials required: Bench supports NO_AUTH for fast, frictionless setup
Central place to manage and scope Bench access if needed
Per-user and per-environment context for ultimate flexibility

Agent Optimized Design

Bench tools are tuned for LLM agents for reliable execution
Comprehensive logs so you always know what benchmarks ran, when, and why

Enterprise Grade Security

Fine-grained RBAC so you control which agents and users can access Bench
Scoped, least privilege access to benchmarking resources
Full audit trail of agent actions to support review and compliance

FRAMEWORKS

Use Bench with any AI Agent Framework

Choose a Framework you want to connect Bench with

OpenAI Agents SDK

Use Bench MCP with OpenAI Agents SDK

Claude Agent SDK

Use Bench MCP with Claude Agent SDK

Claude Code

Use Bench MCP with Claude Code

Claude Cowork

Use Bench MCP with Claude Cowork

Codex

Use Bench MCP with Codex

OpenClaw

Use Bench MCP with OpenClaw

Hermes

Use Bench MCP with Hermes

Google ADK

Use Bench MCP with Google ADK

LangChain

Use Bench MCP with LangChain

AI SDK

Use Bench MCP with AI SDK

Mastra AI

Use Bench MCP with Mastra AI

LlamaIndex

Use Bench MCP with LlamaIndex

CrewAI

Use Bench MCP with CrewAI

Pydantic AI

Use Bench MCP with Pydantic AI

AutoGen

Use Bench MCP with AutoGen

MORE TOOLKITS

Explore Other Toolkits

Toolkit marketplace

Supabase

OAuth2, API Key

Supabase is an open-source backend platform offering scalable Postgres databases, authentication, storage, and real-time APIs. It lets developers build modern apps without managing infrastructure.

Codeinterpreter

No Auth

Codeinterpreter is a Python-based coding environment with built-in data analysis and visualization. It lets you instantly run scripts, plot results, and prototype solutions inside supported platforms.

GitHub

OAuth2

GitHub is a code hosting platform for version control and collaborative software development. It streamlines project management, code review, and team workflows in one place.

1Password

API Key

1Password is a password manager and digital vault for storing logins, secrets, notes, and secure documents. It helps individuals and teams protect credentials, share access safely, and reduce password risk.

FAQ

Frequently asked questions

Do I need my own developer credentials to use Bench with Composio?

No developer credentials are needed for Bench. You can get started right away, with no setup required.

Can I use multiple toolkits together?

Yes. Composio's Tool Router enables agents to use multiple toolkits.

Is Composio secure?

Composio is SOC 2 and ISO 27001 compliant, with all data encrypted in transit and at rest.

What if the API changes?

Composio maintains and updates all toolkit integrations automatically, so your agents always work with the latest API versions.

Start with Bench. It takes 30 seconds.

Managed auth, hosted MCP servers, and every Bench tool your agent needs. Free to start.
