# Ocrspace

```json
{
  "name": "Ocrspace",
  "slug": "ocrspace",
  "url": "https://composio.dev/toolkits/ocrspace",
  "markdown_url": "https://composio.dev/toolkits/ocrspace.md",
  "logo_url": "https://logos.composio.dev/api/ocrspace",
  "categories": [
    "document & file management"
  ],
  "is_composio_managed": false,
  "updated_at": "2026-05-12T10:20:29.062Z"
}
```

![Ocrspace logo](https://logos.composio.dev/api/ocrspace)

## Description

Securely connect your AI agents and chatbots (Claude, ChatGPT, Cursor, etc) with Ocrspace MCP or direct API to extract text from images, analyze PDF documents, summarize scanned receipts, and process photographed notes through natural language.

## Summary

Ocrspace is an OCR API that extracts text from images and PDFs in JSON format. Instantly turn visual content into searchable, usable data.

## Categories

- document & file management

## Toolkit Details

- Tools: 3

## Images

- Logo: https://logos.composio.dev/api/ocrspace

## Authentication

- **Api Key**
  - Type: `api_key`
  - Description: Api Key authentication for OCR.space.
  - Setup:
    - Configure Api Key credentials for OCR.space.
    - Use the credentials when creating an auth config in Composio.

## Suggested Prompts

- Extract text from a scanned receipt image
- Convert a PDF invoice to editable text
- Get all words from a business card photo
- Pull text from a screenshot of a document

## Supported Tools

| Tool slug | Name | Description |
|---|---|---|
| `OCRSPACE_GET_CONVERSIONS` | Get Conversion Statistics | Retrieve OCR API conversion statistics and usage data (PRO accounts only). Returns the number of conversions for Engine1, Engine2, and total conversions. Data is updated once daily and shows conversions from start of month to end of yesterday. Free API keys will return 0 conversions. |
| `OCRSPACE_OCR_PARSE_IMAGE_POST` | Extract Text from Image/PDF (OCR) | Extract text from images and PDF documents using OCR (Optical Character Recognition). Supports 27 languages, table recognition, orientation detection, and word-level coordinate extraction. Provide exactly one of `file`, `url`, or `base64Image`; providing multiple or none triggers E301/OCRExitCode 99. Input can be provided as file upload, public URL, or base64-encoded data URI. Response is nested JSON; extract text from `ParsedResults[*].ParsedText`. Returns extracted text with optional overlay coordinates and searchable PDF generation. For poor-quality scans, enable both `detectOrientation` and `scale` and ensure `language` matches the document. |
| `OCRSPACE_PARSE_IMAGE_URL` | Extract Text from Image URL (GET) | Extract text from images via URL using simplified GET endpoint. Only supports URL-based submissions - no file uploads or base64 encoding. Faster and simpler than POST endpoint for basic use cases. |

## Supported Triggers

None listed.

## Installation and MCP Setup

### Path 1: SDK Installation

#### Path 1, Step 1: Install Composio

Install the Composio SDK
```python
pip install composio_openai
```

```typescript
npm install @composio/openai
```

#### Path 1, Step 2: Initialize Composio and Create Tool Router Session

Import and initialize Composio client, then create a Tool Router session
```python
from openai import OpenAI
from composio import Composio
from composio_openai import OpenAIResponsesProvider

composio = Composio(provider=OpenAIResponsesProvider())
openai = OpenAI()
session = composio.create(user_id='your-user-id')
```

```typescript
import OpenAI from 'openai';
import { Composio } from '@composio/core';
import { OpenAIResponsesProvider } from '@composio/openai';

const composio = new Composio({
  provider: new OpenAIResponsesProvider(),
});
const openai = new OpenAI({});
const session = await composio.create('your-user-id');
```

#### Path 1, Step 3: Execute Ocrspace Tools via Tool Router with Your Agent

Get tools from Tool Router session and execute Ocrspace actions with your Agent
```python
tools = session.tools
response = openai.responses.create(
  model='gpt-4.1',
  tools=tools,
  input=[{
    'role': 'user',
    'content': 'Extract all text from this uploaded receipt image.'
  }]
)
result = composio.provider.handle_tool_calls(
  response=response,
  user_id='your-user-id'
)
print(result)
```

```typescript
const tools = session.tools;
const response = await openai.responses.create({
  model: 'gpt-4.1',
  tools: tools,
  input: [{
    role: 'user',
    content: 'Extract all text from this uploaded receipt image.'
  }],
});
const result = await composio.provider.handleToolCalls(
  'your-user-id',
  response.output
);
console.log(result);
```

### Path 2: MCP Server Setup

#### Path 2, Step 1: Install Composio

Install the Composio SDK and Claude Agent SDK
```python
pip install composio claude-agent-sdk
```

```typescript
npm install @composio/core ai @ai-sdk/openai @ai-sdk/mcp
```

#### Path 2, Step 2: Create Tool Router Session

Initialize the Composio client and create a Tool Router session
```python
from composio import Composio
from claude_agent_sdk import ClaudeSDKClient, ClaudeAgentOptions

composio = Composio(api_key='your-composio-api-key')
session = composio.create(user_id='your-user-id')
url = session.mcp.url
```

```typescript
import { Composio } from '@composio/core';

const composio = new Composio({ apiKey: 'your-api-key' });

console.log("Creating Tool Router session...");
const { mcp } = await composio.create('your-user-id');
console.log(`Tool Router session created: ${mcp.url}`);
```

#### Path 2, Step 3: Connect to AI Agent

Use the MCP server with your AI agent
```python
import asyncio

options = ClaudeAgentOptions(
    permission_mode='bypassPermissions',
    mcp_servers={
        'tool_router': {
            'type': 'http',
            'url': url,
            'headers': {
                'x-api-key': 'your-composio-api-key'
            }
        }
    },
    system_prompt='You are a helpful assistant with access to Ocrspace tools.',
    max_turns=10
)

async def main():
    async with ClaudeSDKClient(options=options) as client:
        await client.query('Extract all text from this image file')
        async for message in client.receive_response():
            if hasattr(message, 'content'):
                for block in message.content:
                    if hasattr(block, 'text'):
                        print(block.text)

asyncio.run(main())
```

```typescript
import { openai } from '@ai-sdk/openai';
import { experimental_createMCPClient as createMCPClient } from '@ai-sdk/mcp';
import { generateText, stepCountIs } from 'ai';

const client = await createMCPClient({
  transport: {
    type: 'http',
    url: mcp.url,
    headers: { 'x-api-key': 'your-composio-api-key' }
  }
});

const tools = await client.tools();

const { text } = await generateText({
  model: openai('gpt-4o'),
  tools,
  messages: [{ role: 'user', content: 'Extract all text from this image file' }],
  stopWhen: stepCountIs( 5 )
});

console.log(`Agent: ${text}`);
```

## Why Use Composio?

### 1. AI Native Ocrspace Integration

- Supports both Ocrspace MCP and direct API based integrations
- Structured, LLM-friendly schemas for reliable tool execution
- Rich coverage for extracting, parsing, and analyzing text from uploaded images and PDFs

### 2. Managed Auth

- Built-in API key handling with secure storage and rotation
- Central place to manage, scope, and revoke Ocrspace access
- Per user and per environment credentials instead of hard-coded keys

### 3. Agent Optimized Design

- Tools are tuned using real error and success rates to improve reliability over time
- Comprehensive execution logs so you always know what ran, when, and on whose behalf

### 4. Enterprise Grade Security

- Fine-grained RBAC so you control which agents and users can access Ocrspace
- Scoped, least privilege access to Ocrspace resources
- Full audit trail of agent actions to support review and compliance

## Use Ocrspace with any AI Agent Framework

Choose a framework you want to connect Ocrspace with:

- [OpenAI Agents SDK](https://composio.dev/toolkits/ocrspace/framework/open-ai-agents-sdk)
- [Claude Agent SDK](https://composio.dev/toolkits/ocrspace/framework/claude-agents-sdk)
- [Claude Code](https://composio.dev/toolkits/ocrspace/framework/claude-code)
- [Claude Cowork](https://composio.dev/toolkits/ocrspace/framework/claude-cowork)
- [Codex](https://composio.dev/toolkits/ocrspace/framework/codex)
- [OpenClaw](https://composio.dev/toolkits/ocrspace/framework/openclaw)
- [Hermes](https://composio.dev/toolkits/ocrspace/framework/hermes-agent)
- [Google ADK](https://composio.dev/toolkits/ocrspace/framework/google-adk)
- [LangChain](https://composio.dev/toolkits/ocrspace/framework/langchain)
- [Vercel AI SDK](https://composio.dev/toolkits/ocrspace/framework/ai-sdk)
- [Mastra AI](https://composio.dev/toolkits/ocrspace/framework/mastra-ai)
- [LlamaIndex](https://composio.dev/toolkits/ocrspace/framework/llama-index)
- [CrewAI](https://composio.dev/toolkits/ocrspace/framework/crew-ai)
- [Pydantic AI](https://composio.dev/toolkits/ocrspace/framework/pydantic-ai)
- [AutoGen](https://composio.dev/toolkits/ocrspace/framework/autogen)

## Related Toolkits

- [Google Drive](https://composio.dev/toolkits/googledrive) - Google Drive is a cloud storage platform for uploading, sharing, and collaborating on files. It's perfect for keeping your documents accessible and organized across devices.
- [Google Docs](https://composio.dev/toolkits/googledocs) - Google Docs is a cloud-based word processor that enables document creation and real-time collaboration. Its seamless sharing and version history make team editing and content management a breeze.
- [Affinda](https://composio.dev/toolkits/affinda) - Affinda is an AI-powered document processing platform that automates data extraction from resumes, invoices, and more. It streamlines document-heavy workflows by turning files into structured, actionable data.
- [Agility cms](https://composio.dev/toolkits/agility_cms) - Agility CMS is a headless content management system for building and managing digital experiences across platforms. It lets teams update content quickly and deliver omnichannel experiences with ease.
- [Algodocs](https://composio.dev/toolkits/algodocs) - Algodocs is an AI-powered platform that automates data extraction from business documents. It delivers fast, secure, and accurate processing without templates or manual training.
- [Api2pdf](https://composio.dev/toolkits/api2pdf) - Api2Pdf is a REST API for generating PDFs from HTML, URLs, and documents using powerful engines like wkhtmltopdf and Headless Chrome. It streamlines document conversion and automation for developers and businesses.
- [Box](https://composio.dev/toolkits/box) - Box is a cloud content management and file sharing platform for businesses. It helps teams securely store, organize, and collaborate on files from anywhere.
- [Cloudconvert](https://composio.dev/toolkits/cloudconvert) - CloudConvert is a powerful file conversion service supporting over 200 file formats. It streamlines converting, compressing, and managing documents, media, and more, all in one place.
- [Cloudlayer](https://composio.dev/toolkits/cloudlayer) - Cloudlayer is a document and asset generation service for creating PDFs and images via API or SDKs. It lets you automate high-quality doc creation, saving dev time and reducing manual work.
- [Cloudpress](https://composio.dev/toolkits/cloudpress) - Cloudpress is a content export tool for Google Docs and Notion. It automates publishing to your favorite Content Management Systems.
- [Contentful graphql](https://composio.dev/toolkits/contentful_graphql) - Contentful graphql is a content delivery API that lets you access Contentful data using GraphQL queries. It gives you efficient, flexible ways to fetch and manage structured content for any digital project.
- [Conversion tools](https://composio.dev/toolkits/conversion_tools) - Conversion Tools is an online service for converting documents between formats such as PDF, Word, Excel, XML, and CSV. It lets you automate complex document workflows with just a few clicks.
- [Convertapi](https://composio.dev/toolkits/convertapi) - ConvertAPI is a robust file conversion service for documents, images, and spreadsheets. It streamlines programmatic format changes and lets developers automate complex workflows with a single API.
- [Craftmypdf](https://composio.dev/toolkits/craftmypdf) - CraftMyPDF is a web-based service for designing and generating PDFs with templates and live data. It streamlines document creation by automating personalized PDFs at scale.
- [Docmosis](https://composio.dev/toolkits/docmosis) - Docmosis generates PDF and Word documents from user-defined templates. It's perfect for merging data fields to quickly produce reports, invoices, and business letters.
- [Docnify](https://composio.dev/toolkits/docnify) - Docnify is a digital signing platform that streamlines the way you sign and manage documents. It brings together tools like Figma, Jira, Trello, and Google Docs for a unified document workspace.
- [Docparser](https://composio.dev/toolkits/docparser) - Docparser is a cloud-based document parsing and automation platform. It streamlines data extraction from PDFs and documents for faster workflows.
- [DocRaptor](https://composio.dev/toolkits/docraptor) - DocRaptor is a powerful API for converting HTML to PDF or XLSX documents. It enables fast, high-quality document generation from your applications.
- [Docsautomator](https://composio.dev/toolkits/docsautomator) - Docsautomator is an automation platform for Google Docs. It lets you create, manage, and generate documents from templates quickly.
- [Docsumo](https://composio.dev/toolkits/docsumo) - Docsumo is an AI-powered platform for automating document data extraction and analysis. It helps you turn PDFs, invoices, and forms into structured, actionable data with minimal manual effort.

## Frequently Asked Questions

### Do I need my own developer credentials to use Ocrspace with Composio?

Yes, Ocrspace requires you to configure your own API key credentials. Once set up, Composio handles secure credential storage and API request handling for you.

### Can I use multiple toolkits together?

Yes! Composio's Tool Router enables agents to use multiple toolkits. [Learn more](https://docs.composio.dev/tool-router/overview).

### Is Composio secure?

Composio is SOC 2 and ISO 27001 compliant with all data encrypted in transit and at rest. [Learn more](https://trust.composio.dev).

### What if the API changes?

Composio maintains and updates all toolkit integrations automatically, so your agents always work with the latest API versions.

---
[See all toolkits](https://composio.dev/toolkits) · [Composio docs](https://docs.composio.dev/llms.txt)
