How to connect Gemini to Claude Cowork

Gemini logo
Claude Cowork logo
divider

Introduction

Cowork is Anthropic's AI agent for knowledge work. Think of it as Claude Code for everything else. It works autonomously with your computer, local files, and applications to accomplish complex tasks.

This guide walks you through the easiest and most secure way to connect your Gemini account to Cowork via Composio Connect, enabling it to summarize this research article in 100 words, generate a creative image of a futuristic city, create a 30-second video based on this script, and more such actions on your behalf without compromising your account security.

Also integrate Gemini with

Connecting Gemini to Claude Cowork

1. Open Customize

In Claude Desktop, click Customize in the left sidebar, then select Connectors and click the + icon at the top.

Claude Desktop connectors screen with Add custom connector selected

2. Add the Composio MCP server

Click Add custom connector and paste in the Composio MCP server URL:

bash
https://connect.composio.dev/mcp
Add custom connector dialog with Composio MCP server URL

3. Authorize in your browser

Click Connect. You'll be redirected to a browser window where you can authorize Composio to continue.

Composio authorization screen for Claude Cowork

4. Connect your Gemini account

Back in Cowork, ask the agent to connect to Gemini or give it any Gemini-related task.

For example, ask Cowork to:

  • "Summarize this research article in 100 words"
  • "Generate a creative image of a futuristic city"
  • "Create a 30-second video based on this script"

It will prompt you to authenticate and authorize access.

That's it. Composio's tools are now available in Cowork, and your Gemini account is ready to use.

What is Claude Cowork?

Claude Cowork is Anthropic's agent for general knowledge work. It can use your computer, files, and connected applications to complete longer-running tasks across your work tools.

With Composio Connect, Cowork can securely access apps like Gemini through MCP without you sharing account credentials directly with the agent.

What is the Gemini MCP server, and what's possible with it?

The Gemini MCP server is an implementation of the Model Context Protocol that connects your AI agent and assistants like Claude, Cursor, etc directly to your Gemini account. It provides structured and secure access to Gemini's multimodal AI features, so your agent can generate text, images, and videos, analyze content, and manage model resources on your behalf.

  • Text and content generation: Instruct your agent to create high-quality, customized text using Gemini's advanced generative models—great for brainstorming, drafting, or summarizing information.
  • Creative image and video generation: Ask the agent to generate original images or high-quality videos from text prompts using Gemini 2.5 Flash and Veo models, with fine control over style and format.
  • Embedding and semantic analysis: Let your agent transform any text into rich semantic embeddings for similarity search, clustering, or classification tasks.
  • Model discovery and optimization: Have the agent list available Gemini and Veo models, check their capabilities, and select the best fit for your project or workflow.
  • Efficient resource management: Enable the agent to track video generation operations, download final assets, and optimize prompt inputs by counting tokens—all without manual intervention.

Supported Tools & Triggers

Tools
Count Tokens (Gemini)Counts the number of tokens in text using gemini tokenization.
Download Video (Veo)Downloads a generated veo video to local storage.
Embed Content (Gemini)Generates text embeddings using gemini embedding models.
Generate Content (Gemini)Generates text content from prompts using gemini models.
Generate Image (Gemini 2.5 Flash)Generates images from text prompts using gemini 2.
Generate Videos (Veo)Generates videos from text prompts using google's veo models.
Get Videos Operation (Veo)Checks the status of a veo video generation operation.
List Models (Gemini API)Lists available gemini and veo models with their capabilities and limits.
Wait For Video (Veo)Polls a veo video generation operation until completion or timeout.

Available tools and triggers

After setup, the supported Gemini tools and triggers listed on this page are available to Cowork through Composio Connect.

You can now ask Cowork to handle Gemini workflows in natural language, from quick summaries and drafting tasks to more complex multi-step work across connected apps.

How to build Gemini MCP Agent with another framework

FAQ

What are the differences in Tool Router MCP and Gemini MCP?

With a standalone Gemini MCP server, the agents and LLMs can only access a fixed set of Gemini tools tied to that server. However, with the Composio Tool Router, agents can dynamically load tools from Gemini and many other apps based on the task at hand, all through a single MCP endpoint.

Can I use Tool Router MCP with Claude Cowork?

Yes, you can. Claude Cowork fully supports MCP integration. You get structured tool calling, message history handling, and model orchestration while Tool Router takes care of discovering and serving the right Gemini tools.

Can I manage the permissions and scopes for Gemini while using Tool Router?

Yes, absolutely. You can configure which Gemini scopes and actions are allowed when connecting your account to Composio. You can also bring your own OAuth credentials or API configuration so you keep full control over what the agent can do.

How safe is my data with Composio Tool Router?

All sensitive data such as tokens, keys, and configuration is fully encrypted at rest and in transit. Composio is SOC 2 Type 2 compliant and follows strict security practices so your Gemini data and credentials are handled as safely as possible.

Used by agents from

Context
Letta
glean
HubSpot
Agent.ai
Altera
DataStax
Entelligence
Rolai
Context
Letta
glean
HubSpot
Agent.ai
Altera
DataStax
Entelligence
Rolai
Context
Letta
glean
HubSpot
Agent.ai
Altera
DataStax
Entelligence
Rolai

Never worry about agent reliability

We handle tool reliability, observability, and security so you never have to second-guess an agent action.