Measuring the Token Cost of MCP Tool Definitions

The article examines the token cost of tool definitions exposed over the Model Context Protocol (MCP), finding that they can consume tens of thousands of tokens before the model even reads a user message. It also introduces a tool for auditing and optimizing tool schemas.

💡 Why it matters

Optimizing tool definitions is crucial for keeping MCP-based agents efficient and cost-effective, since the schemas of every connected server are injected into the model's context on each request.

Key Points

  1. Tool definitions from 11 popular MCP servers consumed 22,945 tokens before a single user message
  2. Format differences between providers (OpenAI, MCP, Google) can add 140 tokens across 20 tools
  3. Auditing and optimizing tool schemas can reduce token costs by up to 21%

Details

The article discusses the significant token cost of tool definitions from Model Context Protocol (MCP) servers like GitHub, Slack, and Brave Search. The author measured the token consumption of 137 tools across 11 popular MCP servers and found that 22,945 tokens were injected before the model read a single user message; one server (GitHub) accounted for 69% of the total. The article shows how even a simple function definition can cost 60 tokens, and how that adds up quickly with 20-30 tools, and how format differences between providers like OpenAI, MCP, and Google lead to meaningful token differences for the same tools. To address this, the author introduces a tool called 'agent-friend' that audits tool schemas, identifies optimization opportunities, and reduces token costs by up to 21% through techniques such as removing verbose prefixes, trimming long descriptions, and eliminating redundant parameter information.
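The kind of audit the article describes can be sketched in a few lines. The snippet below is illustrative, not the author's 'agent-friend' tool: the tool schema is a hypothetical example in the OpenAI function-calling shape, and the token count is a rough characters-per-token approximation (a real audit would use the provider's actual tokenizer, e.g. tiktoken for OpenAI models).

```python
import json

# Hypothetical tool definition in OpenAI function-calling style
# (illustrative only; not taken from any real MCP server).
tool = {
    "name": "search_repositories",
    "description": (
        "Search for GitHub repositories. This tool allows you to search for "
        "repositories on GitHub using a query string and returns a paginated "
        "list of matching repositories with metadata for each result."
    ),
    "parameters": {
        "type": "object",
        "properties": {
            "query": {
                "type": "string",
                "description": "The search query string to use for the search",
            },
            "page": {
                "type": "integer",
                "description": "The page number of the results to return",
            },
        },
        "required": ["query"],
    },
}

def approx_tokens(obj) -> int:
    """Rough estimate: ~4 characters per token for English/JSON text.
    Swap in the provider's real tokenizer for an accurate audit."""
    return len(json.dumps(obj, separators=(",", ":"))) // 4

def trim_descriptions(node, max_chars: int = 60):
    """Shorten every 'description' field -- one of the optimizations the
    article mentions, alongside removing verbose prefixes."""
    if isinstance(node, dict):
        return {
            k: (v[:max_chars] if k == "description" and isinstance(v, str)
                else trim_descriptions(v, max_chars))
            for k, v in node.items()
        }
    if isinstance(node, list):
        return [trim_descriptions(v, max_chars) for v in node]
    return node

before = approx_tokens(tool)
after = approx_tokens(trim_descriptions(tool))
print(f"{before} -> {after} tokens, saved {100 * (before - after) / before:.0f}%")
```

Multiplying a per-tool saving like this across 20-30 tools is how the article's up-to-21% overall reduction becomes plausible.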


AI Curator - Daily AI News Curation
