Tips and tricks to reduce MCP token bloat

Bill Doerrfeld | February 5, 2026

My latest for The New Stack shares techniques on optimizing MCP usage.

MCP servers can quickly drain context windows without the right guardrails. Thankfully, there are ways around this...

Today,
my feature with The New Stack breaks down a number of practical techniques for reducing MCP token bloat as teams begin using multiple MCPs in real, scaled workflows.

Techniques include more intentional tool design, minimizing upfront context, progressive disclosure, better tool discovery, subagents, code mode, semantic caching, stronger prompting practices, and more.

The big takeaway: as MCP gains real enterprise traction, it'll take smart approaches to optimize its use in software development.

Huge thank you to the experts who shared their knowledge with me for this piece! This one features, in order of appearance:

-
Gil Feig, CTO and co-founder, Merge
-
Christian Posta, VP and global field CTO, solo.io
-
Alex Salazar, co-founder and CEO, Arcade.dev
-
Marcin Klimek, senior technical product manager, SmartBear
-
Kevin Swiber, API strategist, Layered System
-
Neeraj Abhyankar, VP of data and AI, R Systems
-
Ori Yitzhaki, chief product officer, Sonar
-
Tom Moor, head of engineering, Linear
-
Matt Martin, co-founder and CEO, Clockwise
-
Ankit Jain, CEO, Aviator
-
Melissa R., Director of AI, AppOmni


This is a space I expect will continue to evolve, and I hope to continue covering the emerging techniques to get the most of MCP in practice.

Read: 10 strategies to reduce MCP token bloat

Other Blog Posts

By Bill Doerrfeld May 1, 2026
Cloudflare rebuilt Next.js over a weekend using agentic coding.
By Bill Doerrfeld April 20, 2026
My InfoWorld feature reviews the key building blocks in agentic systems and with real-world examples from Shopify, Block, and others.
By Bill Doerrfeld March 31, 2026
My latest InfoWorld feature explores what makes an enterprise MCP registry effective, from semantic discovery to governance and security for AI agents.
By Bill Doerrfeld March 30, 2026
My first-ever contribution to CSO Online looks at the shifting landscape, from perimeter-based security to API security, and how CISOs are responding.
By Bill Doerrfeld March 29, 2026
My latest feature for The New Stack looks into solutions being proposed to fix open source Slopmageddon.
A digital pattern of rounded rectangular blocks in shades of blue and purple, arranged in an interlocking layout.
By Bill Doerrfeld March 27, 2026
My latest DirectorPlus looks at how agentic AI is reshaping platform engineering at Squarespace: less shared code and more developer experience focus.
By Bill Doerrfeld March 19, 2026
Usage-based pricing is reshaping the API economy. Discover 5 API monetization success stories, including OpenAI, Plaid, and AssemblyAI.
A lightbulb against a purple background, containing a human brain with an
By Bill Doerrfeld March 18, 2026
Why event-driven APIs matter for AI workflows, enabling real-time data, scalable systems, and responsive agent behavior.
By Bill Doerrfeld February 28, 2026
While hardware usually gets the spotlight in physical AI, the real differentiator won't be hardware. It'll be the models.
By Bill Doerrfeld February 27, 2026
In the latest DirectorPlus, Workato's CTO explains how MCP-enabled integration catalyzed internal AI usage and ROI.