Tips and tricks to reduce MCP token bloat

Bill Doerrfeld | February 5, 2026

My latest for The New Stack shares techniques on optimizing MCP usage.

MCP servers can quickly drain context windows without the right guardrails. Thankfully, there are ways around this...

Today,
my feature with The New Stack breaks down a number of practical techniques for reducing MCP token bloat as teams begin using multiple MCPs in real, scaled workflows.

Techniques include more intentional tool design, minimizing upfront context, progressive disclosure, better tool discovery, subagents, code mode, semantic caching, stronger prompting practices, and more.

The big takeaway: as MCP gains real enterprise traction, it'll take smart approaches to optimize its use in software development.

Huge thank you to the experts who shared their knowledge with me for this piece! This one features, in order of appearance:

-
Gil Feig, CTO and co-founder, Merge
-
Christian Posta, VP and global field CTO, solo.io
-
Alex Salazar, co-founder and CEO, Arcade.dev
-
Marcin Klimek, senior technical product manager, SmartBear
-
Kevin Swiber, API strategist, Layered System
-
Neeraj Abhyankar, VP of data and AI, R Systems
-
Ori Yitzhaki, chief product officer, Sonar
-
Tom Moor, head of engineering, Linear
-
Matt Martin, co-founder and CEO, Clockwise
-
Ankit Jain, CEO, Aviator
-
Melissa R., Director of AI, AppOmni


This is a space I expect will continue to evolve, and I hope to continue covering the emerging techniques to get the most of MCP in practice.

Read: 10 strategies to reduce MCP token bloat

Other Blog Posts

By Bill Doerrfeld June 17, 2026
My latest for LeadDev considers how engineering leaders should respond in the wake of uncertainty in the AI model market.
By Bill Doerrfeld June 10, 2026
I'm working with Zuplo on some new content around their MCP Gateway release. First up: a deep comparison of MCP gateways on the market!
By Bill Doerrfeld June 10, 2026
The constant barrage of AI layoffs is overshadowing the economic reasons behind these cuts, as well as the net-positive talent redistribution happening at large.
By Bill Doerrfeld June 8, 2026
My latest for InfoWorld reviews MCP servers and agent-ready tools for connecting AI agents with popular database styles.
By Bill Doerrfeld May 29, 2026
For my latest DirectorPlus edition, Joel Carusone from NinjaOne shares how engineering leaders can build the muscle for making tough calls.
Close-up of a glowing laptop keyboard in blue light, viewed at an angle with the screen above
By Bill Doerrfeld May 25, 2026
My latest InfoWorld feature explores how Model Context Protocol (MCP) supports context engineering for AI-assisted coding.
A set of metal keys on a keyring resting on a wooden surface.
By Bill Doerrfeld May 22, 2026
My latest for Nordic APIs explores 10 API key security risks and what to use alongside keys for stronger API security.
By Bill Doerrfeld May 18, 2026
The yearly API conference, apidays New York, is a hotbed for solid discussion on what's top of mind in the API space, and as MC I had a front row seat.
By Bill Doerrfeld May 13, 2026
My latest for CIO Online features real results form CIOs actively deploying AI agents to empower sales and revenue teams.
By Bill Doerrfeld May 12, 2026
Reports say consumers are souring on AI everywhere, all the time. So, at the risk of losing trust, or even potential business, is adding AI to an existing product really worth it?