Tips and tricks to reduce MCP token bloat

Bill Doerrfeld | February 5, 2026

My latest for The New Stack shares techniques on optimizing MCP usage.

MCP servers can quickly drain context windows without the right guardrails. Thankfully, there are ways around this...

Today, my feature for The New Stack breaks down a number of practical techniques for reducing MCP token bloat as teams begin using multiple MCP servers in real, scaled workflows.

Techniques include more intentional tool design, minimizing upfront context, progressive disclosure, better tool discovery, subagents, code mode, semantic caching, stronger prompting practices, and more.
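To make one of these concrete, here is a minimal sketch of progressive disclosure. It is my own illustration, not code from the article or from any MCP SDK: the `Tool` and `ToolCatalog` classes, the example tools, and the character-count size estimate are all hypothetical. The idea is to give the model only a compact index of tool names and one-line summaries up front, and to load a tool's full input schema only once the model has picked that tool.

```python
import json
from dataclasses import dataclass, field


@dataclass
class Tool:
    name: str
    summary: str  # one-line description, always visible to the model
    input_schema: dict = field(default_factory=dict)  # full schema, loaded on demand


class ToolCatalog:
    """Hypothetical catalog that discloses tool details progressively."""

    def __init__(self, tools):
        self._tools = {t.name: t for t in tools}

    def index(self):
        """Compact listing placed in the model's context up front."""
        return [{"name": t.name, "summary": t.summary} for t in self._tools.values()]

    def describe(self, name):
        """Full definition, fetched only after the model selects a tool."""
        t = self._tools[name]
        return {"name": t.name, "summary": t.summary, "input_schema": t.input_schema}


def rough_size(obj):
    """Character count of the JSON form; a crude stand-in for a token count."""
    return len(json.dumps(obj))


# Example tools (invented for illustration).
catalog = ToolCatalog([
    Tool(
        name="create_issue",
        summary="Create a new issue.",
        input_schema={
            "type": "object",
            "properties": {
                "title": {"type": "string", "description": "Short issue title"},
                "body": {"type": "string", "description": "Detailed description"},
                "labels": {"type": "array", "items": {"type": "string"}},
            },
            "required": ["title"],
        },
    ),
    Tool(
        name="search_issues",
        summary="Search issues by keyword.",
        input_schema={
            "type": "object",
            "properties": {
                "query": {"type": "string", "description": "Search terms"},
                "limit": {"type": "integer", "description": "Maximum results"},
            },
            "required": ["query"],
        },
    ),
])

upfront = rough_size(catalog.index())
everything = rough_size([catalog.describe(t["name"]) for t in catalog.index()])
print(f"index only: {upfront} chars, all full definitions: {everything} chars")
```

With only two tools the difference is small, but the index grows by one short line per tool while the full definitions grow by an entire schema per tool, so the gap widens as more servers are connected.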

The big takeaway: as MCP gains real enterprise traction, it'll take smart approaches to optimize its use in software development.

Huge thank you to the experts who shared their knowledge with me for this piece! This one features, in order of appearance:

- Gil Feig, CTO and co-founder, Merge
- Christian Posta, VP and global field CTO, solo.io
- Alex Salazar, co-founder and CEO, Arcade.dev
- Marcin Klimek, senior technical product manager, SmartBear
- Kevin Swiber, API strategist, Layered System
- Neeraj Abhyankar, VP of data and AI, R Systems
- Ori Yitzhaki, chief product officer, Sonar
- Tom Moor, head of engineering, Linear
- Matt Martin, co-founder and CEO, Clockwise
- Ankit Jain, CEO, Aviator
- Melissa R., director of AI, AppOmni


I expect this space to keep evolving, and I hope to keep covering the emerging techniques for getting the most out of MCP in practice.

Read: 10 strategies to reduce MCP token bloat
