Should AI agents scrape or integrate external data?

Bill Doerrfeld | February 3, 2026

My latest on InfoWorld explores the upsides and caveats of both approaches.

To scrape or integrate? It's an age-old question resurfacing for AI agent builders.

Excited to share my analysis today for
InfoWorld, where I break down when it makes sense to scrape public web sources, and when official API integrations are the better choice for external data.

The takeaway: agents need data. New interactive browser tools and scraping techniques help pull in real-time, supplementary signals. But scraping comes with fragility and legal downsides. As
Deepak Singh puts it, "It's building on quicksand."

Scraping is no substitute for the predictable, validated, and governed integrations agents need to execute auditable workflows and real-world actions reliably.

This article features, in order of appearance:

-
Or Lenchner, CEO, Bright Data
-
Deepak Singh, CEO and co-founder, AvairAI Inc.
-
Neeraj Abhyankar, VP, Data and AI, R Systems
-
Gaurav Pathak, VP of AI and metadata, Informatica
-
Keith Pijanowski, AI and ML solutions engineer, MinIO
-
Krishna Subramanian, co-founder and COO, Komprise

Also shout-outs to reports from
PwC (2025 AI Agents Survey), Tray.ai (2024 Enterprise Survey), Salt Security (2025 AI Agents Report), and McKinsey & Company (2025 State of AI Study), plus links to reporting from AI21 Labs, The Register, and WIRED.

Read: How should AI agents consume external data?

Other Blog Posts

By Bill Doerrfeld June 17, 2026
My latest for LeadDev considers how engineering leaders should respond in the wake of uncertainty in the AI model market.
By Bill Doerrfeld June 10, 2026
I'm working with Zuplo on some new content around their MCP Gateway release. First up: a deep comparison of MCP gateways on the market!
By Bill Doerrfeld June 10, 2026
The constant barrage of AI layoffs is overshadowing the economic reasons behind these cuts, as well as the net-positive talent redistribution happening at large.
By Bill Doerrfeld June 8, 2026
My latest for InfoWorld reviews MCP servers and agent-ready tools for connecting AI agents with popular database styles.
By Bill Doerrfeld May 29, 2026
For my latest DirectorPlus edition, Joel Carusone from NinjaOne shares how engineering leaders can build the muscle for making tough calls.
Close-up of a glowing laptop keyboard in blue light, viewed at an angle with the screen above
By Bill Doerrfeld May 25, 2026
My latest InfoWorld feature explores how Model Context Protocol (MCP) supports context engineering for AI-assisted coding.
A set of metal keys on a keyring resting on a wooden surface.
By Bill Doerrfeld May 22, 2026
My latest for Nordic APIs explores 10 API key security risks and what to use alongside keys for stronger API security.
By Bill Doerrfeld May 18, 2026
The yearly API conference, apidays New York, is a hotbed for solid discussion on what's top of mind in the API space, and as MC I had a front row seat.
By Bill Doerrfeld May 13, 2026
My latest for CIO Online features real results form CIOs actively deploying AI agents to empower sales and revenue teams.
By Bill Doerrfeld May 12, 2026
Reports say consumers are souring on AI everywhere, all the time. So, at the risk of losing trust, or even potential business, is adding AI to an existing product really worth it?