AI & ML

Agentic AI at Scale: The Four Foundational Data Prerequisites

· 5 min read

The Agentic AI Paradox: Trillions in Forecasts, but Enterprise Reality Hits a Data Wall

The numbers around agentic AI are undeniably compelling. We're talking about market growth that most tech segments can only dream of, with Deloitte Digital projecting the global agentic AI market to swell from $8.5 billion by the end of 2026 to nearly $40 billion just four years later. And Gartner sees worldwide AI spending hitting an eye-watering $2.5 trillion in 2026, a 44% jump in a single year. Enterprises are quickly moving to adopt these autonomous systems, with MuleSoft research showing an average organization already running 12 AI agents and expecting to increase that by 67% to 20 agents in the next two years.

Here's the thing, though: beneath the surface of these soaring projections and enthusiastic adoption rates lies a significant, often overlooked, hurdle. While nearly two-thirds of enterprises have experimented with agentic AI, fewer than 10% have managed to scale these deployments to deliver actual, measurable value. My read is that this isn't a problem with the agents themselves, but rather with the foundational infrastructure they depend on.

The Data Chasm: Why Scaling Autonomous Agents Stalls

The instinct might be to focus on the sophistication of the AI models or the complexity of agent orchestration. But that misses the point. The single biggest obstacle, cited by a staggering eight in ten companies, isn't talent, or even the tech stack itself, but rather poor data. IDC spells out the consequences starkly: by 2027, companies that don't prioritize high-quality, AI-ready data risk a 15% loss in productivity simply because they'll struggle to scale their generative AI and agentic solutions.

What makes data such a critical bottleneck for agentic AI? Unlike traditional software, autonomous agents are designed to execute complex tasks without constant human intervention. They need to make decisions, often across various business processes. This autonomy demands an unprecedented level of trust in the data they consume. Fragmented data, inconsistent standards, or data locked in silos directly translate to errors and poor decision-making from an agent. Think about it: an agent needs reliable, consistent information to act on its own, and if it's operating on bad data, it becomes a liability.

This challenge is exacerbated by the realities of enterprise IT. MuleSoft’s research reveals that the average enterprise manages 957 applications—and for those further along in their agentic AI journey, that number climbs to 1,057. Yet, only about 27% of these applications are currently connected, creating a sprawling, disconnected data landscape. IT teams are spending, on average, 36% of their time building and testing custom integrations between systems. Custom work isn't how you scale AI adoption; it's how you stay stuck in pilot purgatory. Data quality emerges as the top concern for a quarter of organizations when deploying AI, and nearly all (96%) struggle to use data from across their business for AI initiatives.

This is why, while 2025 was the year for small experiments, 2026 is shaping up to be the year of scaling agentic AI. And to scale, as IDC aptly notes, companies absolutely need trustworthy, accessible, and quality data.

A depiction of a robot in the center of a maze of data and applications

Beyond the Bots: Reshaping Roles and Governance

The shift to agentic AI isn't just about technical infrastructure; it's fundamentally reshaping how organizations operate and how people work. IDC forecasts that by 2026, a substantial 40% of all Global 2000 job roles will involve working directly with AI agents. This isn't just automation; it's a redefinition of traditional entry, mid, and senior-level positions, suggesting a future where human roles pivot from direct execution to supervising and orchestrating agent-led workflows.

This brings us to another critical constraint McKinsey identified: operating model and talent limitations, alongside ineffective change management. As agents take on more autonomous tasks, new governance models become indispensable. How do you dictate how agents can operate in a trustworthy, transparent, and scalable manner? The challenge intensifies as organizations embrace hybrid work environments, demanding clear rules and frameworks for agent autonomy.

It's a complex dance. On one hand, you want agents to run with minimal human intervention; on the other, you need controls to ensure accuracy, compliance, and ethical operation. This is why some are finding AI agents are fast, loose, and out of control without proper guardrails. It's not just about building better agents, but also how to build better AI agents for your business - without creating trust issues. This reorientation requires companies to think deeply about upskilling workers for AI and fostering a culture that can adapt to these new human-agent collaborations, while also being mindful that prolonged AI use can be hazardous to your health and work.

Building the Bedrock: A Strategic Imperative for AI Agents

McKinsey's research points to a clear pathway for businesses looking to move beyond experimentation and actually scale agentic AI for measurable value. It comes down to four coordinated steps that connect strategy, technology, and people:

  1. Identify High-Impact Workflows to 'Agentify': Start by looking for highly deterministic, repetitive tasks. These are strong candidates for AI agents because their outcomes are predictable and the value they deliver is clear. McKinsey suggests that customer service, marketing, knowledge management, and IT are leading the charge in AI adoption, offering fertile ground for agent deployment.

  2. Modernize Each Layer of the Data Architecture for Agents: This is arguably the most crucial step. Data architecture needs to support interoperability, easy access, and robust governance across systems. The current reality of disconnected applications just won't cut it. Enterprises must move towards an architecture that can feed agents a steady flow of high-quality, trusted data to accurately automate complex business workflows and support their autonomy.

  3. Ensure Data Quality is in Place: This isn't just about structured data; it includes unstructured data and, critically, the data that agents themselves generate. All of it needs to meet consistent standards for accuracy, lineage, and governance. The challenge of getting access to trusted data is a significant obstacle, and without resolving it, agents cannot perform reliably.

  4. Build an Operating and Governance Model for Agentic AI: This is where the human element comes in. Rethink how work gets done. Human roles will evolve into supervision and orchestration, making governance paramount. It's about establishing clear rules and frameworks for how agents can operate autonomously in a trustworthy and transparent manner, ensuring control as these systems scale.

The Path Forward: Trust, Autonomy, and the Data Dividend

The agentic AI era promises transformative potential, but it's clear the path to unlocking that value isn't through simply deploying more agents. The real differentiator, as McKinsey aptly concludes, lies in having access to high-quality data. Agents will generate enormous amounts of data, making data quality, lineage, and standardization more critical than ever before. As these agentic systems scale, governance becomes the primary lever for control, ensuring trust and preventing chaos.

So, the competitive advantage in the agentic era isn't just about having the most sophisticated AI models or the cleverest agents. It's about the foundational data — its quality, accessibility, and the architecture supporting it. Organizations that get this right won't just avoid a 15% productivity loss; they'll position themselves to truly capitalize on the trillions in AI spending and redefine their operations with truly autonomous, high-value workflows.