The Enterprise AI Gap: The Largest Opportunity in Business AI Is The Mid-Market
By Vaughan | Co-Founder and Chief Product Officer, Datafi
A new McKinsey report delivers a finding that reframes the entire conversation around enterprise AI: fewer than 10 percent of companies that have experimented with agentic AI have successfully scaled it to deliver tangible value. Eight in ten cite data limitations as the primary obstacle. Not model capability. Not budget. Data.
This should not surprise anyone who has spent serious time inside enterprise data environments. The infrastructure required to transform AI from a conversational novelty into an autonomous problem-solving system has never been about what sits at the top of the stack. It has always been about what lies beneath: the quality, context, governance, and architecture of the data that AI must actually act on.
The largest opportunity in business AI is not the defense sector or Fortune 50, it is the vast commercial mid-market that generates mission-critical data at scale every day but has no viable path to transforming that data into autonomous, decision-making AI.
At Datafi, we built for this reality from day one. And the McKinsey findings validate a market thesis we have held since the beginning: the largest opportunity in business AI is not the government intelligence community, not the defense sector, not the geopolitical data infrastructure that Palantir has built a powerful and well-earned business serving. The largest opportunity is the vast commercial economy operating between the Fortune 50 and the small business, spanning industries that generate mission-critical data at scale every day and have no viable path to transforming that data into autonomous, decision-making AI.
That gap is Datafi’s market. And across supply chain, insurance, life sciences, cybersecurity, manufacturing, financial services, and beyond, the conditions that define it are remarkably consistent.
Why Palantir Cannot Serve This Market
Palantir is a serious company doing serious work. Its Gotham and Foundry platforms represent genuine technical achievement built over many years, primarily for customers who operate at the intersection of data, decision-making, and national consequence: defense agencies, intelligence services, and the largest global enterprises with dedicated data engineering organizations, multi-year implementation budgets, and a tolerance for highly customized, services-intensive deployments.
That customer profile is real and valuable. It is also narrow.
The commercial mid-market, growth-stage enterprises, and operationally complex organizations across the industries described below do not fit it. They lack the internal data engineering headcount that Palantir deployments require to succeed. They cannot absorb multi-year implementation timelines. They need AI that acts within their existing operational context, not a platform that requires rebuilding that context from scratch.
These organizations are generating rich, consequential business data every day. Their competitive pressure to operationalize AI is equal to or greater than their larger peers. And the tools available to them fall into two inadequate categories: point solutions that answer questions within a single domain but cannot coordinate across the business, and the prospect of assembling a bespoke agentic stack from disconnected vendors that requires technical talent most of these companies cannot attract or retain.
Neither path leads to transformative outcomes. The McKinsey data confirms the result: two-thirds of enterprises have experimented with agents. Fewer than one in ten have captured real value. The gap between experimentation and scale is almost entirely a data infrastructure problem.
The Data Foundation Problem Across Six Industries
The McKinsey framework identifies seven data architecture principles required for agentic AI to scale: ingesting data as a product, sharing meaning not just data, unifying the analytics and AI data foundation, embedding governance by default, exposing stable interfaces, making behavior measurable, and providing a controlled execution environment for agents. These principles do not vary by industry. But the specific ways that data fragmentation and governance failure manifest does, and understanding those manifestations reveals exactly where Datafi’s Business AI Operating System delivers its most decisive competitive advantage.
Supply Chain and Logistics
Supply chain operations sit at the intersection of structured transactional data, unstructured supplier communications, real-time sensor and logistics feeds, and probabilistic demand signals. The data is abundant. The problem is that it lives in siloes across ERP systems, third-party logistics platforms, supplier portals, and contract management tools, each with its own schema, update cadence, and access model.
An AI agent tasked with identifying and responding to a supply disruption cannot function if it cannot simultaneously read inventory positions, supplier lead times, freight capacity, and customer commitment data in a coordinated, contextually consistent way. Current deployments fail here not because the underlying models are insufficient, but because the data architecture cannot support the cross-system context that autonomous action requires.
Datafi’s vertically integrated stack eliminates the integration gaps that cause this failure. When context travels with data through a unified semantic layer, agents can execute supply chain interventions, not just report on supply chain conditions. The difference between alerting a procurement manager to a risk and actually recalibrating a purchase order is the difference between AI that informs and AI that solves.
Insurance
Insurance is among the highest-value targets for agentic AI and among the most structurally challenging. Underwriting, claims processing, fraud detection, and customer service each require reasoning over data that spans structured policy records, unstructured claims documents, external market feeds, regulatory databases, and historical loss patterns. The data relationships are complex, the governance requirements are strict, and the consequences of acting on incorrect or incomplete information are financial and regulatory.
Point-solution AI has made inroads in narrow insurance tasks: document extraction, first-notice-of-loss processing, simple fraud flag generation. But these remain isolated capabilities that require human coordination to stitch into coherent workflows. The opportunity that remains uncaptured is the end-to-end automation of complex insurance workflows, where agents can move a claim from intake through investigation, validation, and settlement decision with appropriate human review checkpoints, while maintaining the lineage and auditability that regulators require.
That capability requires exactly the governance architecture Datafi embeds by default: data access controls that enforce role-based permissions at the agent level, lineage tracking that records every data point an agent used in reaching a decision, and a semantic layer that ensures a “loss event” means the same thing to every agent and application operating across the platform.
Life Sciences and Pharma
Life sciences companies operate in a data environment of extraordinary complexity and consequence. Clinical trial data, real-world evidence, regulatory submissions, safety monitoring signals, and commercial performance data must be integrated across systems that are frequently acquired through M&A and therefore architecturally inconsistent. The regulatory stakes of acting on incorrect or miscontextualized data are severe, which has historically made this sector cautious about AI adoption.
That caution has a cost. The pipeline of insights that could accelerate drug development decisions, improve pharmacovigilance, and optimize commercial launch strategies is largely inaccessible to the humans who need it because it requires coordinating across data systems that were never designed to interoperate.
Agentic AI built on a properly governed data foundation changes this calculus. Agents that can monitor safety signals across clinical and real-world data simultaneously, flag anomalies for medical review with complete data provenance, and coordinate regulatory documentation workflows represent a genuine acceleration of processes that currently take months of manual effort. The prerequisite is the same one McKinsey identifies: a data architecture that enforces governance automatically, preserves lineage through every transformation, and allows agents to operate on data they are authorized to access without requiring manual access management at each workflow step.
Cybersecurity
The irony of cybersecurity as an AI use case is that security operations centers are already drowning in data and already using machine learning to process it. The problem is that the signal-to-noise ratio in security telemetry is catastrophically low, the correlation of threat indicators across different data sources is still largely manual, and the time between detection and response remains too long at most organizations to prevent damage from fast-moving threats.
Agentic AI in cybersecurity is not a future concept. It is an urgent operational requirement. Agents that can correlate indicators of compromise across network traffic logs, endpoint data, identity and access records, and threat intelligence feeds, then autonomously trigger containment actions within pre-approved response playbooks, reduce mean time to response from hours to minutes.
What makes this hard is not the AI. It is the data. Security telemetry lives in SIEM platforms, endpoint detection tools, identity providers, and cloud logging services, each with different schemas, different latencies, and different access models. Federating this data into a coherent, queryable, governable foundation that agents can operate against requires exactly the kind of unified data architecture that Datafi delivers. And because security response actions have irreversible consequences, the governance layer that controls what agents can do, and records what they did, is not optional. It is the prerequisite for deploying autonomous response capability at all.
Manufacturing and Industrial Operations
Manufacturing presents one of the clearest cases for agentic AI and one of the most persistent failures to deliver it. The use cases are well-understood: predictive maintenance that prevents unplanned downtime, quality control automation that catches defects before they propagate through production, energy optimization that adjusts plant operations in real time, and production scheduling that responds dynamically to demand changes and supply constraints.
The data required to power these use cases exists. Every modern production facility generates operational technology data from sensors and control systems, information technology data from ERP and MES platforms, and external data from supplier and logistics systems. The barrier is not data availability. It is data readiness.
Operational technology data is frequently unstructured, inconsistently timestamped, and generated by legacy systems that were never designed to interface with modern AI platforms. Bridging OT and IT data environments into a unified foundation that AI agents can act on requires a platform that handles structured and unstructured data with equal rigor, enforces consistent governance across both, and maintains the real-time quality standards that autonomous production decisions demand. This is precisely the architecture Datafi was built to deliver.
Financial Services
Commercial banking, wealth management, and specialty finance outside the top-tier institutions face a version of the same structural challenge. Data lives across core banking systems, CRM platforms, market data feeds, regulatory reporting tools, and client communication archives. The workflows that would benefit most from AI autonomy, including credit analysis, client reporting, compliance monitoring, and portfolio risk assessment, all require coordinating across these systems with consistency and accountability.
Regulatory pressure in financial services also makes the governance requirements non-negotiable. Any AI agent operating on financial data must operate within documented, auditable boundaries. Every decision must be traceable to the data that informed it. This is not a compliance checkbox; it is a fundamental design requirement that most AI platforms treat as an afterthought and that Datafi treats as a core architectural principle.
What Distinguishes Datafi Across All of These Markets
The thread connecting supply chain, insurance, life sciences, cybersecurity, manufacturing, and financial services is not industry-specific. It is structural. Every one of these sectors has:
Complex, multi-system data environments that are not naturally interoperable. Governance and auditability requirements that make casual AI deployment unacceptable. Operational workflows where the difference between AI that informs and AI that acts is measured in real business outcomes. And an absence of a purpose-built platform that delivers the full stack required for agentic AI at scale without demanding a team of data engineers to build and maintain it.
Datafi’s Business AI Operating System addresses all four conditions. The vertically integrated stack ensures that context, governance, lineage, and quality travel together through every layer, from data ingestion through semantic modeling to agentic execution. The Chat UI designed for business users means that the value of the platform is accessible to the people closest to the operational problems it is solving, not just to the technical teams who configured it. And the unified architecture eliminates the integration gaps where value consistently disappears in point-solution stacks.
Data foundations increasingly define competitive positioning in the agentic age. That is not a forecast. It is a description of what is already happening in the markets Datafi serves.
The Opportunity Ahead
The commercial middle of the global economy is not a niche. It is the majority of business activity, employment, and economic value creation outside the largest enterprise tier. It spans the industries described above and dozens more: healthcare operations, media and publishing, logistics and freight, specialty retail, energy services, professional services at scale.
What these organizations share is not a sector. It is a position: sophisticated enough to generate the data that agentic AI requires, operationally complex enough to benefit enormously from autonomous workflow execution, and underserved by every platform currently competing for the enterprise AI budget.
Palantir owns its market. The intelligence community, the defense sector, the Fortune 50 with dedicated data engineering teams. That market is real and defensible.
The market Datafi is building for is larger, faster-growing, and structurally unaddressed by any incumbent. The data foundation that makes agentic AI work at scale is not just a technical requirement. It is the competitive moat. And for the enterprises across these six industries and beyond, Datafi is that foundation, purpose-built for the moment we are in.
Datafi is the Business AI Operating System for commercial enterprises. Our vertically integrated data and AI platform enables organizations across supply chain, insurance, life sciences, cybersecurity, manufacturing, financial services, and beyond to deploy agentic AI that solves business problems autonomously, with full governance, context, and control. Learn more at datafi.us

