The Agent Economy

Arlo - Vol. 1, No. 1 - The Agent Economy

The Brief

The Agent Economy: The Productivity Gains Are Real. The Economics Are Broken. And the Window to Fix This Is Closing.

Agentic AI has a deployment problem - and it is not the technology. 62% of enterprises are experimenting with AI agents. Only 23% are scaling in any function. No more than 10% are scaling in any single business function. (McKinsey, State of AI in 2025, 1,993 organizations, 105 countries.) The gap between experimentation and production is not a capability gap - it is a cost architecture gap, a governance gap, and an organizational readiness gap. Organizations that mistake it for the former will spend the next two years in pilot purgatory while a small cohort builds an insurmountable lead.

The infrastructure bill is arriving - and most organizations did not budget for it. 44% of organizations now allocate 76-100% of their AI budget to inference: not model training, not acquisition, just running the AI they already have at the scale they already committed to. [Vendor disclosure: DigitalOcean, Currents February 2026, 1,100+ developers, CTOs, founders.] 49% cite inference costs as their single biggest barrier to scaling - above reliability, above integration complexity. The organizations discovering this after committing to production are having very different CFO conversations than the ones who modeled it before.

The governance gap is not a future risk - it is a present liability. Only 21% of organizations have mature governance models for autonomous agents (Deloitte, State of AI in the Enterprise 2026, 3,600+ global leaders). 67% of decision-makers approve AI deployments despite known security concerns - driven by competitive pressure. Singapore published the world's first national agentic AI governance framework in January 2026. OWASP published its Top 10 for Agentic AI the same year. The regulatory infrastructure is being built around organizations that have not yet built their own. That is not a comfortable position.

The ROI exists - but only in a narrow band of use cases, and the economics are specific. Customer service agents deliver 85-90% lower cost per interaction versus human handling. Supply chain predictive maintenance returns 10-30x ROI. These numbers are real. What makes them real is also what limits them: high transaction volume, narrow task definition, easily quantifiable counterfactual. Organizations deploying agents outside this band - in knowledge work, strategic research, creative production - are generating productivity gains they cannot yet translate to a defensible ROI case. That is not nothing. But it is also not the same conversation.

More than 40% of agentic AI projects are forecast to fail by 2027 - and the failure pattern is consistent. Gartner's projection (cited in Deloitte Tech Trends 2026) identifies three root causes: legacy systems without real-time APIs, data architectures that predate agent requirements, and process automation layered onto workflows that were never redesigned for agents. Deloitte calls this the "agentic reality gap." The organizations that fail will not fail because the technology let them down. They will fail because they deployed into organizational infrastructure that was never built to support them.

The Reality Check

The productivity gains from AI agents are real. The economics are not solved. The organizations treating those as the same statement are about to have a very expensive year - because "measurable productivity gains" and "defensible ROI" are two different conversations, and only one of them keeps the program funded past Q3.

The Signal

The Gap Is Not Between Early Adopters and Late Movers. It Is Between Organizations That Built Infrastructure and Organizations That Built Demos.

Here is the uncomfortable read of this week's data: the organizations currently winning on agentic AI did not win because they moved faster. They won because they did something slower - they built cost architecture, governance frameworks, and workflow redesign before they scaled. The organizations losing are not losing because they lack ambition. They are losing because they scaled deployment before they built the infrastructure to make deployment defensible.

McKinsey's high-performer cohort makes this concrete. The 6% of organizations currently extracting 5% or more of EBIT from AI are three times more likely to be running agents at scale. 55% of them fundamentally redesign workflows before deploying agents - versus 20% of everyone else. They are 3.6 times more likely to achieve what McKinsey characterizes as transformative change. The gap between this cohort and the field is not a technology gap. It is a methodology gap. And it is compounding every quarter.

The inference cost problem is where this becomes a CFO conversation. 44% of organizations are running 76-100% of their AI budget through inference - the operating cost of AI, not the capital investment. [Vendor disclosure: DigitalOcean.] That number has no precedent in how most organizations budgeted for AI. When a program moves from pilot to production, inference costs scale linearly with usage. A pilot running 1,000 queries a week becomes a production system running 50,000 queries a week - at 50 times the cost, on the same per-query economics. The organizations who modeled this before they scaled are managing it. The ones who discovered it after are calling it a crisis.

The governance problem is where this becomes a board conversation. Agents are not software in the conventional sense - they have identities, they access systems, they make decisions and execute actions autonomously. 71% of deployed AI agents lack proper oversight, according to Gravitee.io's 2026 governance research. Shadow agents - deployed outside IT visibility by individual business units chasing productivity - are proliferating. The organizations that approved deployments despite known security concerns did so because the competitive pressure felt more immediate than the governance risk. It was not.

Who is winning right now: organizations in McKinsey's high-performer cohort that built measurement, FinOps, and governance infrastructure before scaling. They are currently seeing 10%+ revenue uplift and have the cost architecture to defend it. Who is losing: organizations in the 77% that have agents in pilot or strategy mode - not because the technology isn't ready, but because their organizational infrastructure isn't. The window to close that gap is two quarters. After that, the compounding works against them.

The Deep Dive

The Inference Cost Reckoning: Why the Organizations Winning on Agentic AI Are the Ones Who Solved Their Economics Before They Scaled

Thesis: The organizations failing to scale agentic AI are not failing because the technology doesn't work. They are failing because they deployed agents without understanding inference economics - and they are now watching AI budgets consumed by infrastructure costs that produce no boardroom-defensible ROI.

What inference cost actually is - and why it surprises everyone

Inference is the cost of running a trained AI model on a task. Not building it. Not acquiring it. Running it. Every query an agent processes, every decision it makes, every action it executes - each one is an inference operation with a real compute cost: tokens processed, GPU cycles consumed, API calls made.

In the pilot phase, this cost is invisible. An agent running 50 test queries a day is a rounding error on the cloud bill. The problem is that pilot economics are not production economics, and the relationship between the two is not intuitive until you have already committed. Consider a customer service agent pilot: 1,000 queries per week, 65% autonomous resolution rate, cost per query at modest pilot pricing. Leadership approves scaling. Production: 50,000 queries per week. Same cost per query. Total cost: 50 times larger. If the organization had not modeled this before committing, the gap between those two numbers is a crisis - and nearly half of organizations (49%) report encountering exactly this wall. [Vendor disclosure: DigitalOcean, Currents February 2026.]

What practitioners running these programs consistently report: the inference cost surprise is the most common reason agent programs stall between pilot success and production scale. Not technical failure. Not organizational resistance. The bill.

The FinOps gap: why most organizations have no infrastructure for this problem

Deloitte's Tech Trends 2026 identifies "scalability and cost overruns without FinOps" as one of three core failure modes for agentic AI. AI programs start in innovation budgets, where nobody owns inference cost as a financial line item until it becomes large enough to attract attention - which usually happens between month two and month four of production. By then, the organizational structures for managing it don't exist. Building them reactively is slower and more expensive than building them before deployment.

The organizations that solved this built FinOps infrastructure before they scaled. They optimize at three levels: model selection (routing simple tasks to smaller, cheaper models - the single highest-leverage optimization available today, and most organizations are not doing it systematically), infrastructure (batching non-time-sensitive operations, using reserved capacity for predictable workloads), and architecture (caching frequent queries, reducing redundant agent calls, minimizing token consumption).

Where the ROI works - and why

Customer service agents deliver 85-90% lower cost per interaction versus human handling, with documented 65% autonomous resolution rates and 47% faster resolution times. (Deloitte, State of AI in the Enterprise 2026; Gartner, 2026 enterprise AI analysis.) Three reasons: volume is high enough that inference costs are dwarfed by labor cost replaced; the task is narrow enough that accuracy is consistently maintainable; and the counterfactual - human handling - is precisely quantifiable, making ROI defensible to a CFO. Supply chain predictive maintenance shows 10-30x ROI and 18-25% lower costs versus preventive maintenance. Toyota and Dell have documented outcomes in these areas. (Deloitte, Tech Trends 2026.)

The honest read on everything outside these anchors: productivity gains without a quantifiable counterfactual cannot yet be translated into a defensible ROI case. That is not permanent. But it should be made consciously.

The failure mode: what getting this wrong looks like

Gartner's 40%+ failure projection is not a technology forecast. It is an organizational one. The pattern across organizations that have stalled is consistent: agents deployed into workflows never redesigned for agents, inference costs exceeding the business case at scale, governance gaps creating security or compliance exposure, and no measurement infrastructure to defend the program at budget time. The program does not fail dramatically. It fails quietly, at budget time, when nobody can answer the CFO's question about what the program produced.

The three questions that separate defensible programs from those heading toward failure: What is our cost per agent operation at production scale, and does the business case hold? What governance infrastructure is in place before we scale, not after? Are we redesigning workflows for agents, or automating workflows built for humans?

The Playbook

For the C-Suite (CEO / COO / CFO)

Before approving any agent program for production, require a production cost model - not a pilot cost. Ask the team to model per-operation inference costs at 2x, 5x, and 10x current pilot volume and show you the business case at each level. If the model does not exist, the program is not ready to scale. Organizations that skip this step are the ones having unplanned CFO conversations six months later.
Establish AI FinOps ownership now, before it becomes a crisis. Assign a specific owner responsible for understanding, attributing, and optimizing AI infrastructure costs in real time. If nobody owns this, the cost surprises that nearly half of organizations are experiencing will arrive on your P&L without warning. This is a governance decision, not a technical one.
Ask your governance question directly, and expect a specific answer. "Do we have a mature governance model for our AI agents?" If the answer is not a confident yes with specifics, the organization is in the 79% without one - a risk profile that boards are beginning to ask about and regulators are beginning to address.

For CMOs and Marketing VPs

Map every agent use case against the proven ROI anchors before deploying. Customer service and supply chain have documented economics and quantifiable counterfactuals. Marketing use cases have productivity gains but largely unproven ROI at scale. Build the measurement framework that connects agent activity to pipeline and revenue outcomes before you deploy - agents compound the measurement problem most organizations already have.
Run the inference math on whatever you have in production right now. Find out your cost per agent operation. Calculate what it looks like at 2x and 5x volume - because that is the decision you will face in the next budget cycle. If the business case breaks at scale, fix it now. The CFO will ask this question eventually.
Build governance into your agent program before it becomes visible to regulators or your legal team. Marketing functions handle customer data subject to regulatory oversight in multiple jurisdictions. Agents operating without defined failure modes, escalation paths, and audit trails create liability exposure most marketing organizations are not currently scoping.

For Department Leads and AI Initiative Owners

Do not automate your current process. Redesign it. The single most consistent predictor of agentic AI project failure is automating an existing workflow rather than redesigning it for agent capabilities. High performers are 2.75x more likely to fundamentally redesign workflows before deploying. Ask: if we built this process from scratch knowing we had agents available, what would it look like? The answer is almost always different - and that difference is where the ROI lives.
Define your agent's failure modes before you go live. What happens when the agent is wrong? What is the escalation path when a decision exceeds its authority? Organizations that answer these questions before deployment have governance. Organizations that answer them after an incident have an incident report. The OWASP Top 10 for Agentic AI is a free, specific, published starting checklist.
Run a shadow agent audit before your next budget cycle. In most organizations, agents have been deployed outside IT visibility by team members chasing productivity. Find them now. Shadow agents create security exposure, data access issues, and accountability gaps that will surface eventually. Surfacing them proactively puts you in control of the conversation.

The Numbers

62%	of enterprises are experimenting with AI agents. Only 23% are scaling in any function. No more than 10% in any single function.McKinsey, State of AI in 2025, November 2025 (1,993 organizations, 105 countries)
44%	of organizations allocate 76-100% of their AI budget to inference costs - not training, not acquisition. Running the AI they already have is now the dominant expense.DigitalOcean, Currents February 2026 - vendor-produced (1,100+ developers, CTOs, founders)
49%	cite inference costs as their #1 barrier to scaling AI - above reliability (41%) and integration complexity (31%).DigitalOcean, Currents February 2026 - vendor-produced
21%	of organizations have mature governance models for autonomous agents. 67% approved AI deployments despite known security concerns.Deloitte, State of AI in the Enterprise 2026 (3,600+ global leaders)
85-90%	lower cost per interaction for customer service AI agents vs. human handling. 65% autonomous resolution rates. 47% faster resolution times.Deloitte, State of AI in the Enterprise 2026; Gartner, 2026 enterprise AI analysis
10-30x	ROI on AI-powered predictive maintenance. 18-25% lower costs vs. preventive maintenance schedules.Multiple independent sources
55%	of supply chain leaders expect agentic AI to reduce entry-level hiring needs within their organizations.Gartner, Supply Chain Leaders Survey, February 2026
>40%	of agentic AI projects forecast to fail by 2027 - due to legacy infrastructure constraints, data architecture gaps, and failure to redesign workflows before deployment.Gartner forecast, cited in Deloitte Tech Trends 2026
3x	more likely to scale agents at scale - the ratio between McKinsey's high performers (6% of organizations, 5%+ EBIT from AI) and the rest of the market.McKinsey, State of AI in 2025, November 2025

The governance gap and the inference cost problem are the same problem seen from two angles: both are the result of organizations deploying agents before building the infrastructure to run them responsibly at scale. The organizations closing that gap now will have defensible programs in 2027. The ones that don't will be explaining the bill.

What's Next

Next issue - The Workforce Reckoning: McKinsey's high-performer data surfaces a pattern most organizations are getting exactly backwards. 32% of organizations expect headcount reductions linked to AI in the next year. But high performers are not reducing headcount - they are redeploying it, and their revenue numbers reflect that choice. The distinction between AI as a cost-reduction tool and AI as a capability-expansion tool determines whether your AI program produces a competitive advantage or a workforce crisis. We will show you what the playbook looks like from the organizations currently getting it right.

One thing to watch before next Tuesday: CoreWeave's infrastructure agreements with Meta ($21B) and Anthropic (multi-year Claude commitment) are worth tracking closely. When compute at this scale gets locked into preferential arrangements, organizations without established infrastructure relationships become price-takers. Watch for follow-on announcements from AWS, Google Cloud, or Azure. The compute layer is the economic foundation of every agent deployment. Who controls it is not an abstract question.

M&A + Corporate Moves

OpenAI acquires TBPN (April 2, 2026) - First media acquisition for OpenAI. A tech news podcast network. The move into content ownership raises questions about editorial independence in AI coverage worth watching.

Nebius-AI21 Labs merger - $2B+ vertical integration play with Nvidia as key investor. Enterprise infrastructure consolidation is accelerating faster than most procurement teams have accounted for.

Meta-Nebius Group - $27B comprehensive AI infrastructure agreement beginning 2027. The scale signals where enterprise AI economics are heading.

New Tools Worth Knowing

Nvidia Agent Toolkit (April 3, 2026) - Open-source enterprise agent platform with Adobe, Salesforce, and SAP integrations. The AI-Q blueprint specifically documents 50%+ query cost reduction - directly relevant to the inference cost problem this issue covers.

Microsoft Agent Governance Toolkit (April 3, 2026) - Free open-source toolkit with policy engine, cryptographic agent IDs, EU AI Act/HIPAA/SOC2 compliance. The most practical starting point for organizations that need governance infrastructure and don't have it.

Events on the Radar

Okta Showcase 2026 (April 30) - AI identity and access management. Directly relevant to the agent governance gap covered in this issue.

Sources: McKinsey, State of AI in 2025 (Nov 2025) · McKinsey, State of AI Trust in 2026 · Deloitte, State of AI in the Enterprise 2026 · Deloitte Tech Trends 2026 · Gartner Supply Chain Leaders Survey (Feb 2026) · DigitalOcean Currents Feb 2026 (vendor-produced) · Singapore IMDA, Model Agentic AI Framework (Jan 2026) · OWASP Top 10 for Agentic AI (2026) · PwC AI Predictions 2026

Produced with AI assistance and human editorial review. Vol. 1, No. 1 · April 2026 · Confidential - Subscriber Use Only

Unsubscribe · Manage preferences

The Agent Economy

Recommended for you

Explore

Your Account

Socials