We might as well be honest and predict that enterprises are going to develop all sorts of metrics that purportedly show the positive impact of their artificial intelligence investments, but that the metrics will quite probably be proxies that measure all sorts of things other than direct AI impact.
Still, some common metrics are a starting point:
Cost per unit of output — Does AI reduce the labor or compute cost to produce a document, resolve a ticket, process a claim, underwrite a loan?
Throughput / cycle time — How many units processed per hour, or how much time shaved off a workflow (e.g., code review, contract drafting, customer onboarding)?
Error rates and rework costs — Does AI reduce defect rates, compliance exceptions, or manual correction loops?
Headcount avoidance — the ability to scale output without proportional headcount growth. Often measured as "FTE equivalents automated."
Revenue-side metrics are less common, but might include:
Conversion lift — Does AI-personalized outreach or recommendation improve sales conversion rates?
Revenue per sales rep — If AI handles pipeline qualification or proposal drafting, does rep productivity improve?
Customer retention / churn reduction — Does AI-assisted support or proactive intervention improve net revenue retention?
Time-to-market — Does AI-accelerated R&D or software development compress product cycles in ways that generate earlier revenue?
Other operational outcomes also sometimes are quantified:
Accuracy or precision rates on specific tasks (e.g., document classification, anomaly detection in fraud)
Audit findings or compliance exceptions reduced
Model risk KPIs — false positive/negative rates in detection systems
Employee time recaptured — hours per week freed from low-value tasks, redirected to higher-value work
Employee satisfaction / retention — particularly in roles prone to burnout from repetitive work
Decision quality — harder to measure, but some firms track downstream outcomes of AI-assisted decisions against historical baselines
As rational as all that sounds, the metrics are “soft.” The attribution problem is severe, as AI is almost never the sole variable changing in a deployment.
AI might be deployed while other changes also are occurring:
Process redesign — Most AI deployments force workflow reengineering. Efficiency gains may be 60% process change and 40% AI
Training and change management — The same tool deployed with weak adoption programs vs. strong ones produces dramatically different outcomes
Data quality improvements — Organizations often clean and structure data as a precondition to AI deployment; that alone drives gains
Personnel changes — New hires, role restructuring, or management changes co-occur with AI rollouts
Macroeconomic or market tailwinds — Revenue gains during an AI deployment may reflect market growth, not AI impact.
Hawthorne effects — Measuring a team's performance changes behavior regardless of the tool.
The point is that it can be almost impossible to isolate the impact of AI cleanly. So most enterprise AI ROI figures are really "ROI of the initiative that included AI," not AI's marginal contribution.
The more interesting question might be "which specific processes have changed in ways we can measure, and do we understand why?"
Skeptics are correct to argue that attributing success purely to AI is often an oversimplification. But enterprises will have to try and do so, as investors will demand such “proof.”
So firms will supply such “proof” as best they can, even if the outcomes are not, strictly speaking, solely because of AI use.
And that is not an unusual case.
Research highlights that AI’s impact is heavily moderated by "complementary assets.” In other words, a firm’s organizational structure, existing data quality and worker skill levels often do more to determine the outcome than the AI model itself.
The difficulty in quantifying the immediate return on investment for new technologies is a recurring theme in economic history.
During the 1970s and 1980s, despite massive corporate investment in information and communications technology, overall productivity growth in many industrialized nations remained stagnant. This led economists to question whether computers were truly providing the expected value.
Eventually, results were observed, but:
Results lagged deployment: it took decades for firms to fully "reimagine" their organizational structures, business models, and workflows to leverage the new technology effective
Value was indirect: better management, more efficient coordination or improved service quality, but correlation, not causation, remains a question.
The measurable financial benefits of a transformative technology often became clear only after business processes were redesigned.
Still, in the meantime, we will see all sorts of metrics “demonstrating” AI impact. Enterprises making the investments have no choice but to try to do so, even if those metrics are “soft.”