The Vertical AI Credibility Gap

Jun 16

Written By Chris C

Why domain-specific agents could be the only part of today's AI economy actually working

The global AI economy has a credibility problem that is not evenly distributed. ~95% of companies that invested in generative AI in 2025 saw no measurable financial return. ~42% percent abandoned most AI initiatives. Copilot has a 5% conversion rate from a 400-million-user Office base after more than a year of availability. These numbers, from MIT, BCG, and Microsoft’s own disclosures, describe the horizontal AI layer, the general-purpose agentic tools that were supposed to transform enterprise workflows in 2024 and 2025. They are failing that thesis, and the market is beginning to price the consequences.

But these same numbers obscure a different story running underneath. In healthcare, legal, and financial services, vertical AI agents are delivering 3:1 to 5:1 ROI with documented business outcomes, 12-to-14-month payback periods, and revenue growth rates that defy the broader AI commercialization crisis. The market is treating “AI” as a homogeneous bucket. The commercial evidence is almost entirely concentrated in vertical, domain-specific agents, and the structural reasons why explain why that gap will widen before it closes.

Microsoft vs. Harvey

In the same week that Microsoft’s stock closed down ~16% year-to-date from an October 2025 high despite $37 billion in annualized AI revenue, Harvey AI announced it had crossed $200 million in annualized revenue at an $11 billion valuation. Harvey’s $200 million ARR is impressive because it represents a legal AI platform, contract review, due diligence, compliance, litigation support, that is generating real commercial revenue in an industry where 95% of AI investments are supposed to be producing nothing.

The contrast is structural. Harvey operates in legal; it was trained on legal documents, court filings, contract structures, and case law. It embeds compliance requirements natively. Its workflow is narrow, contract review, not general document processing, and that narrowness is the feature, not the limitation. Every AI investment thesis that predicted horizontal general-purpose tools would dominate enterprise adoption has been wrong in the same way: the technology works, the integration into specific workflows does not. Harvey and its vertical peers are succeeding precisely because they rejected the horizontal thesis and built for one domain at a time.

Enterprise AI Challenges are Vertical vs. Horizontal

The MIT Sloan finding that 95% of companies saw no measurable financial return from generative AI in 2025 is real and important. But it requires disaggregation to be useful. The 95% failure rate describes companies that deployed horizontal AI tools, Copilot, general-purpose chatbots, document summarizers, against workflows that are inherently domain-specific and for which general-purpose AI is structurally unsuited. A finance department trying to use ChatGPT to process an invoice is failing at horizontal AI deployment in a vertical domain.

The disaggregated data tells a different story. In healthcare, AI agents are delivering $3.20 in return for every $1 invested, with a typical payback period of 14 months. Administrative workload reductions of 30-40% in billing, documentation, and claims processing are documented and measurable. Revenue cycle management AI, claims processing, denial management, eligibility verification, is producing 40% reductions in billing errors and denials at provider systems that have deployed it. In legal, contract review AI is reducing drafting time by ~50-75% per matter. Harvey AI’s customer base is meaningful, with a substantial enterprise customer roster paying meaningful subscription fees for a product that demonstrably reduces billable hour requirements.

In financial services, the numbers are even more striking. Gartner data from Q1 2026 shows that 44% of finance teams are now utilizing agentic AI, a 600% increase from prior years. Average ROI for AI agents deeply embedded in financial processes is running at 80%, with a quarter of finance leaders reporting ROI exceeding 101%. Fraud detection AI agents in financial services are delivering some of the fastest payback timelines of any industry with ~8 months. Accounts payable automation, scanning invoices, matching to purchase orders, flagging mismatches, triggering approvals, is generating measurable efficiency gains that are traceable directly to P&L lines. McKinsey’s projection that generative AI could add $200 billion to $340 billion annually to global banking is grounded in these specific workflow-level deployments instead of generalized AI adoption.

Why Vertical Agents Work Today Where Horizontal Agents Fail

The structural explanation for why vertical agents generate ROI while horizontal agents do not is not complicated, but it is consistently underestimated by investors who apply a SaaS valuation framework to what is actually a workflow automation business with domain-specific data moats.

Domain-Specific Data Builds an Advantage: A legal AI agent trained on 50 million court filings, contracts, and regulatory documents produces outputs that a general-purpose model simply cannot match on legal workflow tasks, the general model was not trained on the specific distribution of legal language that makes contract review accurate and defensible. The legal domain has a specific vocabulary, a specific document structure, and specific compliance requirements that are not captured in general training data regardless of scale. The same logic applies to medical coding, financial compliance, and regulatory reporting. The vertical agent’s accuracy advantage is a consequence of the training data being the workflow.
Workflow Embedding Drives Adoption: When AI is embedded into an existing workflow as a bolt-on layer, as Copilot is layered onto Office, it requires the user to learn a new behavior without removing an old one. The user still opens Word, still drafts the contract, and then is asked to use Copilot to review it. This is additive friction. When AI replaces the workflow itself, as vertical agents do, the user adopts the AI as the workflow. There is no additive behavior required. The AI becomes the contract review. It becomes the claims processing. It becomes the fraud detection. Adoption follows utility rather than requiring training campaigns and change management programs.
Compliance Embedding Removes Governance Risk: Enterprise AI adoption faces a consistent governance bottleneck: legal, security, and compliance teams cannot approve tools whose outputs they cannot audit or explain. A horizontal AI tool that processes contracts produces outputs that the legal team cannot verify against their domain knowledge, creating a compliance risk that requires human review of AI outputs, which negates the efficiency gain. A vertical legal AI tool whose compliance logic is embedded, auditable, and documentable can be approved because its decision process is traceable to domain rules. The same dynamic applies to HIPAA compliance in healthcare and AML/KYC compliance in financial services. The vertical agent’s compliance advantage is not peripheral. It is the adoption enabler.
Outcome-based Pricing Aligns Incentives: The failure of “unlimited” AI pricing, where companies watched their inference bills balloon while productivity gains remained unmeasured, has driven a shift toward outcome-based pricing. Healthcare agents are priced on claims processed or errors reduced. Legal examples of vertical AI is priced on matters reviewed or contract cycles shortened. Financial services use cases is priced on fraud caught or compliance violations remediated. This pricing structure does two things: it gives the buyer a measurable ROI denominator (what did we pay vs. what did we save?), and it forces the vendor to build for outcomes rather than engagement.

Who Is Winning in Vertical AI and How Players Differentiate

The vertical AI landscape in 2026 is stratified into three tiers, each with a different competitive position and business model logic.

Tier 1: Vertical-Domain AI-Native Companies With Proven ARR

Harvey AI is the clearest example in legal. The company reported annualized revenue above $200 million. Its product covers contract analysis, due diligence, compliance, litigation, and legal research, all domains where legal-specific training data creates genuine accuracy advantages over general-purpose models. Harvey’s customers are law firms and in-house legal departments, and its revenue growth reflects contract value instead of seat count. The company is at the stage where its primary competitive threat is not another AI company but the legal software incumbents themselves.

In healthcare, Ambience Healthcare ($111 million raised across four rounds) has deployed its AutoScribe real-time AI medical scribe at health systems including Houston Methodist. OpenEvidence, which provides an AI copilot for healthcare providers, recently launched Coding Intelligence for automated medical coding and billing, and doubled its valuation to $12 billion after a recent funding round, a signal that the market believes healthcare AI has a durable commercial category. AKASA focuses on AI-driven revenue cycle management, automating coding, claims follow-up, and prior authorization, a workflow where the billing error reduction data is documented and the pricing can be tied directly to error rates. Suki offers voice-activated AI assistants for clinicians to streamline EHR interactions, directly addressing the administrative burden that drives clinician burnout and turnover.

Tier 2: Incumbents Building Vertical AI Capabilities Into Existing Platforms

The second tier is the most underappreciated competitive threat to pure-play vertical AI agents. Epic Systems, the dominant EHR platform in US healthcare, has been building AI-assisted clinical documentation and coding capabilities that directly compete with Ambience Healthcare and Suki. Clio, with its $850 million war chest and dominant position in legal practice management, is integrating AI contract review and legal research capabilities that could commoditize stand-alone legal AI tools. Salesforce’s Agentforce platform and ServiceNow’s AI agent products are building workflow automation for financial services and IT operations that compete with point solutions in those verticals.

The incumbent threat is structurally real but slower-moving than pure-play vertical AI investors expect. Epic’s EHR integration is a strength in clinical settings but produces a platform that must serve all specialties, limiting its depth in any one workflow. Clio’s legal practice management dominance does not automatically confer AI accuracy in contract review, which requires specific training data investments. The incumbents are real competitors but they are competing from a platform position, not a domain expertise position, and domain expertise is the moat that the data creates.

Tier 3: Horizontal Agent Platforms With Vertical Ambitions

Salesforce Agentforce, Microsoft Copilot, and ServiceNow AI agents are the horizontal platforms attempting to move down-stack into vertical workflows. They have distribution advantages, existing customer relationships, existing workflow integrations, existing procurement relationships, but they face the same structural problem that horizontal AI faces everywhere: the accuracy, compliance, and workflow integration required to generate documented ROI in a specific domain requires domain-specific investment that a horizontal platform is not optimized to make. Microsoft Copilot’s 5% conversion rate from 400 million Office users is a precise demonstration of this dynamic at scale. The horizontal platform can be embedded everywhere. It cannot deliver ROI anywhere specific without vertical-specific investment, and that investment is exactly what the vertical AI pure-plays have already made.

Why the Economics are Defensible

Vertical AI agents are generating commercial returns at rates that are 3x to 5x the enterprise AI average for a reason that goes beyond workflow specificity. The business model economics of vertical AI are structurally different from horizontal AI in ways that create defensibility and pricing power.

Gross margins on vertical AI services are higher and more stable than horizontal AI

Horizontal AI gross margins are approximately 52%, dragged down by the variable cost of inference scaling with usage in ways that seat-based SaaS pricing does not. Vertical AI agents that are priced on outcomes or on subscription contracts with defined scopes have better margin structure because the scope of the workflow is defined. A contract review agent that processes a defined number of matters per month has a defined compute cost. A general-purpose AI tool whose users generate unpredictable token volumes has an unpredictable cost structure. The margin difference shows up in the financials of vertical AI companies that have reached scale.

Domain-specific data creates a compounding moat

Every contract reviewed, every claim processed, every medical record coded by a vertical AI agent generates data that improves the model’s accuracy in that specific workflow. This creates a compounding advantage: the vertical agent with more deployments has better training data than the vertical agent with fewer deployments, which means its accuracy advantage grows with scale. Horizontal AI models do not benefit from this compounding effect in the same way, because their training data distribution is already comprehensive for general language tasks. The vertical data moat is not permanent, it can be competed away over time, but it is durable at current scale because the domain-specific data requirement creates a barrier to entry that general-purpose models cannot cross regardless of their capabilities.

The buyer decision chain is shorter and more tractable.

A horizontal AI tool purchase in a large enterprise requires IT approval, security review, legal compliance sign-off, and end-user training, a multi-stakeholder process that adds six to eighteen months to the sales cycle. A vertical AI tool purchase that replaces a specific workflow in a specific department can be approved at the department level, particularly when the pricing is tied to measurable outcomes. Accounts payable automation can be approved by the CFO because the ROI is calculable in dollars saved. Legal contract review AI can be approved by the General Counsel because the workflow outcome is auditable. This shorter decision chain is why vertical AI sales cycles are shorter and why vertical AI companies can reach meaningful ARR faster than horizontal AI companies that are competing for enterprise-wide deployment.

The Risks

Incumbent platform commoditization: The most credible threat to vertical AI pure-plays is Epic Systems building medical AI documentation natively into its EHR, Clio building legal AI contract review natively into its practice management platform, or Oracle embedding financial services AI natively into its ERP. Incumbents have distribution and workflow integration advantages that pure-plays do not. If the incumbents succeed in building vertically competitive AI capabilities, the pure-plays face a Razor-and-blade re-pricing: the platform is free (or included in existing licenses) and the AI add-on is commoditized. This is the failure mode that Legal AI companies have been navigating since 2023, and it is the reason Harvey AI’s $11 billion valuation is dependent on maintaining accuracy and workflow depth advantages that are defensible against a well-funded incumbent building equivalent capabilities.
Regulatory compression of the compliance moat: The compliance advantage that vertical AI agents enjoy in healthcare, legal, and financial services is partly a function of the regulatory environment being unclear about AI decision-making in these domains. If regulators clarify and simplify the compliance requirements for AI-assisted decisions in these industries, as has happened in some areas of financial services AI, the regulatory moat narrows and horizontal AI tools become more viable in these verticals. The HIPAA compliance complexity that currently favors specialized healthcare AI is not necessarily permanent. A future in which AI-assisted medical coding is clearly regulated and approved could remove the compliance differentiation that healthcare vertical AI agents currently enjoy.
The 40% enterprise adoption figure for AI agents masks a production gap: Gartner’s finding that 80% of enterprise applications shipped in Q1 2026 embed at least one AI agent, while only 31% of organizations have an agent running in production, is the leading indicator of the same adoption failure that plagued horizontal AI in 2024 and 2025. Vertical AI agents may be hitting the same ceiling: the product is being embedded into workflows but not generating sustained business outcomes because the organizational change management required to move from pilot to production is systematically underestimated. If this production gap is as wide for vertical AI as it was for horizontal AI, the ARR growth rates for vertical AI companies will compress before they reach scale.
China AI commoditization of foundation models: DeepSeek and Chinese AI companies have demonstrated the ability to train competitive models at a fraction of US frontier model costs. If this capability extends to domain-specific fine-tuning, legal AI models trained on Chinese contract law, healthcare AI models trained on Chinese medical records, the training data moat that vertical AI companies depend on faces direct competitive pressure. The moat is not just technical; it is partly legal and regulatory (access to US court data, US medical records, US financial data is restricted in ways that Chinese company access to Chinese data is not). If Chinese AI companies can build competitive domain-specific models using Chinese data at lower cost, the global competitive dynamics for vertical AI vendors change significantly.

Investment Implications

Vertical AI pure-plays are the most credible AI investment opportunity today: Harvey AI at $11B with $200M ARR, OpenEvidence at $12B valuation, and Ambience Healthcare with documented health system deployments represent companies with genuine commercial traction in industries where 95% of AI investment is producing nothing. The market bifurcation between AI infrastructure beneficiaries (the hyperscalers and picks-and-shovels plays) and AI application winners is real, and the winners are vertical.
The incumbent-threat narrative is credible but timing-dependent: Epic, Clio, and Oracle building vertical AI capabilities is a matter of time. The investment question is whether the pure-plays can build sufficient domain depth and customer switching costs before the incumbents catch up. Harvey AI’s contract review accuracy advantage over a hypothetical Clio-built AI tool is a function of training data and workflow depth, advantages that are not permanent but are currently real. Investors should track Clio’s AI feature roadmap and Epic’s clinical AI announcements as leading indicators of when the incumbent threat becomes commercially material.
The healthcare vertical is the highest-ROI, highest-barrier opportunity: Healthcare AI agents with documented 30-40% administrative workload reductions, 40% billing error reductions, and 14-month payback periods are operating in the highest-ROI vertical with the highest regulatory barriers to entry. The combination of HIPAA compliance requirements, clinical accuracy standards, and healthcare system procurement complexity creates a moat that is more durable than legal or financial services vertical AI. AKASA, Ambience Healthcare, and the emerging cohort of healthcare AI agents focused on revenue cycle management are the most attractively positioned companies in the vertical AI landscape.
Outcome-based pricing is the key differentiator in the next phase of AI commercialization: As the market moves past the faith-based AI investment phase (where AI stocks were priced for potential rather than proof), the companies that can demonstrate ROI on a specific pricing metric, per claims processed, per matter reviewed, per fraud case detected, will command premium valuations. Companies still on seat-license or usage-based pricing models will face margin compression and valuation pressure as buyers demand ROI denominators. The transition to outcome-based pricing is a leading indicator of AI company commercial maturity.
Horizontal AI infrastructure remains a crowded trade with compressed upside: Microsoft is suffering despite strong Azure growth and $37B in annualized AI revenue as of the time of this post, is the most visible expression of the market’s reassessment of the horizontal AI thesis. The hedge fund de-grossing data from Goldman Sachs Prime Book, the largest reduction in IT sector positions in a decade, reflects institutional recognition that “AI beneficiary” is not a sufficient investment thesis when the AI is not generating measurable enterprise returns.

The Closing Question

Horizontal AI and vertical AI are both “AI.” They share the same foundation models, the same GPU infrastructure, the same transformer architecture. And yet one, horizontal AI, is generating $2.52 trillion in annual investment with documented 95% ROI failure rates. The other, vertical AI, is generating 3:1 to 5:1 documented ROI in healthcare, legal, and financial services with ARR growth rates that are the exception to every enterprise AI generalization.

The question is whether the pure-play vertical AI companies can maintain their domain-specific advantages long enough to build durable businesses before the incumbents catch up, and whether the current venture funding valuations are priced for that durability or for the early momentum that the incumbents will eventually match.

For now, vertical AI agents are the only part of the AI economy that seems to be actually working. It is the most important thing an investor in this market can know.

——————————————————

References

- Harvey AI: $200M annualized revenue, $11B valuation, $200M raise March 2026; contract review 50-75% time reduction [1] [Lux Capital portfolio]

- MIT Sloan: 95% of companies saw no measurable financial return from generative AI in 2025 [MIT Sloan Management School research]

- Boston Consulting Group: 42% of companies abandoned most AI initiatives in 2025 [BCG AI research]

- Microsoft Copilot: 20M paying users, ~5% conversion from 400M Office 365 commercial users [2]

- Microsoft Q3 FY2026: Azure 40% YoY, $37B annualized AI revenue, $190B capex guidance; MSFT at $420.77 May 7, down 15.7% YTD from $539.83 high [3]

- Gartner Q1 2026: 80% of enterprise applications embed AI agent, only 31% running in production; 40% of enterprise apps will feature task-specific AI agents by end of 2026, up from less than 5% in 2025 [4]

- Gartner Hype Cycle: agentic AI at “Peak of Inflated Expectations” in 2026 [5]

- IDC FutureScape 2026: 40% of G2000 job roles involve working with AI agents by 2026; $1.4T global enterprise AI agent spending by 2027 [6]

- Healthcare AI ROI: $3.20 per $1 invested, 14-month payback; 30-40% administrative workload reduction; 40% billing error reduction [7] [8]

- Healthcare AI agents: 60% of providers automating claims processing, RCM, and billing workflows; prior auth AI 40% processing time reduction [9]

- Legal AI market: $4.59B 2025 to $5.59B 2026, CAGR 22.3%, reaching $12.49B by 2030 [10]

- Harvey AI legal contract review: 50-75% drafting time reduction [11]

- Clio: $850M Series G round 2025, 79% of legal professionals using AI [Clio research]

- Finance AI ROI: 80% average ROI, 101%+ for deeply embedded deployments; 44% of finance teams utilizing agentic AI by 2026, 600%+ increase [12] [13]

- McKinsey: generative AI could add $200B to $340B annually to global banking sector [14]

- Fraud detection AI: fastest payback in financial services at 8 months [15]

- Accounts payable AI: fastest route to measurable AI ROI in finance; invoice processing, fraud triage, regulatory reporting [16]

- Goldman Sachs Prime Book: largest IT sector reduction in a decade over two weeks ending May 4, 2026 [17]

- AI market size: $7.84B 2025 to $12-15B 2026, CAGR 41% through 2030 [$52.62B by 2030] [18]

- Horizontal AI gross margins: ~52% vs 75-85% for traditional SaaS [19]

- OpenEvidence: valuation doubled to $12B after latest funding round [20]

- Ambience Healthcare: $110.65M raised across 4 rounds, AutoScribe deployed at Houston Methodist [21]

- AKASA: AI-driven revenue cycle management, automating coding, claims follow-up, prior authorization [22]

- Suki: voice-activated AI assistant for clinicians, EHR interaction [22]

- Legora (Europe): $550M round, $5.55B valuation, 3x prior valuation [1]

## Endnotes

[1] Broadband Breakfast: https://broadbandbreakfast.com/legal-tech-valuations-surge-in-2026-because-of-ai/

[2] ComputerWorld: https://www.computerworld.com/article/4166676/microsoft-now-has-over-20-million-paying-copilot-users.html

[3] Seeking Alpha: https://seekingalpha.com/article/4900458-microsoft-azure-doing-the-heavylifting

[4] Gartner: https://www.gartner.com/en/newsroom/press-releases/2025-08-26-gartner-predicts-40-percent-of-enterprise-apps-will-feature-task-specific-ai-agents-by-2026-up-from-less-than-5-percent-in-2025

[5] Gartner: https://www.gartner.com/en/articles/hype-cycle-for-agentic-ai

[6] IDC: https://www.idc.com/resource-center/blog/futurescape-2026-moving-into-the-agentic-future/

[7] Droidal: https://droidal.com/blog/calculate-roi-ai-agents-healthcare-rcm/

[8] Procurement Edge: https://www.productiveedge.com/blog/the-roi-of-ai-in-healthcare-what-the-numbers-actually-show

[9] Snowflake: https://www.snowflake.com/en/blog/ai-agents-healthcare-efficiency/

[10] The Business Research Company: https://www.thebusinessresearchcompany.com/report/artificial-intelligence-ai-in-legal-market-report

[11] Spellbook.legal: https://spellbook.com/learn/ai-contract-drafting-roi

[12] Dextralabs: https://dextralabs.com/blog/roi-of-implementing-ai-agents-in-finance/

[13] Salesmate: https://www.salesmate.io/blog/ai-agents-adoption-statistics/

[14] McKinsey BCG research: https://www.bcg.com/publications/2025/how-finance-leaders-can-get-roi-from-ai

[15] Neurons Lab: https://neurons-lab.com/article/agentic-ai-in-financial-services-2026/

[16] Basware: https://www.basware.com/en/ai-to-roi-unlock-value-with-ai-agents

[17] Investing.com: https://www.investing.com/news/stock-market-news/tech-stocks-see-largest-hedge-fund-selloff-in-decade-goldman-sachs-93CH-4655112

[18] AI Funding Tracker: https://aifundingtracker.com/top-ai-agent-startups/

[19] Forbes: https://www.forbes.com/sites/terdawn-deboe/2026/02/28/why-your-ai-investment-is-not-making-money-and-how-to-fix-it/

[20] Fundraising Insider: https://fundraiseinsider.com/blog/ai-startups/

[21] Lightit.io: https://www.lightit.io/blog/top-9-companies-building-ai-agents-in-healthcare-2026/

[22] Keragon: https://www.keragon.com/blog/ai-agent-companies

Chris C

The Vertical AI Credibility Gap

Why domain-specific agents could be the only part of today's AI economy actually working

Microsoft vs. Harvey

Enterprise AI Challenges are Vertical vs. Horizontal

Who Is Winning in Vertical AI and How Players Differentiate

Tier 1: Vertical-Domain AI-Native Companies With Proven ARR

Tier 2: Incumbents Building Vertical AI Capabilities Into Existing Platforms

Tier 3: Horizontal Agent Platforms With Vertical Ambitions

Why the Economics are Defensible

Gross margins on vertical AI services are higher and more stable than horizontal AI

Domain-specific data creates a compounding moat

The buyer decision chain is shorter and more tractable.

The Risks

Investment Implications

The Closing Question

References

Location

Contact

The Vertical AI Credibility Gap

Why domain-specific agents could be the only part of today's AI economy actually working

Microsoft vs. Harvey

Enterprise AI Challenges are Vertical vs. Horizontal

Who Is Winning in Vertical AI and How Players Differentiate

Tier 1: Vertical-Domain AI-Native Companies With Proven ARR

Tier 2: Incumbents Building Vertical AI Capabilities Into Existing Platforms

Tier 3: Horizontal Agent Platforms With Vertical Ambitions

Why the Economics are Defensible

Gross margins on vertical AI services are higher and more stable than horizontal AI

Domain-specific data creates a compounding moat

The buyer decision chain is shorter and more tractable.

The Risks

Investment Implications

The Closing Question

References

Applied Statistical Models for Evaluating Firm Operating Performance and Investment Returns

Location

Contact