Two Hundred Million to Study the Wave You're Making
A frontier lab has priced in the scenario where its product makes office work unnecessary — and is funding the academics who'll help governments respond. Adoption is wide; realisation is narrow. The gap is now numerical.
Two Hundred Million to Study the Wave You're Making
On Wednesday June 10, Anthropic announced a $200 million research commitment and two policy frameworks aimed at the labor consequences of the technology it builds. The headline number is what most coverage seized on — US News carried it cleanly, Business Standard a few hours later. The document itself is more interesting, and you can read it for yourself in Anthropic's own PDF. What it does, in plain English, is map three escalating scenarios — 5% unemployment, 10% unemployment, an "unprecedented" tier — and propose policy responses for each, including basic income, sovereign wealth structures, and equity-sharing if labor markets stop clearing.
You can read this two ways. The charitable reading is that a frontier lab is being responsible about externalities most of its competitors will not name. The less charitable one is that the company has now publicly priced in a scenario where its product makes a substantial fraction of office work unnecessary — and is funding the academic apparatus that will help governments respond when that happens. Both readings are true. I find the second more useful for thinking about your enterprise this quarter, because it tells you something the vendor decks will not.
the gap is now numerical
Strip away the rhetoric and the productivity story in mid-2026 is converging on a single uncomfortable shape. Adoption is wide. Realization is narrow.
McKinsey's State of AI Trust 2026, drawn from roughly five hundred organizations surveyed late last year, puts regular AI use at 88% of respondent firms and generative-AI use at 72%, up from 33% two years earlier. The same survey reports that only about 6% of organizations qualify as "AI high performers" — meaning they attribute more than 5% of EBIT to AI use — and that in developed markets a vanishing 1% of executives describe their gen-AI rollouts as "mature." Almost everyone is using it. Almost nobody is making the numbers move enough to defend in front of an audit committee.
A separate piece of work from the Atlanta Fed and a coalition of researchers, published as NBER Working Paper 34984 earlier this year, surveyed roughly 750 corporate executives — mostly CFOs — and reached the same shape from the inside. Labor-productivity gains are positive, vary by sector, and are expected to strengthen through 2026, but the paper names what most boards have not yet been told plainly: a "productivity paradox in which perceived productivity gains are larger than measured productivity gains, likely reflecting a delay in revenue realizations." Translated: the people using the tools think they are faster. The income statement does not yet agree. That is the honest measurement problem, restated by 750 CFOs who have no incentive to invent it.
The macro side is duller and more aligned. The Bureau of Labor Statistics' revised Q1 2026 release shows nonfarm-business labor productivity up 0.3% on the quarter and 2.9% on the year. The Q2 number drops on August 6. The trend through 2025 and into 2026 is positive but unspectacular — exactly what you would expect if the productivity-paradox literature (Brynjolfsson, Solow) is right and we are still in the install phase of a general-purpose technology, where the model and the model's economic dividend are separated by years of organizational rewiring.
where the discipline lives — and where it doesn't
The piece of recent work worth your time is the Stanford Enterprise AI Playbook — Pereira, Graylin, and Brynjolfsson, published in March. Fifty-one production deployments across forty-one organizations, seven countries, and roughly a million employees in aggregate. The finding that should be on every CIO's office wall: 95% of AI transformation failures trace to organizational factors, not technology. Workflow mapping before tool selection. Governance from day one. Observability before production. Leadership continuity through the first eighteen months, including through visible failure. None of that is glamorous, and none of it is what the typical vendor deck recommends.
The Stanford team also reports a median productivity gain of 71% for agentic deployments and 40% for high-automation systems. Treat those carefully — vendor-side numbers in spirit, drawn from cases self-selected as successful. The honest reading is that if you do the unglamorous work, the gains are real. If you don't, you join the 22% of agent deployments that arrive at twelve months with negative ROI.
the klarna reversal is the cleanest counter-example we have
If you only remember one case from 2025-2026, make it Klarna. Between 2022 and 2024 the Swedish fintech eliminated roughly 700 customer-service positions and replaced them with an OpenAI-backed assistant that, at peak, the company said was handling two-thirds to three-quarters of customer interactions. CEO Sebastian Siemiatkowski told the world in 2024 that the AI was doing the work of 700 people. By early 2026, with customer-satisfaction scores deteriorating on complex calls, the company quietly began rehiring — targeting students and rural residents — and shifted to a hybrid model where AI handles routine queries and humans handle escalations.
I would not over-extract from one company's experience. The point is not that AI customer service "doesn't work." It is that the production reality is a centaur model — humans on escalation, AI on volume — and that the firms which announced the autopilot version loudest are the ones rebuilding the staff most quietly. The case is also a reminder that "the AI did the work of 700 people" was always a self-report from a CEO with an IPO to walk into, and that the right question to ask of any such claim is: what would the customer-satisfaction time-series look like if you charted it next to the headcount?
the worker side of the ledger
A piece of evidence that does not get cited often enough sits in ManpowerGroup's Global Talent Barometer 2026. Regular AI usage among workers jumped 13 percentage points to 45%. In the same period, confidence in using technology fell 18 points — the first overall decline in worker confidence in three years. The drop was sharpest among Baby Boomers (-35) and Gen X (-25). Fifty-six percent of the global workforce reported receiving no recent training. Forty-three percent said they fear automation may replace their job within two years.
That is not a healthy adoption pattern. It is mass usage in the absence of structured upskilling, with predictable consequences. A workforce that uses a tool more while trusting itself less is not producing centaurs. It is producing operators flying on autopilot they don't understand — and "autopilot you don't understand" is the configuration that, every time it has appeared in any high-consequence industry, has eventually broken expensively. Aviation learned this in the late 1980s with the early glass-cockpit era. Trading floors learned it in 2010. Knowledge work is about to learn it again, more diffusely, with a much longer tail of damage.
what i would do this week, on a board, with a budget
Three questions are the ones I would push to the next board agenda this quarter — and I would not accept a vendor case study in lieu of an answer to any of them.
One: show me the instrumented baseline. Not survey data. Not "users say." A measured, pre-AI throughput number for the workflow you are claiming improvement on. If it does not exist, the gain does not exist; what exists is a feeling about a gain.
Two: where in the workflow is the human still doing what the model cannot? Name the escalation path. Klarna eventually named theirs. Most of your deployments still pretend the path is not needed.
Three: what is your training spend per AI-using employee, this year, in money? If the answer is meaningfully smaller than your tool license spend, you are running the ManpowerGroup experiment inside your own company. The 56% no-training figure is not a global statistic. It is a forecast for your own audit findings in eighteen months.
The Anthropic announcement is — read correctly — useful for exactly this reason. A frontier lab has now produced a document that takes seriously the possibility that the technology it sells could remove a meaningful fraction of paid work without a clean substitution path. That is not a reason to slow your deployment. It is a reason to make sure your deployment includes the workforce you intend to keep.
Tarry Singh is the founder and CEO of Real AI (realai.eu), an enterprise AI advisory and deployment firm working with global enterprises on production agent systems, model risk, and AI sovereignty strategy. He also leads Earthscan, an Energy AI startup, and is a founding contributor to the EU-funded HCAIM and PANORAIMA programmes for responsible AI education across European universities. He writes at tarrysingh.com.