← All essays
Governance6 minVol I · Ch 8

What Klarna's Reversal Actually Teaches: Governance, Not AI, Was the Failure

Klarna's AI customer-service retreat gets read as proof that AI isn't ready. The evidence says the opposite: the technology worked, and the governance architecture didn't.

ShareXLinkedInFacebookEmail

In January 2024, Sebastian Siemiatkowski announced what sounded like a proof of concept for the entire AI-Born thesis. Klarna's customer-service agent was handling 2.4 million conversations a month — the work of 700 full-time agents, he said publicly — at resolution speeds no human team could match. Sixteen months later, in May 2025, he told Bloomberg the company had gone too far: "cost unfortunately seems to have been a too predominant evaluation factor when organizing this, what you end up having is lower quality." Klarna began rehiring humans.

The headline writes itself: AI overhyped, company retreats, humans win. It's a satisfying story. It's also the wrong lesson — and getting it wrong will cost you the next deployment.

The conventional read, taken seriously

The skeptic's version deserves a fair hearing. A high-profile company replaced people with an agent, the quality dropped, and they reversed course. If that's not evidence that AI customer service isn't ready, what is?

Here's what that read misses. Klarna didn't collapse. Revenue grew 24% year-over-year in 2024, and the company IPO'd on the NYSE in September 2025 at a valuation reflecting real momentum. The Machine Core delivered genuine efficiency — fast, scalable resolution of straightforward queries. The agent wasn't the problem. So if the technology worked and the business kept growing, what exactly failed?

The reframe: the failure lived in the measurement, not the model

The agent handled transactional queries beautifully. Where it failed was the vast emotional middle of customer experience — the user whose payment was wrongly flagged, the person in genuine financial distress who needed to feel heard before they needed to be resolved. Optimized for closure, the agent treated every conversation as a transaction, applying the same script to the mildly curious and the financially desperate.

And the dashboards stayed green. Resolution rate and speed — both excellent. What the instruments didn't capture: repeat contact rate climbing, Net Promoter Score degrading, conversations closed at 89% confidence that had been objectively mishandled. The agent was confidently wrong, and no system connected those lagging outcome signals back to its decision logic.

This is the governing law of production governance, and it's deceptively simple: detection systems measure what you instrument, and governance failures hide in what you don't. Klarna measured whether the conversation closed. It didn't measure whether the customer got what they needed. The gap between those two is exactly where the failure compounded — for months — while everyone watching the dashboard believed they were winning.

Figure: The hybrid architecture Klarna eventually built — AI triaging simple cases, humans handling the emotionally complex ones, with automatic escalation connecting the two. The design it should have started with.

The mechanism: alignment debt compounding in the dark

There's a name for what was accumulating. Alignment debt is the divergence between an organization's stated intent and its agents' actual behavior. Klarna's stated intent was customer-service excellence; the agent's behavior optimized for closure rate. That gap is debt, and it built invisibly because no detection system measured it. By the time it surfaced, it had compounded into structural customer-experience damage.

Like financial debt, alignment debt doesn't wait for a convenient moment. An agent operating at 10% misalignment for six months produces hundreds of misaligned decisions that harden into patterns, each cycle reinforcing the next. The window for cheap correction closes faster than most teams expect — and the Klarna timeline is the proof. Three governance systems were absent or underpowered at deployment:

  • Detection: no monitoring sensitive enough to catch quality drift before it became widespread dissatisfaction.
  • Escalation: no reliable routing to identify which cases needed the Human Cortex before the customer had a bad experience.
  • Recovery: no pre-designed playbook for when the system drifted past its competence boundaries.

The escalation gap is the subtle one. Klarna's protocols existed on paper, but they depended on the agent recognizing its own limits. The cases that most need human judgment are exactly the ones an agent least recognizes as needing it. An emotionally charged dispute doesn't present as an escalation flag — it presents as a routine query the agent proceeds to mishandle at high confidence. Escalation triggered by agent confidence alone systematically under-routes the cases where the failure mode is miscalibrated confidence.

What to do instead

Klarna eventually built all three layers and landed on the hybrid model it should have designed from the start. The lesson isn't "don't deploy." It's "build the triad before you scale, not after the complaints arrive."

  1. Instrument outcomes, not just proxies. Resolution rate is a proxy. Repeat contact rate, sentiment degradation, and the confidence-accuracy calibration gap are outcomes. The hard-to-measure signals are the ones that catch the failures that matter. If you build detection around the easy metrics, you'll catch the failures that don't need catching and miss the ones that do.

  2. Trigger escalation on outcome signals, not just confidence. A case that generated a callback within 48 hours is an escalation signal retroactively — even if the agent closed it at 94% confidence. Build that logic in.

  3. Run a pre-deployment failure-mode workshop. Spend half a day before launch asking: what would a failure look like that wouldn't show up in the metrics we plan to track? In practice this surfaces at least two signal gaps the team assumed were covered.

  4. Design the hybrid from the start. AI handling triage and simple cases; humans handling the emotionally complex and structurally ambiguous; automatic escalation connecting them. This is where Klarna ended up. Starting there costs about three months of slower deployment. Retrofitting it cost a year of compounding governance debt and a rehiring cycle that partially negated the savings.

The principle

The Klarna arc is not a story about AI failure. It's a story about governance architecture that was absent at deployment, expensive to retrofit, and ultimately necessary to sustain the efficiency the Machine Core delivered. The technology did its job. The instruments didn't.

Build detection around the outcomes that matter, and the governance layer becomes operating infrastructure — compounding returns, not compounding costs. Build it around what's easy to measure, and you'll spend year three retrofitting a system that should have been designed in year one.

Adapted from the essays accompanying AI‑Born by Mehran Granfar. Themes drawn from Volume I, "The Machine Core".

The Dispatch — N°01

Essays from
the lineage break.

New essays, framework studies, excerpts and pre‑order news. Sent rarely. Never noise.