Neoclouds were meant to eat the hyperscalers’ lunch.
Faster builds. No price gouging. None of the procurement theatre that slows the big three down.
For a while, that pitch worked.
Especially when AI demand was vertical and “build it and they will come” was a viable business model.
But a lot of them built the exact same product. Pointed it at the same customers. Used the same messaging and colour schemes. And financed it all with expensive debt.
And now?
H100 rental rates are in freefall.
H200s? Not far behind. Margins? Gone. Lease payments? Missed. First mover advantage? A distant memory. And the companies that sought to break the hyperscaler monopoly by deploying thousands of systems into a commodity market under the illusion that they’d achieved product-market fit?
They’re taking their calls and signing their term sheets.
And it looks like consolidation has begun.
I’m Ben Baldieri. Every week, I break down what’s moving in GPU compute, AI infrastructure, and the data centres that power it all.
Here’s what’s inside this week:
Let’s get into it.
The GPU Audio Companion Issue #57
Want the GPU breakdown without the reading? The Audio Companion does it for you, but only if you’re subscribed. If you can’t see it below, click here to fix that.
Northern Data sells Peak Mining, Mulls Rumble Sale
Northern Data is selling off business units while weighing a future under new ownership.
Completely missed in this story is that Northern Data is one of Europe’s very few sovereign cloud data centre players. That it could be sold to a US company should ring alarm bells amongst those interested in European resilience.
— #Mike Butcher (BlueSky/Threads: @mikebutcher) (#@mikebutcher)
9:49 AM • Aug 15, 2025
The company has signed a non-binding, non-exclusive term sheet to sell Peak Mining, its crypto unit, to Elektron Energy in a deal valued at $235M. It includes $175M upfront, with the rest tied to performance metrics and the transfer of a deposit agreement with a mining hardware supplier. At the same time, video platform Rumble has made a $1.17B all-share offer to acquire the rest of Northern Data, folding in its Taiga Cloud GPU business and Ardent Data Centres.
So, what’s going on?
Northern Data operates more than 22,000 GPUs across Europe.
Those systems are predominantly collateralised H100s, deployed with leverage on the cash flows they’re meant to generate. But H100 rental rates have plummeted. And cash flows have likely dried up.
Given how painful that leverage is likely feeling, asset sales and partnership deals with last year’s competition are all you have left.
Why this matters:
Considering how many neoclouds deployed H100s at scale, and the leverage (15-20% cost of capital) sat on their balance sheets, extensive and painful consolidation is on the horizon.
Northern Data is also one of Europe's few major sovereign AI clouds, and the fact that a US company may buy them isn’t great for EU ambitions or resilience.
Click here to read the Peak Mining announcement and Rumble announcement from Northern Data.
Washington’s 15% of NVIDIA & AMD’s China Revenue
The US has given NVIDIA and AMD the green light to sell AI chips to China, but only if they hand over 15% of the revenue to Washington.
Nvidia, AMD may sell high-end AI chips to China if they pay US a cut | TechCrunch techcrunch.com/2025/08/11/nvi…
— #TechCrunch (#@TechCrunch)
1:14 PM • Aug 11, 2025
The deal will see NVIDIA share proceeds from H20 sales, while AMD will do the same for its MI308 chips. This comes after the Trump administration briefly banned those exports in April, only to reverse course when NVIDIA pledged up to $500B in US data centre investments. Licenses are now being issued for the sales, with the deal reportedly tied to broader trade talks over rare earths.
Shakedown? Or the cost of doing business?
You decide.
Why this matters:
Even with export controls, Chinese hyperscalers and AI labs are buying billions in US silicon annually.
For NVIDIA and AMD, cutting them off entirely would mean ceding that revenue to Huawei and other domestic rivals.
Open-source models may erode US dominance at the application layer, but if those models still run on US-designed infrastructure, Washington still controls the bottleneck that matters most. Which is probably why Beijing is urging local firms not to buy the chips.
Core42 Gives Northern Data Breathing Room
Core42 is expanding its European sovereign AI footprint through a new strategic deal with Northern Data.
We’re excited to announce our partnership with @NorthernDataGrp to expand sovereign AI infrastructure across Europe. With access to up to 10,000 Taiga Cloud GPUs, Core42 will accelerate the deployment of high-performance, scalable, and compliant AI solutions.
— #Core42 (#@core42_ai)
11:14 AM • Aug 12, 2025
The partnership gives Core42 access to up to 10,000 GPUs from Northern Data’s Taiga Cloud estate, boosting its ability to deliver compliant, sovereign-grade AI infrastructure across Europe. It builds on Core42’s recent European moves in France and Italy, and aligns with its sovereign strategy: blending its own deployments with trusted local partner capacity to scale quickly.
For Northern Data, the deal comes as the company faces falling H100 pricing, senior exec departures, and takeover offers.
Why this matters:
Access to 10,000 GPUs gives Core42 immediate scale in Europe without waiting for new builds.
This means Core42 can meet near-term sovereign AI demand with live clusters, sidestepping the long lead times, construction risk, and supply chain friction.
With H100 economics under pressure, per the top story, offloading capacity under a high-profile partnership with Core42 gives Northern Data offtake, cash flow, and, if the struggle is real, a lifeline.
GPT-5 Lands with a Resounding Thud
OpenAI’s long-teased GPT-5 has become a case study in over-promising and under-delivering.
Fabulous quote on AI, from someone in the media who actually gets what we have just witnessed.
“In the aftermath of GPT-5’s launch, it has become more difficult to take bombastic predictions about A.I. at face value, and the views of critics like Marcus seem increasingly
— #Gary Marcus (#@GaryMarcus)
10:12 PM • Aug 12, 2025
Hyped as a “PhD-level expert” that would revolutionise interaction, the new model landed with an immediate backlash. Reddit threads with thousands of upvotes called it “horrible,” with so many users demanding the return of GPT-4o that OpenAI caved. Benchmarks show only incremental gains, and in some areas, users feel performance has slipped. A scathing piece from Gary Marcus accused the release of being rushed, overhyped, and riddled with the same hallucination and logic failures that have plagued LLMs for years.
Why this matters:
The initial rollout and router feature effectively shifted all users to a single “one-size-fits-all” model that’s cheaper to run for OpenAI.
While OpenAI still dominate the conversation, you’d be forgiven for thinking cracks might be appearing in the techno-optimist facade.
IBM & Google Claim Commercial Quantum by 2030
The quantum race is no longer just physics experiments in a lab.
Google and IBM believe first workable quantum computer is in sight — Richard Waters
— #FT Opinion (#@ftopinion)
4:12 AM • Aug 12, 2025
IBM and Google both claim they can deliver industrial-scale quantum computers by 2030, pushing the field from sub-200 qubit prototypes toward the million-qubit systems needed for practical use. That leap means overcoming brutal scaling challenges: qubit instability, interference, manufacturing limits, and the staggering cost of error correction. Google is betting on surface code error correction, requiring millions of physical qubits but already showing scaling gains. IBM is pursuing low-density parity-check codes to slash qubit requirements by up to 90%, though manufacturing proof is pending.
AWS and others, however, remain sceptical, suggesting “useful” quantum may still be decades out.
Why this matters:
Quantum computing could eventually redefine AI and HPC workloads beyond what GPUs can handle, and legitimate progress is being made at an accelerating pace.
We’ve already had big quantum announcements from Google and Microsoft this year, plus a host of quantum companies behaving like meme coins from a stock price standpoint.
However, considering how much hype we all have to wade through on a daily basis in anything vaguely AI-adjacent, cynicism as to when quantum becomes commercially viable is healthy.
Microsoft Is in Hot Water Over Unit 8200 Azure Use
Microsoft is investigating reports that units from Israel’s Intelligence Corps are using Azure to run airstrike targeting decision workloads.
Microsoft, Israel, and the profit-ethics equation dlvr.it/TMSvvS
— #DCD (#@dcdnews)
3:49 PM • Aug 13, 2025
Unit 8200 is allegedly using Azure to store and process vast intelligence datasets on Palestinians and high-priority targets. This isn’t the first time the company has answered questions about their engagement with Unit 8200. Prior reviews found “no evidence” that Microsoft’s tech had been used to harm people in Gaza. But new reporting from The Guardian, +972 Magazine, and Local Call suggests some employees may have known more.
Why this matters:
Azure’s value to sovereign and military customers rests on guaranteed confidentiality.
Any confirmation of misuse could trigger shareholder pressure, lawsuits, and new due diligence rules for hyperscalers, ultimately reshaping how US cloud providers sell to high-risk state actors.
TeraWulf & Fluidstack Ink $3.7B Google-Backed Deal
TeraWulf has signed two 10-year HPC colocation agreements with Fluidstack for 200+MW of capacity.
TeraWulf CEO on Google investment: Building one of the largest data center campuses in the U.S. with @CNBC @PowerLunch @SullyCNBC @PaulBPrager
— #TeraWulf (#@TeraWulfInc)
7:08 PM • Aug 14, 2025
The deal is worth $3.7B in contracted revenue, with two 5-year extensions potentially pushing that to $8.7B. And to make it even sweeter, Google is backstopping $1.8B of Fluidstack’s lease obligations while taking an 8% equity stake in TeraWulf via warrants. Phase one (40MW) is due online at the Lake Mariner campus in 1H 2026, with the rest deployed by year-end. TeraWulf also granted Fluidstack a 30-day exclusivity window for another 160MW at the site.
With this new agreement, Lake Mariner is now firmly cemented as the anchor site for AI hosting in New York State.
Why this matters:
This isn’t the first neocloud Terawulf is hosting - Core42 signed a 70MW hosting deal back in December 2024.
200MW+ is serious capacity requiring serious financing.
A $1.8B Google backstop de-risks the deal, cements Lake Mariner’s status, and gives serious credibility to both Fluidstack and Terawulf.
The Rundown
At Christmas, there were 145+ neoclouds and GPU providers.
“Tier 1 are the leaders. They’ve established massive private GPU clusters, have access to deep liquidity, and diversified into high-margin services. CoreWeave’s $10 billion contract with Microsoft is a blueprint for this tier’s success. For Tier 1 players, it’s not just about scale but investing in infrastructure and services that enable the next generation of AI applications.
Tier 2’s rising stars are scaling fast, securing significant funding, deploying their first sizeable high-end GPU clusters, and winning private cloud contracts. The mid-tier faces a choice: scale up or risk being squeezed out by Tier 1 dominance.
This leaves Tier 3: the connectors. These aggregators stitch spare capacity together to meet short-term demand. Their agility makes them essential for niche workloads, but limited scalability and resources prevent them from competing with the upper tiers. Yet, given the market’s spare capacity, direct competition may not be necessary.”
Eight months later, the lines are sharper, and the stakes are higher.
Tier 1 are still the leaders, and they’re way out in front. They’ve carved out their niche, dictate terms, own the relationships that matter, and are selective about the partnerships they sign.
Tier 2 are the survivors and dealmakers. They’re pragmatic enough to cut capacity deals, swap workloads, or align with sovereign operators if it keeps the lights on.
And Tier 3? The left behind. They’re still in the market, but who knows for how much longer.
Because in August 2025, you’re either overextended and living with the consequences of questionable operational decisions. Or you’re not. And that changes everything.
The calculus has shifted from independence vs growth to two simple questions:
Can you make it through the next twelve months?
And if you can’t, what are you willing to do to survive?
See you next week.