AI is running into real‑world limits: power, transformers, and memory are now scarcer than model ideas, and a small cartel of labs and infra giants is soaking up most of the capital. At the same time, China is standing up a cheaper, increasingly capable Huawei+DeepSeek stack while workers and local politicians start to push back on how the gains are shared.
The big question is whether firms lock themselves into today’s expensive capacity and vendors or wait for efficiency, open weights, and regulation to change the balance of power.
Key Events
/Google to invest up to $40B in Anthropic as part of a deep AI partnership.
/Amazon backs Anthropic with $5B and a commitment to about $100B in cloud spend tied to Trainium compute.
/SpaceX secures an option to acquire coding startup Cursor for roughly $60B after proposing a $10B collaboration fee.
/Tesla discloses a $2B acquisition of an AI hardware company and ramps new factories focused on AI compute and battery materials.
/DeepSeek V4 launches on Huawei Ascend 950 with a 1M‑token context and ultra‑low token pricing.
Report
Money is pouring into AI faster than the grid, transformers, and memory can keep up. This quarter the trade is less about who has the smartest model and more about who actually has power, capacity, and a credible second supplier.
the hard cap on compute and power
Data centers already consume roughly 7% of U.S. power. Half of the AI facilities planned for 2026 are delayed or cancelled because transformers are scarce and expensive.
Transformer prices have roughly tripled in four years, turning basic grid equipment into a strategic asset for anyone scaling AI. Gas projects for just 11 data‑center campuses could emit more greenhouse gases than entire countries.
One study pegs the hidden environmental and health cost of data centers at $25B a year. Local politics is starting to bite, with Maine vetoing some new centers but predicting a moratorium will pass, and cities like Monterey Park outright banning them.
Chipmakers themselves expect to cover only about 60% of AI memory demand by 2027, so even if GPUs are available, feeding them becomes the next choke point.
capital piling into a tiny AI cartel
Anthropic has effectively become a third hyperscaler, taking a $5B investment from Amazon tied to a long‑term cloud deal. That agreement commits Anthropic to roughly $100B of cloud spend over the next decade on Amazon’s Trainium‑based compute.
Google is hedging too, planning to invest up to $40B into Anthropic on top of its own Gemini stack. Secondary markets now price Anthropic at around a trillion‑dollar valuation, above OpenAI, even as it raises prices to manage ongoing compute shortages.
In parallel, Jeff Bezos’ Project Prometheus raised about $10B at a ~$38B valuation and SpaceX secured an option to buy coding‑assistant startup Cursor for roughly $60B, concentrating enormous capital into a handful of AI platforms.
the chinese stack: cheap, big, and good enough
DeepSeek V4 Pro and Flash run on Huawei’s Ascend 950, offering a 1M‑token context window and parameter counts in the trillion range.
The Pro and Flash variants are priced as low as $1.74 and $0.028 per million input tokens, with a temporary 75% API discount undercutting Western frontier models.
Huawei’s domestic chips have already captured about 41% of China’s AI server market, pairing these models with a homegrown hardware base. Chinese systems like Xiaomi’s MiMo‑V2.5‑Pro now match U.S. frontier coding AIs on SWE‑Bench Pro while using 40–60% fewer tokens, and open‑weight Chinese models increasingly beat U.S. ones on cost.
All this lands against U.S. accusations of "industrial‑scale" AI theft and open talk of an AI Cold War, so the cheap Chinese stack is advancing technically while becoming geopolitically contentious.
labor, surveillance, and the AI blame game
Meta is cutting about 10% of its workforce, roughly 8,000 people, while rolling out software that captures mouse movements, keystrokes, and screenshots from employees to train its AI.
Microsoft is offering voluntary buyouts to up to 7% of its U.S. workforce and plans to cut around 8,750 jobs as part of its AI spending pivot.
IBM has already replaced about 43,000 American staff with 135,000 workers in India, and a study projects over 260,000 people in Massachusetts alone could lose jobs to AI systems within five years.
At the same time, UK data show no significant impact of AI on overall employment with AI‑exposed jobs actually growing faster, and thousands of CEOs admit their AI efforts have had little effect on productivity or headcount.
Workers are responding with open pushback, from Samsung’s 30,000‑plus union members demanding roughly $400k bonuses each from a $38B AI memory windfall to employees framing layoffs as profit grabs rather than technological inevitability.
ROI reality check on AI agents and code
Practitioners estimate that maybe 20% of deployed AI agents deliver real ROI, with the rest mostly hype or shallow time‑savers. Discussions highlight that classic predictive models often beat flashy generative agents on return, and that real value tends to come from narrow, recurring workflows rather than grand "autonomous employee" visions.
Google now says about 75% of its new code is AI‑generated, up from 25% in 2024, but users are loudly complaining about deteriorating quality in products like Search and Maps.
A controlled study found that when 1,222 people lost access to AI assistants, their performance fell below a control group, yet benchmarks like CI‑Work show enterprise agents still leak sensitive data in 15.8–50.9% of test cases.
There are genuine wins—Starlink replaced its call center with an agent called Grok that now resolves roughly 70% of calls, and the UAE wants half of government operations running on agentic AI within two years—but the hit rate remains low.
What This Means
Compute, power, and a handful of vertically integrated labs are turning into semi‑regulated scarce commodities, while a cheaper Chinese stack and restless labor undercut the idea that this will be a frictionless software play. The live decision is whether firms lock in capacity and model access with this emerging cartel now or bet that efficiency gains, open weights, and political backlash will reset the economics before today’s contracts expire.
On Watch
/Unauthorized access to Anthropic’s Mythos cybersecurity model—initially deemed too dangerous for public release—exposed control gaps at a lab that is rapidly becoming core infrastructure for governments and hyperscalers.
/The UK’s move to reconsider Palantir’s NHS data‑platform contract even as Palantir secures a $300M USDA food‑security deal shows political risk around high‑profile gov AI deployments is rising fast.
/Maryland’s ban on 'surveillance pricing' and Colorado’s open‑source carve‑out in age‑verification rules hint at a coming wave of targeted regulation on how AI systems use personal data and dynamic pricing.
Interesting
/Google Cloud's revenue backlog is projected to exceed $240 billion by the end of 2025, indicating strong growth in AI services.
/China's backing of an orbital data center startup with $8.4 billion in credit lines highlights the global race in data infrastructure.
/Nearly a third of Anthropic staff believe their AI model could replace junior engineers within three months, indicating rapid advancements in AI capabilities.
/The Maryland ban on surveillance pricing reflects increasing legislative action against privacy violations in data usage.
/Thousands of cities in the U.S. have implemented AI-integrated license plate readers, raising significant privacy concerns due to their extensive surveillance capabilities.
We processed 10,000+ comments and posts to generate this report.
AI-generated content. Verify critical information independently.
/Google to invest up to $40B in Anthropic as part of a deep AI partnership.
/Amazon backs Anthropic with $5B and a commitment to about $100B in cloud spend tied to Trainium compute.
/SpaceX secures an option to acquire coding startup Cursor for roughly $60B after proposing a $10B collaboration fee.
/Tesla discloses a $2B acquisition of an AI hardware company and ramps new factories focused on AI compute and battery materials.
/DeepSeek V4 launches on Huawei Ascend 950 with a 1M‑token context and ultra‑low token pricing.
On Watch
/Unauthorized access to Anthropic’s Mythos cybersecurity model—initially deemed too dangerous for public release—exposed control gaps at a lab that is rapidly becoming core infrastructure for governments and hyperscalers.
/The UK’s move to reconsider Palantir’s NHS data‑platform contract even as Palantir secures a $300M USDA food‑security deal shows political risk around high‑profile gov AI deployments is rising fast.
/Maryland’s ban on 'surveillance pricing' and Colorado’s open‑source carve‑out in age‑verification rules hint at a coming wave of targeted regulation on how AI systems use personal data and dynamic pricing.
Interesting
/Google Cloud's revenue backlog is projected to exceed $240 billion by the end of 2025, indicating strong growth in AI services.
/China's backing of an orbital data center startup with $8.4 billion in credit lines highlights the global race in data infrastructure.
/Nearly a third of Anthropic staff believe their AI model could replace junior engineers within three months, indicating rapid advancements in AI capabilities.
/The Maryland ban on surveillance pricing reflects increasing legislative action against privacy violations in data usage.
/Thousands of cities in the U.S. have implemented AI-integrated license plate readers, raising significant privacy concerns due to their extensive surveillance capabilities.