← All Memos
AiMay 14, 2026

Infrastructure Up, AI Applications Face Headwinds

AI capex surge validates physical infrastructure plays while model commoditization and alpha generation limits constrain application-layer value capture.

The AI investment landscape is bifurcating sharply between physical infrastructure, where capital intensity and scarcity economics favor concentrated winners, and the application layer, where model convergence and structural limitations in autonomous systems compress margins and challenge differentiation. Three converging dynamics define the current phase: bottlenecks migrating downstream from GPUs to power and memory; frontier model performance converging across geographies while routing architectures gain strategic importance; and persistent failures in LLM-driven autonomous trading despite bounded success in narrower tasks. For crypto-native portfolios, this suggests favoring decentralized compute and storage infrastructure over AI agent tokens, while maintaining deep skepticism toward on-chain autonomous trading protocols.


Infrastructure Scarcity Intensifies Beyond Silicon

The AI infrastructure constraint has evolved from a chip supply problem into a multi-dimensional physical bottleneck spanning power delivery, CPU orchestration, and memory capacity. Hyperscaler power purchase agreement flow surged 16 points in a single week, reflecting urgent procurement of generation capacity rather than incremental GPU orders [1][8]. Aggregate Q1 capex reached $148B, representing 91% year-over-year growth, yet much of this capital is flowing to industrial infrastructure rather than chip purchases alone [6][10]. Goldman Sachs frames the challenge starkly: power density requirements now exceed traditional data center architectures by orders of magnitude, forcing hyperscalers to fundamentally redesign thermal and electrical systems [8].

The memory layer presents an equally compelling structural story. Projected 625x expansion in memory demand by 2028 transforms what was historically a cyclical trade into a secular growth vector [5]. Glass substrate commercialization, representing the first major packaging transition in 30 years, signals that physical constraints at the package level now gate system performance as much as transistor scaling [7]. CPUs, often overlooked in the GPU-centric narrative, have emerged as critical orchestration bottlenecks; branching and scheduling efficiency increasingly determines effective cluster utilization [4][9].

For capital allocation, this migration of constraints implies that power equipment manufacturers, industrial infrastructure providers, and memory suppliers occupy the most defensible positions in the value chain. The "picks and shovels" thesis has shifted from semiconductors to the utilities and physical plant that feed them [1][10].

Model Frontier Compression Reshapes Competitive Dynamics

While infrastructure scarcity intensifies, the model layer is experiencing rapid commoditization. Composite IQ scores at the frontier have converged across US and Chinese labs, with cost-efficient non-US models achieving competitive performance at materially lower effective cost [11][12][16]. This convergence undermines moat assumptions predicated on singular model superiority and shifts competitive advantage toward deployment efficiency and system integration.

The strategic response is a migration from single-model architectures to multi-model routing systems that dynamically allocate queries based on task complexity and cost constraints [17]. Graphon's "intelligence layer" exemplifies this trend, promising to reduce effective compute costs by routing simple queries to lightweight models while reserving frontier capacity for complex tasks [13]. IDC projects that model routing will become the dominant deployment paradigm, suggesting that orchestration middleware, not raw model capability, may capture disproportionate value [17].

Cerebras's $40B IPO signals sustained public market appetite for AI infrastructure, yet the filing reveals significant revenue concentration risk, with a small number of hyperscaler customers accounting for the majority of revenue [14][19]. This concentration mirrors broader structural risks in AI hardware: while demand is robust, customer bargaining power and the potential for vertical integration compress supplier margins over time [15].

China's open-model strategy, documented in USCC analysis, reinforces commoditization pressure by flooding the market with capable, cost-competitive alternatives [18]. For investors, this implies that pure-play model companies face margin compression unless they can differentiate through proprietary data, vertical integration, or deployment efficiency.

LLM Alpha Generation: Promising in Bounds, Failing at Autonomy

The application of LLMs to financial alpha generation reveals a consistent pattern: bounded tasks yield measurable but modest gains, while autonomous systems fail reliably. Agentic LLMs demonstrate 68% directional accuracy in earnings forecasting and generate 15.8 basis points per day in cross-sectional equity selection [21]. However, these results require careful task scoping and human-in-the-loop oversight.

Autonomous trading bots, by contrast, lose money in 81% of public arena trials [23][26]. The gap between constrained and unconstrained performance is not merely quantitative but structural. A critical finding is that 23.4% of LLM-generated factors carry memorization leakage, inflating backtest returns by 14.3 percentage points [22][25]. This contamination renders naive backtesting unreliable and demands explicit bias consideration in any LLM-driven strategy development [25].

LLM-guided hypothesis discovery accelerates combinatorial signal search but rarely escapes known factor structures [22]. The integration layer, encompassing how signals are weighted, combined, and risk-managed, matters more than raw signal generation [24][27]. Adaptive alpha weighting using reinforcement learning approaches shows promise in managing the noisy output of prompt-based LLM alphas, but adds implementation complexity [24].

Cross-Theme Tensions and Portfolio Implications

The three themes reveal a fundamental tension: infrastructure investment is accelerating precisely because application-layer value capture remains elusive. Hyperscalers are spending $148B quarterly on capex, yet the downstream use cases, including autonomous trading and AI agents, have not demonstrated reliable economic returns outside narrow, supervised contexts.

For crypto-native portfolios, this bifurcation suggests several actionable implications:

1. Favor decentralized compute and storage infrastructure over AI agent tokens. Protocols providing verifiable compute, distributed storage, or decentralized inference routing benefit from the same scarcity dynamics driving hyperscaler capex, without exposure to the application-layer failures documented in autonomous trading research.

2. Maintain deep skepticism toward on-chain autonomous trading agents. The 81% loss rate in controlled arenas, combined with memorization leakage risks, implies that current-generation AI trading agents destroy value more often than they create it. Protocols premised on autonomous AI-driven returns face fundamental headwinds.

3. Monitor model routing infrastructure. As the competitive surface shifts from single-model capability to multi-model orchestration, decentralized routing and model marketplace protocols may capture value at the integration layer, where differentiation is emerging.

4. Hedge concentration risk in AI hardware exposure. Cerebras's revenue concentration and the broader pattern of hyperscaler bargaining power suggest that even successful AI infrastructure companies face margin pressure. Diversification across the physical stack, spanning power, memory, and packaging, reduces single-point-of-failure risk.

The binding constraint has moved from silicon to watts; the competitive moat has moved from models to routing; and the alpha generation challenge has moved from signal discovery to integration discipline. Capital allocation should follow these migrations.


References
1The Chokepoint #1: Power Moves Fastest
2LBNL Data Center Energy Reports: 2016 vs 2024, Annotated
3Is Anyone Going to Make These Data Centers Beautiful?
4CPUs are the Bottleneck - and Branching is the Unlock
5The Case for Memory as a Structural Growth Story, Not a Cyclical Trade
6Dylan Patel on AI Token Demand, Compute Supply Constraints, and the Economics of Intelligence
7[Investment Map] 15 Companies in the Glass Substrate Cycle: From Material to Mass Production
8Goldman Sachs: 1,000 Homes of Power in a Filing Cabinet — Rising Power Density Disrupts AI Infrastructure
9CNAS: American AI Companies Can't Get Enough Chips
1024/7 Wall St: The Real AI Trade May Not Be Software — It May Be Power Equipment
11AI IQ: Composite AI Model Intelligence and Cost-Performance Tracker
12The Best AI Models of March 2026
13Graphon Says Its Intelligence Layer Will Lighten the Load on AI Models
14Cerebras to Kick Off Hotly Anticipated Year for Artificial-Intelligence IPOs
15Hedge Funds Are Making a Killing in the 'Golden Age' of AI Hardware
16Notes from Inside China's AI Labs
17IDC: The Future of AI Is Model Routing
18USCC: Two Loops — How China's Open AI Strategy Reinforces Its Industrial Dominance
19Cerebras' $48 Billion IPO Tests the Market's Inference Bet — Investing.com
20AQAA: An Always-On Autonomous Quant AI Agent for Continuous Alpha Discovery, Self-Optimization, and Live Trading
21Can Agentic AI Predict Stock Returns?
22Can AI Do Financial Research? LLM-Guided Hypothesis Discovery in Asset Pricing
23AI Bots Auditioning for Wall Street Trading Are Mostly Losing
24Adaptive Alpha Weighting with PPO: Enhancing Prompt-Based LLM-Generated Alphas in Quant Trading
25Evaluating LLMs in Finance Requires Explicit Bias Consideration
26StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?
27Can Blindfolded LLMs Still Trade? An Anonymization-First Framework for Portfolio Optimization

This is a preview of our weekly research powered by ShikumiBot. The full platform is available to a limited group of development partners. Request access at ShikumiBot.xyz.


Disclaimer: The Shikumi Company publishes market analysis and educational content intended solely for informational and entertainment purposes. We are not registered investment advisors and do not provide individualized financial, legal, or tax advice. The opinions, charts, and trade ideas shared are based on the authors' personal research, experience, and judgment at the time of writing. All content is subject to change without notice and may be incomplete or inaccurate.

Nothing in this publication should be interpreted as a recommendation or solicitation to buy or sell any securities or financial instruments. Past performance is not indicative of future results, and all investments carry risk, including the potential loss of principal. Readers are strongly encouraged to conduct their own research and consult with licensed professionals before making investment decisions. The authors or affiliates of Shikumi may hold positions in assets mentioned and may benefit from market movements discussed herein.

We make no guarantees about the accuracy, completeness, or timeliness of the information provided. By accessing this newsletter or our related content, you agree to hold Shikumi harmless for any outcomes resulting from your interpretation or use of the material.