AI Semiconductor/GPU Economics: NVIDIA / TPU / Trainium

AI Navigate Original / 4/27/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisIndustry & Market Moves
共有:

Key Points

  • AI competition is decided by compute procurement at huge scale
  • NVIDIA dominant; challengers: TPU, Trainium, Maia, Groq/Cerebras ASICs
  • China uses Ascend/SMIC under export controls
  • Bottlenecks: TSMC/HBM/power/cooling; multi-cloud/chip strategy

AI Competition's Essence Is Chip Procurement

The 2024-2026 AI competition is an era decided not just by model smartness but by "how much compute you can amass." OpenAI, Anthropic, Google, Meta, xAI secure GPU/TPU at hundreds-of-billions-of-yen scale, and governments advance trillion-yen-scale investment in sovereign-AI strategies.

NVIDIA's Dominance and Challengers

NVIDIA H100 / H200 / B100 / B200 / GB200

The Hopper generation (H100) in 2024, successor H200, and from 2025 the Blackwell generation (B100, B200, GB200) mass-produced. $40-50k per card, one Blackwell GPU rack (GB200 NVL72) on the order of 100M yen. Supply tightness makes 6-12-month waits the norm.

Google TPU v5/v6

Google's in-house TPU updated v5p (2023), v5e (2024), v6 Trillium (2025), v6e (2026). Supports Gemini training/inference. Also provided externally via GCP (Anthropic, various startups).

AWS Trainium / Inferentia

AWS's AI-dedicated chips. Trainium2 (2024) for training, Inferentia2 for inference. Large-scale adoption in exchange for investing in Anthropic. Targets 30-50% cheaper than H100 on cost-performance.

Microsoft Maia / Cobalt

Microsoft's in-house AI chips announced in 2023. For OpenAI-model operation and Azure customers. Aims to reduce NVIDIA dependence and improve Azure margins.

Dedicated Inference ASICs

  • Groq LPU: ultra-low-latency inference-dedicated. Llama 3 over 500 tokens/sec
  • Cerebras WSE-3: a giant AI chip where 1 chip = wafer-sized
  • SambaNova: enterprise-dedicated
  • Etched Sohu: a Transformer-dedicated ASIC

China's Situation

US export controls ban H100/H200 export to China. NVIDIA sold regulation-compliant versions (H800, H20, L20, L2) but controls tightened from 2024. China's response:

  • Huawei Ascend 910B/910C: performance 60-80% of H100
  • Startups like Biren, Moore Threads
  • SMIC (China's TSMC equivalent) strengthening 7nm/5nm manufacturing
  • DeepSeek etc. proving "frontier models possible even with regulation-compliant versions"

Bottlenecks

Sign up to read the full article

Create a free account to access the full content of our original articles.