Microsoft debuts Surface RTX Spark Dev Box to run large AI models without cloud costs

VentureBeat / 6/3/2026

📰 NewsDeveloper Stack & InfrastructureSignals & Early TrendsIndustry & Market MovesModels & Research

Key Points

  • Microsoft unveiled the Surface RTX Spark Dev Box, a compact desktop designed to run large AI models locally to avoid cloud computing costs.
  • The device uses Nvidia’s Blackwell-architecture RTX Spark processor and includes 128GB unified memory, enabling developers to run models with over 120B parameters without making cloud API calls.
  • Microsoft says model effectiveness depends not only on size but also on sufficient context length; at 100,000 tokens of context, key-value cache can require 40–50GB of memory.
  • The Dev Box will be available later this year in the United States exclusively via Microsoft.com, with pricing not yet disclosed.
  • By positioning fixed, upfront hardware costs against per-token or metered cloud pricing, Microsoft aims to challenge the dominant AI economics since ChatGPT’s launch.

Continue reading this article on the original site.

Read original →