Why run local? Count the money

Reddit r/LocalLLaMA / 5/6/2026

💬 OpinionSignals & Early TrendsTools & Practical UsageIndustry & Market Moves

Key Points

  • The author reports running local AI models with Hermes on a 2-spark cluster and asks the system to tally token usage, which comes to about 200 million tokens in 5 days.
  • Using a cited provider benchmark of roughly $1.25 per million tokens, the author estimates the equivalent cloud cost at about $1,250 per month, suggesting the hardware can pay for itself within ~6 months (with caveats about purchase timing and pricing).
  • The piece argues that even if token usage is higher with more intensive programming workflows, the ROI from local deployment could still be attractive at high daily token volumes.
  • The motivation is framed as both cost efficiency and greater control over privacy and intellectual property compared with cloud services, reinforcing the author’s belief that local AI is a durable trend.
  • The author ends by asking readers whether their own local rigs feel more sustainable price-wise than they did six months ago.

I’m not a coder, but I run local models. I gave in to agent hype (I was building my own, but there is so much to do) and installed Hermes. Running with Qwen-397b out of a 2 spark cluster.
So…I asked Hermes today to tally the token count, and the result…200 million tokens. In 5 days.

At this rate, using an agent for tasks like installing software and debugging things I want to try out, what is the cost I am saving? Artificial Analysis says the price is about 1.25 dollars per million tokens on average from providers. At current pricing per Artificial Analysis, that gives me about 1250 dollars per month, and my sparks will pay themselves by 6 months.

So, caveats of course I bought them at cheaper prices than today, but it’s a simple estimate that there is some valid reasons to go local.

Like I said, I am not programming and I know there are programmers that easily triple my token count in the same time. That implies that if you use 100 million tokens per day, the return on investment is still there today, even with crazy computer prices.

To me, local AI is about the desire to utilize a cool technology without the strings attached that threaten individual privacy and intellectual property. But knowing that my investment is not just purely hobbyism gives me more conviction that local AI is the future.

I know I am preaching to the choir…So the question is, has anyone else felt their rig is becoming more sustainable now than 6 months ago, price wise? Would love to hear!!

submitted by /u/Badger-Purple
[link] [comments]