Boutta thrash some MoE speeds on a Blackwell + M3 Ultra RDMA cluster. There’s a bit less than 2 TB of RAM here. I want to exchange ideas with you guys and run some cool experiments. What benches would you guys like to see? EDIT: Given all the interest in this post, I will be streaming this on the sub’s Discord. Let me know what you guys want to do and I’ll add it to the list! Follow me on X @mlx_reaper
Tinygrad Driver testing!
Reddit r/LocalLLaMA / 5/3/2026
💬 Opinion · Developer Stack & Infrastructure · Signals & Early Trends · Tools & Practical Usage · Models & Research
Key Points
- The post announces Tinygrad driver testing on a Blackwell + M3 Ultra RDMA cluster with just under 2 TB of RAM.
- The author is specifically looking to benchmark Mixture-of-Experts (MoE) speeds in that environment and proposes using the results for follow-on experiments.
- They invite community input on which benchmarks to run, indicating an iterative, user-driven testing plan.
- The author says they will stream the activity on the subreddit’s Discord and coordinate benchmark selection with interested participants.