The new SWE-bench numbers for Mythos just dropped, and the gap between it and the current best is terrifying.
SWE-bench Verified:
Mythos: 93.9%
Opus 4.6: 80.8%
SWE-bench Pro:
Mythos: 77.8%
Opus 4.6: 53.4%
That Pro score is a nearly 25% jump in autonomous coding. Factor in the rumors around Project Glasswing giving it deep architectural understanding, and the barrier between a prompt and a fully deployed product is basically gone.
Imagine what you will be able to build when Mythos drops.
All you need is a laptop and an idea. What are you building first?
[link] [comments]




