My weekend project overran a little but happy with the end result.
soleval pass@1 beat Opus 4.7 on the same set of tasks. Some more work to be done here but any feedback is welcome, I spent quite a lot of time (and money) on this one!
[link] [comments]



