Procedural Generation of Algorithm Discovery Tasks in Machine Learning
arXiv cs.LG / 3/19/2026
📰 NewsModels & Research
Key Points
- DiscoGen is a procedural generator of algorithm-discovery tasks for machine learning, enabling automated creation and evaluation of new ML algorithms.
- It addresses key limitations of prior task suites, such as poor evaluation methodologies, data contamination, and saturation of similar problems.
- The generator spans millions of tasks across multiple ML fields, parameterized by a small configuration set, and can be used to optimize algorithm discovery agents (ADAs).
- It includes DiscoBench, a fixed, small benchmark subset for principled ADA evaluation, and supports experiments like prompt optimization of ADAs.
- The project is open-source and hosted at GitHub, inviting researchers to explore new directions in algorithm discovery (https://github.com/AlexGoldie/discogen).
Related Articles

I made a 'benchmark' where LLMs write code controlling units in a 1v1 RTS game.
Dev.to

My AI Does Not Have a Clock
Dev.to
How to settle on a coding LLM ? What parameters to watch out for ?
Reddit r/LocalLLaMA

Andrej Karpathy's autonomous AI research agent ran 700 experiments in 2 days and gave a glimpse of where AI is heading
Reddit r/artificial

So cursor admits that Kimi K2.5 is the best open source model
Reddit r/LocalLLaMA