AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage
arXiv cs.AI / 4/27/2026
📰 NewsIdeas & Deep AnalysisTools & Practical UsageModels & Research
Key Points
- The paper introduces “paper lineage,” an approach that mines implicit knowledge from a paper’s cited literature to make research reproduction less dependent on domain expertise.
- It presents AutoReproduce, a multi-agent framework that autonomously reproduces experimental code end-to-end, aiming for complete workflow coverage.
- To improve executability, AutoReproduce uses a sampling-based unit testing strategy for fast validation during reproduction.
- The authors propose “AutoReproduceBench” (referred to as ourbench), a benchmark with verified implementations and metrics to evaluate both reproduction fidelity and execution fidelity.
- Experiments on PaperBench and AutoReproduceBench show AutoReproduce outperforms prior baselines across all metrics, with notable gains in both reproduction fidelity and final execution performance.




