PaperBanana: Automating Academic Illustration for AI Scientists

arXiv cs.CL / 3/25/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisTools & Practical UsageModels & Research

Key Points

  • The article introduces PaperBanana, an agentic framework that uses VLMs and image-generation models to automatically produce publication-ready academic illustrations for AI scientists.
  • PaperBanana coordinates specialized steps—reference retrieval, planning of content and style, image rendering, and iterative self-critique—to reduce the illustration bottleneck in research workflows.
  • It proposes PaperBananaBench, a benchmark with 292 test cases drawn from NeurIPS 2025 methodology diagrams across multiple domains and illustration styles.
  • Experiments indicate PaperBanana outperforms existing baselines on key criteria including faithfulness to sources, conciseness, readability, and aesthetics, and it can also generate statistical plots effectively.
  • The work positions automated illustration generation as a practical capability for more autonomous AI-scientist pipelines, aiming to streamline end-to-end paper preparation.

Abstract

Despite rapid advances in autonomous AI scientists powered by language models, generating publication-ready illustrations remains a labor-intensive bottleneck in the research workflow. To lift this burden, we introduce PaperBanana, an agentic framework for automated generation of publication-ready academic illustrations. Powered by state-of-the-art VLMs and image generation models, PaperBanana orchestrates specialized agents to retrieve references, plan content and style, render images, and iteratively refine via self-critique. To rigorously evaluate our framework, we introduce PaperBananaBench, comprising 292 test cases for methodology diagrams curated from NeurIPS 2025 publications, covering diverse research domains and illustration styles. Comprehensive experiments demonstrate that PaperBanana consistently outperforms leading baselines in faithfulness, conciseness, readability, and aesthetics. We further show that our method effectively extends to the generation of high-quality statistical plots. Collectively, PaperBanana paves the way for the automated generation of publication-ready illustrations.