PaperBanana: Automating Academic Illustration for AI Scientists

arXiv cs.CL / 3/25/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisTools & Practical UsageModels & Research

共有:

Key Points

The article introduces PaperBanana, an agentic framework that uses VLMs and image-generation models to automatically produce publication-ready academic illustrations for AI scientists.
PaperBanana coordinates specialized steps—reference retrieval, planning of content and style, image rendering, and iterative self-critique—to reduce the illustration bottleneck in research workflows.
It proposes PaperBananaBench, a benchmark with 292 test cases drawn from NeurIPS 2025 methodology diagrams across multiple domains and illustration styles.
Experiments indicate PaperBanana outperforms existing baselines on key criteria including faithfulness to sources, conciseness, readability, and aesthetics, and it can also generate statistical plots effectively.
The work positions automated illustration generation as a practical capability for more autonomous AI-scientist pipelines, aiming to streamline end-to-end paper preparation.

Abstract

Despite rapid advances in autonomous AI scientists powered by language models, generating publication-ready illustrations remains a labor-intensive bottleneck in the research workflow. To lift this burden, we introduce PaperBanana, an agentic framework for automated generation of publication-ready academic illustrations. Powered by state-of-the-art VLMs and image generation models, PaperBanana orchestrates specialized agents to retrieve references, plan content and style, render images, and iteratively refine via self-critique. To rigorously evaluate our framework, we introduce PaperBananaBench, comprising 292 test cases for methodology diagrams curated from NeurIPS 2025 publications, covering diverse research domains and illustration styles. Comprehensive experiments demonstrate that PaperBanana consistently outperforms leading baselines in faithfulness, conciseness, readability, and aesthetics. We further show that our method effectively extends to the generation of high-quality statistical plots. Collectively, PaperBanana paves the way for the automated generation of publication-ready illustrations.

Sentiment Analysis API Tutorial: Build a Customer Review Dashboard

Dev.to

Teaching AI Agents to Handle NFTs: ERC-721, ERC-1155, and Metaplex

Dev.to

The Complete Guide to Model Context Protocol (MCP): Building AI-Native Applications in 2026

Dev.to

AI Agent Skill Security Report — 2026-03-25

Dev.to

How to Build Multi-Agent AI Systems That Actually Work: A 2026 Practical Guide

Dev.to

PaperBanana: Automating Academic Illustration for AI Scientists

Key Points

Abstract

Related Articles

Sentiment Analysis API Tutorial: Build a Customer Review Dashboard

Teaching AI Agents to Handle NFTs: ERC-721, ERC-1155, and Metaplex

The Complete Guide to Model Context Protocol (MCP): Building AI-Native Applications in 2026

AI Agent Skill Security Report — 2026-03-25

How to Build Multi-Agent AI Systems That Actually Work: A 2026 Practical Guide

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer