Even the best AI models lose about half their performance when charts get complicated, new benchmark finds

THE DECODER / 4/19/2026

Key Points

The RealChart2Code benchmark evaluates 14 leading AI models using complex visualizations generated from real-world datasets to test code generation quality.
Results show that even top proprietary models lose nearly half their performance when the input charts become more complicated.
The findings suggest that current models are more reliable on simpler chart-to-code tasks than on inputs dense with diagrams, graphs, and other visualizations.
The benchmark highlights the need for better robustness in AI systems that translate visual analytics into executable code under challenging conditions.

Collage of diagram windows, color schemes and cables as a symbol for the complexity of converting visualizations into code.

The RealChart2Code benchmark puts 14 leading AI models to the test on complex visualizations built from real-world datasets. Even the top proprietary models lose nearly half their performance compared to simpler tests.
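To make "lose nearly half their performance" concrete, here is a minimal sketch of how a relative drop between a simpler and a harder benchmark could be computed. The function name `relative_drop` and both scores are hypothetical placeholders chosen for illustration; they are not figures from RealChart2Code, and the benchmark's actual scoring method is not described here.

```python
def relative_drop(simple_score: float, complex_score: float) -> float:
    """Return the fraction of the simple-benchmark score lost on the complex one."""
    if simple_score <= 0:
        raise ValueError("simple_score must be positive")
    return (simple_score - complex_score) / simple_score


# Hypothetical scores for illustration only (not taken from the benchmark).
simple = 80.0
complex_ = 42.0

drop = relative_drop(simple, complex_)
print(f"Relative drop: {drop:.1%}")
```

Under these assumed numbers, a model scoring 80 on simple charts and 42 on complex ones would have lost a little under half of its score, which is the kind of gap the summary describes.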

The article Even the best AI models lose about half their performance when charts get complicated, new benchmark finds appeared first on The Decoder.
