Assessing Y-Axis Influence: Bias in Multimodal Language Models on Chart-to-Table Translation

arXiv cs.AI / 4/29/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies chart-to-table translation and finds that public chart datasets contain imbalances in how y-axis information is represented, which can lead to unintended bias in multimodal language models.
  • It introduces a new evaluation framework, FairChart2Table, to systematically analyze y-axis–related bias across five state-of-the-art models.
  • The results show significant y-axis bias factors tied to major tick digit length, the number of major ticks, the y-axis value range, and the tick formatting style (e.g., abbreviations or scientific notation).
  • Beyond y-axis details, the study finds that the number of legends/entities in chart images also affects model performance, and that including y-axis information in prompts can noticeably improve results for some models.

Abstract

Chart-to-table translation converts chart images into structured tabular data. Accurate translation is crucial for Multimodal Language Model (MLM) to answer complex queries. We observe imbalances in the number of images across different aspects of the y-axis information in public chart datasets. Such imbalances can introduce unintended biases, causing uneven MLM performance. Previous works have not systematically examined these biases. To address this gap, we propose a new framework, FairChart2Table, for analyzing y-axis-related bias on five state-of-the-art models. Key Findings: (1) There are significant y-axis biases related to the digit length of the major tick values, the number of major ticks, the range of values, and the tick value format (e.g., abbreviation or scientific format). (2) The number of legends/entities in chart images impacts MLM performance. (3) Prompting MLM with y-axis information can significantly enhance the performance for some MLMs.