AI Navigate

Representing data in words: A context engineering approach

arXiv cs.CL / 3/16/2026

💬 OpinionModels & Research

Key Points

  • Wordalisations propose transforming numerical data into descriptive texts that are as digestible as visualisations, addressing LLMs' difficulty with numeric reasoning.
  • The approach is demonstrated on three applications: scouting football players, personality tests, and international survey data.
  • They evaluate accuracy with both LLM-as-judge and human-as-judge experiments, reporting engaging and faithful representations of data.
  • The authors outline best practices for open and transparent development and communication about data.

Abstract

Large language models (LLMs) have demonstrated remarkable potential across a broad range of applications. However, producing reliable text that faithfully represents data remains a challenge. While prior work has shown that task-specific conditioning through in-context learning and knowledge augmentation can improve performance, LLMs continue to struggle with interpreting and reasoning about numerical data. To address this, we introduce wordalisations, a methodology for generating stylistically natural narratives from data. Much like how visualisations display numerical data in a way that is easy to digest, wordalisations abstract data insights into descriptive texts. To illustrate the method's versatility, we apply it to three application areas: scouting football players, personality tests, and international survey data. Due to the absence of standardized benchmarks for this specific task, we conduct LLM-as-a-judge and human-as-a-judge evaluations to assess accuracy across the three applications. We found that wordalisation produces engaging texts that accurately represent the data. We further describe best practice methods for open and transparent development of communication about data.