Efficient Personalization of Generative User Interfaces

arXiv cs.LG / 4/14/2026

💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper examines why personalizing generative user interfaces is difficult, noting that UI preferences are subjective, costly to infer from sparse feedback, and hard to specify as explicit rules.
  • It introduces a new dataset built from 20 trained designers who provide pairwise judgments over the same 600 generated UIs, revealing substantial preference disagreement across designers (average kappa = 0.25).
  • Analysis of designers’ written rationales shows that even when they use similar high-level concepts (e.g., hierarchy or cleanliness), they differ in definitions, prioritization, and how those concepts are applied.
  • To address these challenges, the authors propose a sample-efficient personalization method that represents a new user through prior designers (preference priors) rather than a fixed rubric of design concepts.
  • In evaluations, the preference model beats a pretrained UI evaluator and a larger multimodal model, scales better with more feedback, and when applied to generation produces interfaces preferred by 12 new designers versus baseline and direct prompting approaches.

Abstract

Generative user interfaces (UIs) create new opportunities to adapt interfaces to individual users on demand, but personalization remains difficult because desirable UI properties are subjective, hard to articulate, and costly to infer from sparse feedback. We study this problem through a new dataset in which 20 trained designers each provide pairwise judgments over the same 600 generated UIs, enabling direct analysis of preference divergence. We find substantial disagreement across designers (average kappa = 0.25), and written rationales reveal that even when designers appeal to similar concepts such as hierarchy or cleanliness, designers differ in how they define, prioritize, and apply those concepts. Motivated by these findings, we develop a sample-efficient personalization method that represents a new user in terms of prior designers rather than a fixed rubric of design concepts. In a technical evaluation, our preference model outperforms both a pretrained UI evaluator and a larger multimodal model, and scales better with additional feedback. When used to personalize generation, it also produces interfaces preferred by 12 new designers over baseline approaches, including direct user prompting. Our findings suggest that lightweight preference elicitation can serve as a practical foundation for personalized generative UI systems.