Sparse Personalized Text Generation with Multi-Trajectory Reasoning

arXiv cs.AI / 4/29/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper presents PAT (Personalization with Aligned Trajectories), a framework for cold-start personalization in LLM text generation where user histories are sparse or missing.
  • PAT retrieves two complementary signal types—writing-style cues from stylistically similar users and topic/context information from preference-aligned users—to reduce the noisiness and heterogeneity of raw external data.
  • It uses a reinforcement learning-based iterative dual-reasoning process that lets the LLM refine and jointly integrate the retrieved signals over multiple steps.
  • Experiments on real-world personalization benchmarks report consistent improvements in both generation quality and alignment when data is sparse, indicating strong effectiveness for the cold-start setting.

Abstract

As Large Language Models (LLMs) advance, personalization has become a key mechanism for tailoring outputs to individual user needs. However, most existing methods rely heavily on dense interaction histories, making them ineffective in cold-start scenarios where such data is sparse or unavailable. While external signals (e.g., content of similar users) can offer a potential remedy, leveraging them effectively remains challenging: raw context is often noisy, and existing methods struggle to reason over heterogeneous data sources. To address these issues, we introduce PAT (Personalization with Aligned Trajectories), a reasoning framework for cold-start LLM personalization. PAT first retrieves information along two complementary trajectories: writing-style cues from stylistically similar users and topic-specific context from preference-aligned users. It then employs a reinforcement learning-based, iterative dual-reasoning mechanism that enables the LLM to jointly refine and integrate these signals. Experimental results across real-world personalization benchmarks show that PAT consistently improves generation quality and alignment under sparse-data conditions, establishing a strong solution to the cold-start personalization problem.