PeReGrINE: Evaluating Personalized Review Fidelity with User Item Graph Context

arXiv cs.CL / 4/10/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

共有:

Key Points

PeReGrINE is a new benchmark and evaluation framework for generating personalized reviews using graph-structured user–item evidence with explicit temporal cutoff constraints.
The approach restructures Amazon Reviews 2023 into a temporally consistent bipartite graph and conditions each target review on bounded evidence from user history, item context, and neighborhood interactions.
To capture persistent preferences despite sparse raw histories, it computes a User Style Parameter summarizing users’ linguistic and affective tendencies from prior reviews.
It enables controlled comparisons across four retrieval evidence settings (product-only, user-only, neighbor-only, and combined) and adds “Dissonance Analysis” to measure deviations in user style and product-level consensus.
The study also tests visual evidence as auxiliary context, finding it can improve textual quality in some scenarios, while graph-derived evidence remains the primary driver of personalization and consistency.

Abstract

We introduce PeReGrINE, a benchmark and evaluation framework for personalized review generation grounded in graph-structured user--item evidence. PeReGrINE restructures Amazon Reviews 2023 into a temporally consistent bipartite graph, where each target review is conditioned on bounded evidence from user history, item context, and neighborhood interactions under explicit temporal cutoffs. To represent persistent user preferences without conditioning directly on sparse raw histories, we compute a User Style Parameter that summarizes each user's linguistic and affective tendencies over prior reviews. This setup supports controlled comparison of four graph-derived retrieval settings: product-only, user-only, neighbor-only, and combined evidence. Beyond standard generation metrics, we introduce Dissonance Analysis, a macro-level evaluation framework that measures deviation from expected user style and product-level consensus. We also study visual evidence as an auxiliary context source and find that it can improve textual quality in some settings, while graph-derived evidence remains the main driver of personalization and consistency. Across product categories, PeReGrINE offers a reproducible way to study how evidence composition affects review fidelity, personalization, and grounding in retrieval-conditioned language models.

GLM 5.1 tops the code arena rankings for open models

Reddit r/LocalLLaMA

can we talk about how AI has gotten really good at lying to you?

Reddit r/artificial

AI just found thousands of zero-days. Your firewall is still pattern-matching from 2014

Dev.to

Emergency Room and the Vanishing Moat

Dev.to

I Built a 100% Browser-Based OCR That Never Uploads Your Documents — Here's How

Dev.to

PeReGrINE: Evaluating Personalized Review Fidelity with User Item Graph Context

Key Points

Abstract

Related Articles

GLM 5.1 tops the code arena rankings for open models

can we talk about how AI has gotten really good at lying to you?

AI just found thousands of zero-days. Your firewall is still pattern-matching from 2014

Emergency Room and the Vanishing Moat

I Built a 100% Browser-Based OCR That Never Uploads Your Documents — Here's How

関連おすすめサービス

Notta搭載AI議事録イヤホン ZENCHORD1

AI搭載ボイスレコーダー Plaud

画像高画質化AIツール Aiarty Image Enhancer