Value-Aware Product Recommendation by Customer Segmentation using a suitable High-Dimensional Similarity Measure

arXiv cs.LG / 5/1/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper proposes a value-aware product recommendation method that accounts for both the high dimensionality and sparsity of typical user-item interaction data.
  • It explicitly incorporates each product’s and user’s contribution to overall sales revenue by encoding revenue contributions in the user-item matrix and computing customer similarity from that representation.
  • The approach segments customers based on revenue-based similarity of their purchase baskets, enabling recommendations that better match profitability goals rather than generic relevance.
  • The authors compare conventional similarity/distance metrics with a new alternative designed for high-dimensional settings and introduce three recommendation strategies (revenue share, product popularity, and expected profit).
  • The method is validated via simulation experiments and a real-world case study using the UCI Online Retail dataset.

Abstract

This paper presents a novel value-aware approach to product recommendation that simultaneously addresses the high dimensionality and sparsity of user-item data while explicitly incorporating the contribution of each product and user to overall sales revenue. The proposed framework encodes revenue contributions in the user-item matrix and computes customer similarity directly on this basis using suitable distance measures. This enables the segmentation of users according to the revenue-based similarity of their purchase baskets and supports recommendations aligned with profitability objectives. We compare conventional similarity metrics with a novel alternative tailored to high-dimensional contexts and propose three recommendation strategies based on revenue share, product popularity, and expected profit generation. The effectiveness of the proposed method is validated through simulation experiments and a real-world application using the UCI Online Retail dataset.