
OCP: Orthogonal Constrained Projection for Sparse Scaling in Industrial Commodity Recommendation

arXiv cs.LG / 3/20/2026


Key Points

  • The paper proposes Orthogonal Constrained Projection (OCP) to improve Item-Id embedding representations in industrial commodity recommender systems, addressing sparse scaling and representation collapse.
  • By enforcing orthogonality, OCP aligns the learned embeddings' singular value spectrum with an orthogonal basis, yielding high singular entropy and isotropic generalized features while suppressing spurious correlations.
  • Empirical results show faster loss convergence and improved scalability, including consistent performance gains when the model's dense layers are scaled up.
  • In large-scale deployment on JD.com, OCP achieves a 12.97% uplift in UCXR and an 8.9% uplift in GMV, indicating strong practical utility for scaling both sparse vocabularies and dense architectures.
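The "singular entropy" mentioned in the key points can be illustrated with a small diagnostic. The sketch below assumes one common definition, the Shannon entropy of the normalized singular values, rescaled to [0, 1]; the paper's exact definition is not given in this summary. The function name `singular_entropy` and the synthetic tables are hypothetical.

```python
import numpy as np

def singular_entropy(embeddings: np.ndarray, eps: float = 1e-12) -> float:
    """Normalized Shannon entropy of the singular value spectrum.

    Assumed definition (not taken from the paper): treat the singular
    values as a probability distribution and divide its entropy by
    log(number of singular values), so a perfectly flat spectrum gives
    a value near 1. Requires at least two singular values.
    """
    s = np.linalg.svd(embeddings, compute_uv=False)
    p = s / (s.sum() + eps)
    entropy = -np.sum(p * np.log(p + eps))
    return float(entropy / np.log(len(s)))

rng = np.random.default_rng(0)

# Hypothetical isotropic table: 1000 items, 16 dimensions, all directions used.
isotropic = rng.standard_normal((1000, 16))

# Hypothetical collapsed table: almost all variance along a single direction.
collapsed = (
    rng.standard_normal((1000, 1)) @ rng.standard_normal((1, 16))
    + 0.01 * rng.standard_normal((1000, 16))
)

iso_entropy = singular_entropy(isotropic)
col_entropy = singular_entropy(collapsed)
print(f"isotropic: {iso_entropy:.3f}, collapsed: {col_entropy:.3f}")
```

Under this assumed definition, a table whose spectrum is dominated by a few directions scores well below an isotropic one, which is the sense in which low singular entropy signals representation collapse.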

Abstract

In industrial commodity recommendation systems, the representation quality of Item-Id vocabularies directly impacts the scalability and generalization ability of recommendation models. A key challenge is that traditional Item-Id vocabularies, when subjected to sparse scaling, suffer from low-frequency information interference, which restricts their expressive power for massive item sets and leads to representation collapse. To address this issue, we propose an Orthogonal Constrained Projection (OCP) method to optimize embedding representation. By enforcing orthogonality, the projection constrains the backpropagation manifold, aligning the singular value spectrum of the learned embeddings with the orthogonal basis. This alignment ensures high singular entropy, thereby preserving isotropic generalized features while suppressing spurious correlations and overfitting to rare items. Empirical results demonstrate that OCP accelerates loss convergence and enhances the model's scalability; notably, it enables consistent performance gains when scaling up dense layers. Large-scale industrial deployment on JD.com further confirms its efficacy, yielding a 12.97% increase in UCXR and an 8.9% uplift in GMV, highlighting its robust utility for scaling up both sparse vocabularies and dense architectures.
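The abstract does not spell out how orthogonality is enforced, so the following is only a minimal sketch of two standard options, not the authors' method: a soft penalty on the deviation of W^T W from the identity, and a hard projection onto the nearest matrix with orthonormal columns (the polar factor, computed via SVD). The names `orthogonality_penalty` and `nearest_orthonormal`, the matrix shapes, and the random data are all assumptions made for illustration.

```python
import numpy as np

def orthogonality_penalty(W: np.ndarray) -> float:
    """Squared Frobenius distance between W^T W and the identity.

    A common soft constraint; it is zero exactly when the columns of W
    are orthonormal.
    """
    k = W.shape[1]
    gram = W.T @ W
    return float(np.sum((gram - np.eye(k)) ** 2))

def nearest_orthonormal(W: np.ndarray) -> np.ndarray:
    """Closest matrix (in Frobenius norm) with orthonormal columns.

    Uses the polar factor U @ Vt from the thin SVD W = U diag(s) Vt.
    Assumes W has full column rank.
    """
    U, _, Vt = np.linalg.svd(W, full_matrices=False)
    return U @ Vt

rng = np.random.default_rng(0)

# Hypothetical projection matrix mapping 32-dim embeddings to 16 dims.
W = rng.standard_normal((32, 16))
W_orth = nearest_orthonormal(W)

penalty_before = orthogonality_penalty(W)
penalty_after = orthogonality_penalty(W_orth)

# Hypothetical raw Item-Id embeddings for 1000 items, then projected.
E = rng.standard_normal((1000, 32))
projected = E @ W_orth

print(f"penalty before: {penalty_before:.2f}, after: {penalty_after:.2e}")
```

In a training loop, the soft penalty would typically be added to the loss with a weighting coefficient, while the hard projection would be applied after each update; which of these, if either, corresponds to OCP is not stated in the abstract.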