Two Approaches to Direct Estimation of Riesz Representers

arXiv stat.ML / 3/24/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper studies how two econometrics/ML approaches to directly estimating Riesz representers relate, focusing on debiased machine learning versus sieve methods for conditional moment models.
  • It shows that when using unregularized or ridge-regularized linear/sieve/RKHS models, the two estimators become numerically equivalent.
  • For other regularization methods like Lasso, and for broader ML function classes (including neural networks), the estimators may diverge rather than remain equivalent.
  • The authors propose that extending the Chen et al. (2014) viewpoint to machine learning leads to a new constrained optimization formulation for Riesz representers.
  • They conjecture—based on prior results—that the constrained approach could improve statistical performance but likely increases computational complexity.

Abstract

The Riesz representer is a central object in semiparametric statistics and debiased/doubly-robust estimation. Two literatures in econometrics have highlighted the role for directly estimating Riesz representers: the automatic debiased machine learning literature (as in Chernozhukov et al., 2022b), and an independent literature on sieve methods for conditional moment models (as in Chen et al., 2014). These two literatures solve distinct optimization problems that in the population both have the Riesz representer as their solution. We show that with unregularized or ridge-regularized linear, sieve, or RKHS models, the two resulting estimators are numerically equivalent. However, for other regularization schemes such as the Lasso, or more general machine learning function classes including neural networks, the estimators are not necessarily equivalent. In the latter case, the Chen et al. (2014) formulation yields a novel constrained optimization problem for directly estimating Riesz representers with machine learning. Drawing on results from Birrell et al. (2022), we conjecture that this approach may offer statistical advantages at the cost of greater computational complexity.