AI Navigate

Demand Acceptance using Reinforcement Learning for Dynamic Vehicle Routing Problem with Emission Quota

arXiv cs.LG / 3/17/2026

📰 NewsModels & Research

Key Points

  • Introduces and formalizes the Dynamic and Stochastic Vehicle Routing Problem with Emission Quota (DS-QVRP-RR), a novel routing problem that combines dynamic demand acceptance and routing under a global emission constraint.
  • Proposes a two-layer optimization framework enabling anticipatory rejection of demands and generation of new routes.
  • Develops hybrid algorithms that combine reinforcement learning with combinatorial optimization techniques to solve DS-QVRP-RR.
  • Presents a comprehensive computational study comparing the approach against traditional methods, showing effectiveness across different input types and horizons of uncertainty.

Abstract

This paper introduces and formalizes the Dynamic and Stochastic Vehicle Routing Problem with Emission Quota (DS-QVRP-RR), a novel routing problems that integrates dynamic demand acceptance and routing with a global emission constraint. A key contribution is a two-layer optimization framework designed to facilitate anticipatory rejections of demands and generation of new routes. To solve this, we develop hybrid algorithms that combine reinforcement learning with combinatorial optimization techniques. We present a comprehensive computational study that compares our approach against traditional methods. Our findings demonstrate the relevance of our approach for different types of inputs, even when the horizon of the problem is uncertain.