FiLMMeD: Feature-wise Linear Modulation for Cross-Problem Multi-Depot Vehicle Routing

arXiv cs.LG / 5/1/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The paper presents FiLMMeD, a unified neural combinatorial optimization model designed to solve 24 different multi-depot vehicle routing problem (MDVRP) variants, targeting the real-world heterogeneity of constraints.
  • It improves cross-variant generalization by adding Feature-wise Linear Modulation (FiLM) to a Transformer encoder, dynamically conditioning internal representations on the currently active constraint set.
  • The authors introduce Preference Optimization as an initial multi-task learning (MTL) training approach, arguing it is superior to reinforcement learning for future MTL research.
  • To reduce the generalization gap introduced by multi-depot constraints, they propose a targeted curriculum learning strategy that gradually increases the complexity of constraint interactions.
  • Experiments on 24 MDVRP variants (including 8 new formulations) and 16 single-depot VRPs show FiLMMeD consistently outperforms existing state-of-the-art baselines, and the code is released on GitHub.

Abstract

Solving practical multi-depot vehicle routing problems (MDVRP) is a challenging optimization task central to modern logistics, increasingly driven by e-commerce. To address the MDVRP's computational complexity, neural-based combinatorial optimization methods offer a promising scalable alternative to traditional approaches. However, neural-based methods typically rely on rigid architectures and input encodings tailored to specific problem formulations. In real-world settings, heterogeneous constraints create multiple MDVRP variants, limiting the applicability of such models. While multi-task learning (MTL) has begun to accelerate the development of unified neural-based solvers, prior works focus almost exclusively on single-depot VRPs, leaving the MDVRP unaddressed. To bridge this gap, we propose Feature-wise Linear Modulation for Cross-Problem Multi-Depot Vehicle Routing (FiLMMeD), a novel unified neural-based model for 24 different MDVRP variants. We introduce three main contributions: (1) to improve the model's generalization, we augment the standard Transformer encoder with Feature-wise Linear Modulation (FiLM), which dynamically conditions learned internal representations based on the active set of constraints; (2) we provide an initial demonstration of Preference Optimization in the MTL setting, establishing it as a superior alternative to Reinforcement Learning for future MTL works; (3) to mitigate the generalization gap caused by the introduction of multi-depot constraints, we introduce a targeted curriculum learning strategy that progressively exposes the model to increasingly more complex constraint interactions. Extensive experiments on 24 MDVRP variants (including 8 novel formulations) and 16 single-depot VRPs confirm the effectiveness of FiLMMeD, which consistently outperforms state-of-the-art baselines. Our code is available at: https://github.com/AJ-Correa/FiLMMeD/tree/main