Fusing Cellular Network Data and Tollbooth Counts for Urban Traffic Flow Estimation

arXiv cs.LG / 4/20/2026

📰 NewsIdeas & Deep AnalysisModels & Research

Key Points

  • The study addresses a key challenge in urban transport planning: obtaining vehicle-category-specific origin-destination (OD) data needed for traffic simulations.
  • It proposes a machine learning framework that fuses aggregated mobility signals from cellular network activity with sparse tollbooth counts, using the tollbooths as ground truth to correct systematic biases and disaggregate modes.
  • The approach uses temporal and spatial features to learn the relationship between aggregated and vehicular data, then applies route- and logic-based routing to distribute corrected flows across OD pairs.
  • A case study on a bus depot expansion in Trondheim, Norway, produces hourly OD matrices by vehicle length category, demonstrating that accurate but limited sensors can calibrate more extensive mobility data.
  • The authors argue the method is generalizable and can support downstream micro-scale traffic analyses and infrastructure planning in data-scarce settings.

Abstract

Traffic simulations, essential for planning urban transit infrastructure interventions, require vehicle-category-specific origin-destination (OD) data. Existing data sources are imperfect: sparse tollbooth sensors provide accurate vehicle counts by category, while extensive mobility data from cellular network activity captures aggregated crowd movement, but lack modal disaggregation and have systematic biases. This study develops a machine learning framework to correct and disaggregate cellular network data using sparse tollbooth counts as ground truth. The model uses temporal and spatial features to learn the complex relationship between aggregated mobility data and vehicular data. The framework infers destinations from transit routes and implements routing logic to distribute corrected flows between OD pairs. This approach is applied to a bus depot expansion in Trondheim, Norway, generating hourly OD matrices by vehicle length category. The results show how limited but accurate sensor measurements can correct extensive but aggregated mobility data to produce grounded estimates of background vehicular traffic flows. These macro-scale estimates can be refined for micro-scale analysis at desired locations. The framework provides a generalisable approach for generating origin-destination data from cellular network data. This enables downstream tasks, like detailed traffic simulations for infrastructure planning in data-scarce contexts, supporting urban planners in making informed decisions.

Fusing Cellular Network Data and Tollbooth Counts for Urban Traffic Flow Estimation | AI Navigate