Latent patterns of urban mixing in mobility analysis across five global cities

arXiv cs.AI / 4/15/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The study analyzes large-scale travel survey data from over 200,000 residents across five global cities (Boston, Chicago, Hong Kong, London, and Sao Paulo) to uncover latent patterns of social mixing that high-resolution mobility data alone cannot reveal.
  • It finds measurable discrepancies in inferred social mixing when socioeconomic status is estimated from residential neighborhoods versus when it is taken from self-reported survey data (estimated mixing is about 16% lower).
  • Using evidence across age and caregiving profiles, the paper supports the “second youth” hypothesis by showing higher social mixing among people over 66 than among those aged 55–65, while teenagers and women with caregiving responsibilities show lower mixing.
  • Transit access matters: proximity to major transit stations reduces the impact of individual socioeconomic status on social mixing across the cities.
  • The authors build city-specific spatio-temporal place networks with a graph neural network and show that where people travel to (activity-space structure) explains most variation in place exposure, with income groups potentially having similar mixing levels despite stratified activity spaces that produce different social mixing experiences.

Abstract

This study leverages large-scale travel surveys for over 200,000 residents across Boston, Chicago, Hong Kong, London, and Sao Paulo. With rich individual-level data, we make systematic comparisons and reveal patterns in social mixing, which cannot be identified by analyzing high-resolution mobility data alone. Using the same set of data, inferring socioeconomic status from residential neighborhoods yield social mixing levels 16% lower than using self-reported survey data. Besides, individuals over the age of 66 experience greater social mixing than those in late working life (aged 55 to 65), lending data-driven support to the "second youth" hypothesis. Teenagers and women with caregiving responsibilities exhibit lower social mixing levels. Across the five cities, proximity to major transit stations reduces the influence of individual socioeconomic status on social mixing. Finally, we construct detailed spatio-temporal place networks for each city using a graph neural network. Inputs of home-space, activity-space and demographic attributes are embedded and fed into a supervised autoencoder to predict individual exposure vectors. Results show that the structure of individual activity space, i.e., where people travel to, explains most of the variations in place exposure, suggesting that mobility shapes experienced social mixing more than sociodemographic characteristics, home environment, and transit proximity. The ablation tests further discover that, while different income groups may experience similar levels of social mixing, their activity spaces remain stratified by income, resulting in structurally different social mixing experiences.