Beam-aware Kernelized Contextual Bandits for User Association and Beamforming in mmWave Vehicular Networks

arXiv cs.LG / 3/23/2026

📰 NewsModels & Research

Key Points

  • The article proposes a Beam-aware Kernelized Contextual UCB (BKC-UCB) algorithm to estimate instantaneous transmission rates in mmWave vehicular networks without additional channel measurements.
  • It uses historical context such as vehicle location and velocity, along with past observed transmission rates, by mapping contexts into a reproducing kernel Hilbert space (RKHS) to capture nonlinear relationships.
  • The beam index is embedded into the context so the algorithm can exploit correlations among beams rather than treating each beam as an independent arm, accelerating convergence.
  • An event-triggered information sharing mechanism is incorporated to exchange information only when significant explorations occur, reducing communication overhead.
  • The approach aims to enable timely beamforming decisions at serving base stations in high-mobility environments while reducing estimation overhead.

Abstract

Timely channel information is necessary for vehicles to determine both the serving base station (BS) and the beamforming vector, but frequent estimation of fast-fading mmWave channels incurs significant overhead. To address this challenge, we propose a Beam-aware Kernelized Contextual Upper Confidence Bound (BKC-UCB) algorithm that estimates instantaneous transmission rates without additional channel measurements by exploiting historical contexts such as vehicle location and velocity, together with past observed transmission rates. Specifically, BKC-UCB leverages kernel methods to capture the nonlinear relationship between context and transmission rate by mapping contexts into a reproducing kernel Hilbert space (RKHS), where linear learning becomes feasible. Rather than treating each beam as an independent arm, the beam index is embedded into the context, enabling BKC-UCB to exploit correlations among beams to accelerate convergence. Furthermore, an event-triggered information sharing mechanism is incorporated into BKC-UCB, enabling information exchange only when significant explorations are conducted to improve learning efficiency with limited communication overhead.