Learning to Focus: CSI-Free Hierarchical MARL for Reconfigurable Reflectors
arXiv cs.AI / 4/8/2026
💬 OpinionSignals & Early TrendsIdeas & Deep AnalysisModels & Research
Key Points
- The paper proposes a “CSI-free” control framework for reconfigurable intelligent surfaces (RIS) that avoids costly channel state information (CSI) estimation by using user localization data instead.
- It introduces a hierarchical multi-agent reinforcement learning (HMARL) architecture with a two-tier design: a high-level controller for discrete user-to-reflector allocation and low-level MAPPO agents for continuous focal-point optimization under CTDE.
- Deterministic ray-tracing results show up to 7.79 dB improvements in received signal strength (RSSI) compared with centralized optimization baselines.
- The approach is evaluated as robust to multi-user scaling and resilient to realistic sub-meter localization tracking errors, maintaining strong beam-focusing performance.
- By reducing CSI-related computational overhead while preserving high-fidelity signal redirection, the work frames a scalable and cost-effective blueprint for intelligent wireless environments.

