MARLIN: Multi-Agent Reinforcement Learning for Incremental DAG Discovery
arXiv cs.LG / 2026/3/24
💬 オピニオンIdeas & Deep AnalysisModels & Research
要点
- The paper introduces MARLIN, a multi-agent reinforcement learning method aimed at efficiently learning causal structures (directed acyclic graphs, DAGs) from observational data.
- MARLIN improves online suitability by using a continuous-to-DAG mapping policy for incremental, intra-batch DAG generation along with two complementary RL agents (state-specific and state-invariant) to identify causal relationships.
- The approach integrates the agents into an incremental learning framework so causal structure discovery can proceed over time rather than as a one-shot process.
- MARLIN employs a factored action space to increase parallelization efficiency, improving runtime performance.
- Experiments on both synthetic and real datasets show MARLIN outperforming existing state-of-the-art methods in both effectiveness and efficiency.

