Karma Mechanisms for Decentralised, Cooperative Multi Agent Path Finding

arXiv cs.RO / 4/10/2026

💬 OpinionIdeas & Deep AnalysisModels & Research

Key Points

  • The paper addresses Multi-Agent Path Finding (MAPF), where many agents must compute conflict-free trajectories under limited computation and communication constraints.
  • It proposes a decentralized cooperative framework using “Karma mechanisms,” i.e., non-tradeable credits that track agents’ past cooperative behavior to influence how future conflicts are resolved.
  • Conflict resolution is cast as a bilateral negotiation process that allows pairwise replanning without requiring global priority structures, aiming to maintain fairness over time.
  • Evaluated in a lifelong robotic warehouse pickup-and-delivery setting with kinematic orientation constraints, the method balances replanning workload across agents and reduces service-time disparities without reducing overall efficiency.
  • The work provides an associated code repository, supporting reproduction and further experimentation with the Karma-based decentralized MAPF approach.

Abstract

Multi-Agent Path Finding (MAPF) is a fundamental coordination problem in large-scale robotic and cyber-physical systems, where multiple agents must compute conflict-free trajectories with limited computational and communication resources. While centralised optimal solvers provide guarantees on solution optimality, their exponential computational complexity limits scalability to large-scale systems and real-time applicability. Existing decentralised heuristics are faster, but result in suboptimal outcomes and high cost disparities. This paper proposes a decentralised coordination framework for cooperative MAPF based on Karma mechanisms - artificial, non-tradeable credits that account for agents' past cooperative behaviour and regulate future conflict resolution decisions. The approach formulates conflict resolution as a bilateral negotiation process that enables agents to resolve conflicts through pairwise replanning while promoting long-term fairness under limited communication and without global priority structures. The mechanism is evaluated in a lifelong robotic warehouse multi-agent pickup-and-delivery scenario with kinematic orientation constraints. The results highlight that the Karma mechanism balances replanning effort across agents, reducing disparity in service times without sacrificing overall efficiency. Code: https://github.com/DerKevinRiehl/karma_dmapf