Reliable and Efficient Multi-Agent Coordination via Graph Neural Network Variational Autoencoders

TODO: alt text


Introducing a novel approach leveraging Graph Neural Network Variational Autoencoders (GNN-VAE) for multi-agent coordination, our method encodes optimal global schedules into a latent space to generate constraint-compliant coordination plans. Outperforming traditional centralized optimization in scalability and runtime, it ensures reliable, deadlock-free multi-robot navigation even in densely populated environments.

Abstract

Multi-agent coordination is crucial for reliable multi-robot navigation in shared spaces such as automated warehouses. In regions of dense robot traffic, local coordination methods may fail to find a deadlock-free solution. In these scenarios, it is appropriate to let a central unit generate a global schedule that decides the passing order of robots. However, the runtime of such centralized coordination methods increases significantly with the problem scale. In this paper, we propose to leverage Graph Neural Network Variational Autoencoders (GNN-VAE) to solve the multi-agent coordination problem at scale faster than through centralized optimization. We formulate the coordination problem as a graph problem and collect ground truth data using a Mixed-Integer Linear Program (MILP) solver. During training, our learning framework encodes good quality solutions of the graph problem into a latent space. At inference time, solution samples are decoded from the sampled latent variables, and the lowest-cost sample is selected for coordination. By construction, our GNN-VAE framework returns solutions that always respect the constraints of the considered coordination problem. Numerical results show that our approach trained on small-scale problems can achieve high-quality solutions even for large-scale problems with 250 robots, being much faster than other baselines.

TODO: alt text


Ours can generate close-to-oracle (MILP) assignments with the optimality ratio consistently over 0.9 while the optimality ratio curves for B-BTS and CMA-ES drop quickly as the number of robots is more than 20. This shows the great generalizability of our approach. Ours can be 10 to 20 times faster than the baselines, solving coordination problem with 250 robots in less than 5 seconds on average.

More content coming soon!