Research
We work on the algorithmic and engineering problems that arise when many robots have to act together.
Our work spans four overlapping themes — multi-agent coordination, learned communication, vision-based navigation, and the lab-scale hardware platforms that make experiments possible.
Themes
Learned communication
End-to-end learning of what to say, when, and to whom — under bandwidth, latency, and reliability constraints.
Multi-agent coordination
Decentralised decision-making and emergent cooperation in robot teams under partial observability and constrained communication.
Swarm robotics
Hardware platforms and field experiments for multi-drone research at lab scale.
Vision-based navigation
Lightweight visual perception and control for resource-constrained aerial and ground robots.
Papers
2026
- Mar 16PAPER npj RoboticsConcrete multi-agent path planning enabling kinodynamically aggressive maneuvers
Keisuke Okumura, Guang Yang, Zhan Gao, Heedo Woo, Amanda Prorok
We present concrete planning, a hybrid approach that captures real-world continuous dynamics while maintaining scalable guaranteed planning via discrete search. The framework integrates advances in robot dynamics learning, optimal control, and anytime complete planning into a modular system deployed with 40 robots — 20 aerial, 8 ground, and 12 obstacle robots — operating in a compact laboratory space.
- Jan 20PAPER ICLR 2026When Is Diversity Rewarded in Cooperative Multi-Agent Learning?
Michael Amir, Matteo Bettini, Amanda Prorok
An investigation of when and why behavioural diversity benefits cooperative multi-agent learning, establishing conditions under which heterogeneous policies outperform homogeneous ones.
- Jan 20PAPER ICLR 2026Pairwise is Not Enough: Hypergraph Neural Networks for Multi-Agent Pathfinding
Rishi Jain, Keisuke Okumura, Michael Amir, Pietro Lio, Amanda Prorok
Pairwise interactions captured by standard GNNs miss higher-order dependencies in multi-agent pathfinding. This work introduces hypergraph neural networks that model group-level constraints, improving solution quality for dense navigation scenarios.
- Jan 20PAPER ICLR 2026Remotely Detectable Robot Policy Watermarking
Michael Amir, Manon Flageat, Amanda Prorok
A method for embedding remotely detectable watermarks into robot policies, enabling verification of policy provenance without requiring access to the model parameters.
2025
- Dec 2025PAPER NeurIPS 2025No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes
Jasmine Bayrooti, Sattar Vakili, Amanda Prorok, Carl Henrik Ek
A no-regret Thompson sampling algorithm for finite-horizon Markov decision processes using Gaussian process models, providing efficient exploration in model-based reinforcement learning settings.
- Sep 2025PAPER Science RoboticsExtending robot minds through collective learning
Amanda Prorok
The current trend toward generalist robot behaviours powered by massive, monolithic AI models is unsustainable. This viewpoint article argues for a paradigm shift toward collective robotic intelligence — a "mixture-of-robots" approach where diverse, specialised agents learn and work together, inspired by natural systems where cooperation among specialised components leads to greater intelligence and resilience.
- Aug 2025PAPER CoRL 2025ReCoDe: Reinforcement Learning-based Dynamic Constraint Design for Multi-Agent Coordination
Michael Amir, Guang Yang, Zhan Gao, Keisuke Okumura, Heedo Woo, Amanda Prorok
A hybrid, decentralised framework combining optimisation-based control with adaptation provided by MARL. Rather than discarding expert controllers, ReCoDe improves them by learning additional dynamic constraints that capture subtler behaviours — for example, constraining agent movements to prevent congestion in cluttered scenarios.
- Jul 2025PAPER IROS 2025D4orm: Multi-Robot Trajectories with Dynamics-aware Diffusion Denoised Deformations
Yixiao Zhang, Keisuke Okumura, Heedo Woo, Ajay Shankar, Amanda Prorok
An optimisation method for generating kinodynamically feasible and collision-free multi-robot trajectories that exploits an incremental denoising scheme from diffusion models. Evaluated for differential-drive and holonomic teams with up to 16 robots in 2D and 3D worlds.
- May 2025PAPER ICRA 2025DVM-SLAM: Decentralized Visual Monocular Simultaneous Localization and Mapping for Multi-Agent Systems
Joshua Bird, Jan Blumenkamp, Amanda Prorok
A decentralised visual monocular SLAM system for multi-agent systems, enabling collaborative mapping and localisation without centralised infrastructure.
- May 2025PAPER ICRA 2025Language-Conditioned Offline RL for Multi-Robot Navigation
Steven Morad, Ajay Shankar, Jan Blumenkamp, Amanda Prorok
Offline reinforcement learning approach conditioned on natural language commands for decentralised multi-robot navigation, bridging high-level human instructions and low-level robot coordination policies.
- Apr 2025PAPER ICLR 2025Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling
Jasmine Bayrooti, Carl Henrik Ek, Amanda Prorok
An efficient model-based reinforcement learning method using optimistic Thompson sampling to balance exploration and exploitation in complex sequential decision-making problems.
- Mar 2025PAPER IEEE Transactions on RoboticsCo-Optimizing Reconfigurable Environments and Policies for Decentralized Multi-Agent Navigation
Zhan Gao, Guang Yang, Amanda Prorok
Treats the environment as a co-decision variable alongside agent policies, jointly optimising both the physical layout and the decentralised navigation policies for multi-agent systems.
2024
- Dec 2024PAPER JMLRBenchMARL: Benchmarking Multi-Agent Reinforcement Learning
Matteo Bettini, Amanda Prorok, Vincent Moens
A standardised library for benchmarking multi-agent reinforcement learning, enabling seamless mixing and matching of MARL algorithms, tasks, and models while maintaining rigorous reproducibility and standardisation.
- Nov 2024PAPER CoRL 2024Provably Safe Online Multi-Agent Navigation in Unknown Environments
Zhan Gao, Guang Yang, Jasmine Bayrooti, Amanda Prorok
A provably safe online method for multi-agent navigation in unknown environments, combining control barrier functions with decentralised planning to guarantee collision avoidance while maintaining liveness.
- Oct 2024PAPER CoRL 2024CoViS-Net: A Cooperative Visual Spatial Foundation Model for Multi-Robot Applications
Jan Blumenkamp, Steven Morad, Jennifer Gielis, Amanda Prorok
A decentralised visual spatial foundation model enabling real-time, platform-agnostic pose estimation and spatial comprehension for autonomous robots. Provides accurate pose estimates and local bird's-eye-view without needing camera overlap, deployed on wheeled platforms and a quadruped.
- Jul 2024PAPER ICML 2024Controlling Behavioral Diversity in Multi-Agent Reinforcement Learning
Matteo Bettini, Ryan Kortvelesy, Amanda Prorok
DiCo (Diversity Control) controls behavioural diversity to an exact value of a given metric by representing policies as the sum of a parameter-shared component and dynamically scaled per-agent components. Applied directly to the policy architecture, leaving the learning objective unchanged.
- May 2024PAPER DARS 2024The Cambridge RoboMaster: An Agile Multi-Robot Research Platform
Jan Blumenkamp, Steven Morad, Jennifer Gielis, Amanda Prorok
An agile multi-robot research platform built on DJI RoboMaster hardware, designed for real-world experiments in decentralised multi-agent coordination, supporting GNN-based policy deployment and sim-to-real transfer.
2023
- Dec 2023PAPER NeurIPS 2023Generalised f-Mean Aggregation for Graph Neural Networks
Ryan Kortvelesy, Steven Morad, Amanda Prorok
A generalised f-mean aggregation framework for graph neural networks that subsumes common aggregation functions (sum, mean, max) as special cases while enabling learnable, task-adaptive aggregation strategies.
- Dec 2023PAPER NeurIPS 2023Reinforcement Learning with Fast and Forgetful Memory
Steven Morad, Ryan Kortvelesy, Stephan Liwicki, Amanda Prorok
A new memory model for RL that serves as a super-efficient plug-in replacement for RNNs and transformers. Runs up to two orders of magnitude faster with linear space complexity, and sets new records on the POPGym benchmark for partially observable RL.
- May 2023PAPER AAMAS 2023Heterogeneous multi-robot reinforcement learning
Matteo Bettini, Ajay Shankar, Amanda Prorok
We study cooperative multi-robot tasks where the team is composed of agents with structurally different observations, action spaces, and reward functions. We introduce HetGPPO, a parameter-sharing paradigm enabling heterogeneous behaviours in multi-agent reinforcement learning with graph neural networks, achieving superior performance over role-blind baselines.
- May 2023PAPER ICLR 2023POPGym: Benchmarking Partially Observable Reinforcement Learning
Steven Morad, Ryan Kortvelesy, Matteo Bettini, Stephan Liwicki, Amanda Prorok
A comprehensive benchmark suite for partially observable reinforcement learning, providing a unified evaluation framework across diverse POMDP environments to compare memory-based RL architectures.
This page highlights recent work. For the complete record going back to 2008: