Multi-Agent Hierarchical Distributed NMPC With Learned Locomotion via Reinforcement Learning
| dc.contributor.author | Pastore, Vittorio | en |
| dc.contributor.committeechair | Akbari Hamed, Kaveh | en |
| dc.contributor.committeemember | Stepputtis, Simon Benjamin | en |
| dc.contributor.committeemember | Losey, Dylan Patrick | en |
| dc.contributor.department | Mechanical Engineering | en |
| dc.date.accessioned | 2026-06-23T08:00:49Z | en |
| dc.date.available | 2026-06-23T08:00:49Z | en |
| dc.date.issued | 2026-06-22 | en |
| dc.description.abstract | This thesis investigates the development of a distributed, hierarchical control architecture for the coordinated navigation of multi-agent robot teams. Due to the high-dimensional, nonlinear dynamics inherent in legged locomotion and the coupled nature of multi-robot spatial planning, centralized control approaches face significant computational bottlenecks. To mitigate these challenges, the proposed framework decomposes the navigation problem across two timescales. At the high level, a Distributed Nonlinear Model Predictive Controller (DNMPC) operating at 5,Hz calculates velocity commands utilizing the Alternating Direction Method of Multipliers (ADMM). Collision avoidance is addressed through a consensus projection on the shared ADMM variable for inter-agent separation, combined with smooth exponential barriers for static obstacles, providing empirically verified safe navigation. At the low level, an end-to-end deep reinforcement learning (RL) policy operates at 250,Hz to track the planner's velocity commands. Synthesized via Proximal Policy Optimization (PPO) with targeted domain randomization, the actor-critic policy directly maps proprioceptive observations to joint position targets. The mathematical formulation of the ADMM-NMPC planner and the underlying RL reward structures are detailed, and the integrated hierarchy is validated through MuJoCo simulations on various legged robots and experimental deployments on the Unitree A1 quadruped, demonstrating a scalable, real-time framework for robust multi-agent navigation on resource-constrained embedded hardware. | en |
| dc.description.abstractgeneral | This thesis explores an integrated approach to safely and efficiently control teams of robots in shared environments. Because the physical task of controlling a walking robot is incredibly complex, calculating both the motor commands and the collision avoidance simultaneously presents an intractable challenge for standard onboard computers. This research addresses this by decoupling the problem into a hierarchical architecture. First, a high-level cooperative planner calculates safe, obstacle-free paths for the robots by modeling them as simplified moving objects. Second, a robust deep neural network policy translates those broad path directions into precise, high-frequency joint commands required to physically drive the robot. By separating the swarm collision control from the intricate mechanics of locomotion, this framework enables multiple legged platforms to operate safely, collaboratively, and efficiently in the real world using standard mobile computing. | en |
| dc.description.degree | Master of Science | en |
| dc.format.medium | ETD | en |
| dc.identifier.other | vt_gsexam:46941 | en |
| dc.identifier.uri | https://hdl.handle.net/10919/143475 | en |
| dc.language.iso | en | en |
| dc.publisher | Virginia Tech | en |
| dc.rights | Creative Commons Attribution 4.0 International | en |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | en |
| dc.subject | Multi-Agent Systems | en |
| dc.subject | Distributed NMPC | en |
| dc.subject | Reinforcement Learning | en |
| dc.subject | Legged Locomotion | en |
| dc.title | Multi-Agent Hierarchical Distributed NMPC With Learned Locomotion via Reinforcement Learning | en |
| dc.type | Thesis | en |
| thesis.degree.discipline | Mechanical Engineering | en |
| thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
| thesis.degree.level | masters | en |
| thesis.degree.name | Master of Science | en |
Files
Original bundle
1 - 1 of 1