Multi-Agent Hierarchical Distributed NMPC With Learned Locomotion via Reinforcement Learning

dc.contributor.authorPastore, Vittorioen
dc.contributor.committeechairAkbari Hamed, Kavehen
dc.contributor.committeememberStepputtis, Simon Benjaminen
dc.contributor.committeememberLosey, Dylan Patricken
dc.contributor.departmentMechanical Engineeringen
dc.date.accessioned2026-06-23T08:00:49Zen
dc.date.available2026-06-23T08:00:49Zen
dc.date.issued2026-06-22en
dc.description.abstractThis thesis investigates the development of a distributed, hierarchical control architecture for the coordinated navigation of multi-agent robot teams. Due to the high-dimensional, nonlinear dynamics inherent in legged locomotion and the coupled nature of multi-robot spatial planning, centralized control approaches face significant computational bottlenecks. To mitigate these challenges, the proposed framework decomposes the navigation problem across two timescales. At the high level, a Distributed Nonlinear Model Predictive Controller (DNMPC) operating at 5,Hz calculates velocity commands utilizing the Alternating Direction Method of Multipliers (ADMM). Collision avoidance is addressed through a consensus projection on the shared ADMM variable for inter-agent separation, combined with smooth exponential barriers for static obstacles, providing empirically verified safe navigation. At the low level, an end-to-end deep reinforcement learning (RL) policy operates at 250,Hz to track the planner's velocity commands. Synthesized via Proximal Policy Optimization (PPO) with targeted domain randomization, the actor-critic policy directly maps proprioceptive observations to joint position targets. The mathematical formulation of the ADMM-NMPC planner and the underlying RL reward structures are detailed, and the integrated hierarchy is validated through MuJoCo simulations on various legged robots and experimental deployments on the Unitree A1 quadruped, demonstrating a scalable, real-time framework for robust multi-agent navigation on resource-constrained embedded hardware.en
dc.description.abstractgeneralThis thesis explores an integrated approach to safely and efficiently control teams of robots in shared environments. Because the physical task of controlling a walking robot is incredibly complex, calculating both the motor commands and the collision avoidance simultaneously presents an intractable challenge for standard onboard computers. This research addresses this by decoupling the problem into a hierarchical architecture. First, a high-level cooperative planner calculates safe, obstacle-free paths for the robots by modeling them as simplified moving objects. Second, a robust deep neural network policy translates those broad path directions into precise, high-frequency joint commands required to physically drive the robot. By separating the swarm collision control from the intricate mechanics of locomotion, this framework enables multiple legged platforms to operate safely, collaboratively, and efficiently in the real world using standard mobile computing.en
dc.description.degreeMaster of Scienceen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:46941en
dc.identifier.urihttps://hdl.handle.net/10919/143475en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.subjectMulti-Agent Systemsen
dc.subjectDistributed NMPCen
dc.subjectReinforcement Learningen
dc.subjectLegged Locomotionen
dc.titleMulti-Agent Hierarchical Distributed NMPC With Learned Locomotion via Reinforcement Learningen
dc.typeThesisen
thesis.degree.disciplineMechanical Engineeringen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.levelmastersen
thesis.degree.nameMaster of Scienceen

Files

Original bundle
Now showing 1 - 1 of 1
Name:
Pastore_V_T_2026.pdf
Size:
10.88 MB
Format:
Adobe Portable Document Format

Collections