A Reinforcement Learning-based Scheduler for Minimizing Casualties of a Military Drone Swarm

Files

TR Number

Date

2022-07-14

Journal Title

Journal ISSN

Volume Title

Publisher

Virginia Tech

Abstract

In this thesis, we consider a swarm of military drones flying over an unfriendly territory, where a drone can be shot down by an enemy with an age-based risk probability. We study the problem of scheduling surveillance image transmissions among the drones with the objective of minimizing the overall casualty. We present Hector, a reinforcement learning-based scheduling algorithm. Specifically, Hector only uses the age of each detected target, a piece of locally available information at each drone, as an input to a neural network to make scheduling decisions. Extensive simulations show that Hector significantly reduces casualties than a baseline round-robin algorithm. Further, Hector can offer comparable performance to a high-performing greedy scheduler, which assumes complete knowledge of global information.

Description

Keywords

Drone swarm, Casualty, Scheduling, Age of Information, Reinforcement learning

Citation

Collections