A Reinforcement Learning-based Scheduler for Minimizing Casualties of a Military Drone Swarm
dc.contributor.author | Jin, Heng | en |
dc.contributor.committeechair | Hou, Yiwei Thomas | en |
dc.contributor.committeemember | Lou, Wenjing | en |
dc.contributor.committeemember | Liu, Qingyu | en |
dc.contributor.department | Computer Science | en |
dc.date.accessioned | 2022-07-15T08:00:07Z | en |
dc.date.available | 2022-07-15T08:00:07Z | en |
dc.date.issued | 2022-07-14 | en |
dc.description.abstract | In this thesis, we consider a swarm of military drones flying over unfriendly territory, where a drone can be shot down by an enemy with an age-based risk probability. We study the problem of scheduling surveillance image transmissions among the drones with the objective of minimizing overall casualties. We present Hector, a reinforcement learning-based scheduling algorithm. Specifically, Hector uses only the age of each detected target, a piece of information locally available at each drone, as input to a neural network to make scheduling decisions. Extensive simulations show that Hector significantly reduces casualties compared with a baseline round-robin algorithm. Further, Hector offers performance comparable to that of a high-performing greedy scheduler that assumes complete knowledge of global information. | en |
dc.description.abstractgeneral | Drones have been successfully deployed by the military. Advances in machine learning further enable drones to automatically identify, recognize, and even eliminate adversary targets on the battlefield. However, to minimize unnecessary civilian casualties, it is important to introduce additional checks and oversight from the control center before lethal force is authorized. Communication between drones and the control center therefore becomes critical. In this thesis, we study the problem of communication between a military drone swarm and the control center when the drones are flying over unfriendly territory, where they can be shot down by enemies. We present Hector, a machine learning-based algorithm that minimizes overall drone casualties by scheduling data transmissions. Extensive simulations show that Hector significantly reduces casualties compared with traditional algorithms. | en |
dc.description.degree | Master of Science | en |
dc.format.medium | ETD | en |
dc.identifier.other | vt_gsexam:35276 | en |
dc.identifier.uri | http://hdl.handle.net/10919/111255 | en |
dc.language.iso | en | en |
dc.publisher | Virginia Tech | en |
dc.rights | In Copyright | en |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
dc.subject | Drone swarm | en |
dc.subject | Casualty | en |
dc.subject | Scheduling | en |
dc.subject | Age of Information | en |
dc.subject | Reinforcement learning | en |
dc.title | A Reinforcement Learning-based Scheduler for Minimizing Casualties of a Military Drone Swarm | en |
dc.type | Thesis | en |
thesis.degree.discipline | Computer Science and Applications | en |
thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
thesis.degree.level | masters | en |
thesis.degree.name | Master of Science | en |