A Reinforcement Learning-based Scheduler for Minimizing Casualties of a Military Drone Swarm

Jin, Heng

A Reinforcement Learning-based Scheduler for Minimizing Casualties of a Military Drone Swarm

dc.contributor.author	Jin, Heng	en
dc.contributor.committeechair	Hou, Yiwei Thomas	en
dc.contributor.committeemember	Lou, Wenjing	en
dc.contributor.committeemember	Liu, Qingyu	en
dc.contributor.department	Computer Science	en
dc.date.accessioned	2022-07-15T08:00:07Z	en
dc.date.available	2022-07-15T08:00:07Z	en
dc.date.issued	2022-07-14	en
dc.description.abstract	In this thesis, we consider a swarm of military drones flying over an unfriendly territory, where a drone can be shot down by an enemy with an age-based risk probability. We study the problem of scheduling surveillance image transmissions among the drones with the objective of minimizing the overall casualty. We present Hector, a reinforcement learning-based scheduling algorithm. Specifically, Hector only uses the age of each detected target, a piece of locally available information at each drone, as an input to a neural network to make scheduling decisions. Extensive simulations show that Hector significantly reduces casualties than a baseline round-robin algorithm. Further, Hector can offer comparable performance to a high-performing greedy scheduler, which assumes complete knowledge of global information.	en
dc.description.abstractgeneral	Drones have been successfully deployed by the military. The advancement of machine learning further empowers drones to automatically identify, recognize, and even eliminate adversary targets on the battlefield. However, to minimize unnecessary casualties to civilians, it is important to introduce additional checks and control from the control center before lethal force is authorized. Thus, the communication between drones and the control center becomes critical. In this thesis, we study the problem of communication between a military drone swarm and the control center when drones are flying over unfriendly territory where drones can be shot down by enemies. We present Hector, an algorithm based on machine learning, to minimize the overall casualty of drones by scheduling data transmission. Extensive simulations show that Hector significantly reduces casualties than traditional algorithms.	en
dc.description.degree	Master of Science	en
dc.format.medium	ETD	en
dc.identifier.other	vt_gsexam:35276	en
dc.identifier.uri	http://hdl.handle.net/10919/111255	en
dc.language.iso	en	en
dc.publisher	Virginia Tech	en
dc.rights	In Copyright	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.subject	Drone swarm	en
dc.subject	Casualty	en
dc.subject	Scheduling	en
dc.subject	Age of Information	en
dc.subject	Reinforcement learning	en
dc.title	A Reinforcement Learning-based Scheduler for Minimizing Casualties of a Military Drone Swarm	en
dc.type	Thesis	en
thesis.degree.discipline	Computer Science and Applications	en
thesis.degree.grantor	Virginia Polytechnic Institute and State University	en
thesis.degree.level	masters	en
thesis.degree.name	Master of Science	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Jin_H_T_2022.pdf
Size:: 1.98 MB
Format:: Adobe Portable Document Format

Download

Collections

Masters Theses