Reinforcement Learning with Gaussian Processes for Unmanned Aerial Vehicle Navigation

Gondhalekar, Nahush Ramesh

Reinforcement Learning with Gaussian Processes for Unmanned Aerial Vehicle Navigation

dc.contributor.author	Gondhalekar, Nahush Ramesh	en
dc.contributor.committeechair	Tokekar, Pratap	en
dc.contributor.committeemember	Zeng, Haibo	en
dc.contributor.committeemember	Abbott, A. Lynn	en
dc.contributor.department	Electrical and Computer Engineering	en
dc.date.accessioned	2017-08-04T08:00:43Z	en
dc.date.available	2017-08-04T08:00:43Z	en
dc.date.issued	2017-08-03	en
dc.description.abstract	We study the problem of Reinforcement Learning (RL) for Unmanned Aerial Vehicle (UAV) navigation with the smallest number of real world samples possible. This work is motivated by applications of learning autonomous navigation for aerial robots in structural inspec- tion. A naive RL implementation suffers from curse of dimensionality in large continuous state spaces. Gaussian Processes (GPs) exploit the spatial correlation to approximate state- action transition dynamics or value function in large state spaces. By incorporating GPs in naive Q-learning we achieve better performance in smaller number of samples. The evalua- tion is performed using simulations with an aerial robot. We also present a Multi-Fidelity Reinforcement Learning (MFRL) algorithm that leverages Gaussian Processes to learn the optimal policy in a real world environment leveraging samples gathered from a lower fidelity simulator. In MFRL, an agent uses multiple simulators of the real environment to perform actions. With multiple levels of fidelity in a simulator chain, the number of samples used in successively higher simulators can be reduced.	en
dc.description.abstractgeneral	Increasing development in the field of infrastructure inspection using Unmanned Aerial Vehicles (UAVs) has been seen in the recent years. This thesis presents work related to UAV navigation using Reinforcement Learning (RL) with the smallest number of real world samples. A naive RL implementation suffers from the curse of dimensionality in large continuous state spaces. Gaussian Processes (GPs) exploit the spatial correlation to approximate state-action transition dynamics or value function in large state spaces. By incorporating GPs in naive Q-learning we achieve better performance in smaller number of samples. The evaluation is performed using simulations with an aerial robot. We also present a Multi-Fidelity Reinforcement Learning (MFRL) algorithm that leverages Gaussian Processes to learn the optimal policy in a real world environment leveraging samples gathered from a lower fidelity simulator. In MFRL, an agent uses multiple simulators of the real environment to perform actions. With multiple levels of fidelity in a simulator chain, the number of samples used in successively higher simulators can be reduced. By developing a bidirectional simulator chain, we try to provide a learning platform for the robots to safely learn required skills in the smallest possible number of real world samples possible.	en
dc.description.degree	Master of Science	en
dc.format.medium	ETD	en
dc.identifier.other	vt_gsexam:12330	en
dc.identifier.uri	http://hdl.handle.net/10919/78667	en
dc.publisher	Virginia Tech	en
dc.rights	In Copyright	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.subject	Reinforcement Learning	en
dc.subject	Gaussian Processes	en
dc.subject	Unmanned Aerial Vehicle Navigation	en
dc.title	Reinforcement Learning with Gaussian Processes for Unmanned Aerial Vehicle Navigation	en
dc.type	Thesis	en
thesis.degree.discipline	Computer Engineering	en
thesis.degree.grantor	Virginia Polytechnic Institute and State University	en
thesis.degree.level	masters	en
thesis.degree.name	Master of Science	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Gondhalekar_NR_T_2017.pdf
Size:: 42.29 MB
Format:: Adobe Portable Document Format

Download

Collections

Masters Theses