Non-Reciprocating Sharing Methods in Cooperative Q-Learning Environments

Cunningham, Bryan

Non-Reciprocating Sharing Methods in Cooperative Q-Learning Environments

dc.contributor.author	Cunningham, Bryan	en
dc.contributor.committeechair	Cao, Yong	en
dc.contributor.committeemember	Kavanaugh, Andrea L.	en
dc.contributor.committeemember	Cao, Yang	en
dc.contributor.department	Computer Science and Applications	en
dc.date.accessioned	2014-03-14T20:43:41Z	en
dc.date.adate	2012-08-28	en
dc.date.available	2014-03-14T20:43:41Z	en
dc.date.issued	2012-08-09	en
dc.date.rdate	2012-08-28	en
dc.date.sdate	2012-08-17	en
dc.description.abstract	Past research on multi-agent simulation with cooperative reinforcement learning (RL) for homogeneous agents focuses on developing sharing strategies that are adopted and used by all agents in the environment. These sharing strategies are considered to be reciprocating because all participating agents have a predefined agreement regarding what type of information is shared, when it is shared, and how the participating agent's policies are subsequently updated. The sharing strategies are specifically designed around manipulating this shared information to improve learning performance. This thesis targets situations where the assumption of a single sharing strategy that is employed by all agents is not valid. This work seeks to address how agents with no predetermined sharing partners can exploit groups of cooperatively learning agents to improve learning performance when compared to Independent learning. Specifically, several intra-agent methods are proposed that do not assume a reciprocating sharing relationship and leverage the pre-existing agent interface associated with Q-Learning to expedite learning. The other agents' functions and their sharing strategies are unknown and inaccessible from the point of view of the agent(s) using the proposed methods. The proposed methods are evaluated on physically embodied agents in the multi-agent cooperative robotics field learning a navigation task via simulation. The experiments conducted focus on the effects of the following factors on the performance of the proposed non-reciprocating methods: scaling the number of agents in the environment, limiting the communication range of the agents, and scaling the size of the environment.	en
dc.description.degree	Master of Science	en
dc.identifier.other	etd-08172012-110113	en
dc.identifier.sourceurl	http://scholar.lib.vt.edu/theses/available/etd-08172012-110113/	en
dc.identifier.uri	http://hdl.handle.net/10919/34610	en
dc.publisher	Virginia Tech	en
dc.relation.haspart	Cunningham_BL_T_2012.pdf	en
dc.rights	In Copyright	en
dc.rights.uri	http://rightsstatements.org/vocab/InC/1.0/	en
dc.subject	Information Exchanges in Multi-Agent Systems	en
dc.subject	Multi-Agent Reinforcement Learning	en
dc.subject	Agent Interaction Protocols	en
dc.subject	Cooperative Learning	en
dc.title	Non-Reciprocating Sharing Methods in Cooperative Q-Learning Environments	en
dc.type	Thesis	en
thesis.degree.discipline	Computer Science and Applications	en
thesis.degree.grantor	Virginia Polytechnic Institute and State University	en
thesis.degree.level	masters	en
thesis.degree.name	Master of Science	en

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Cunningham_BL_T_2012.pdf
Size:: 1.54 MB
Format:: Adobe Portable Document Format

Download

Collections

Masters Theses