Non-Reciprocating Sharing Methods in Cooperative Q-Learning Environments

dc.contributor.authorCunningham, Bryanen
dc.contributor.committeechairCao, Yongen
dc.contributor.committeememberKavanaugh, Andrea L.en
dc.contributor.committeememberCao, Yangen
dc.contributor.departmentComputer Science and Applicationsen
dc.date.accessioned2014-03-14T20:43:41Zen
dc.date.adate2012-08-28en
dc.date.available2014-03-14T20:43:41Zen
dc.date.issued2012-08-09en
dc.date.rdate2012-08-28en
dc.date.sdate2012-08-17en
dc.description.abstractPast research on multi-agent simulation with cooperative reinforcement learning (RL) for homogeneous agents focuses on developing sharing strategies that are adopted and used by all agents in the environment. These sharing strategies are considered to be reciprocating because all participating agents have a predefined agreement regarding what type of information is shared, when it is shared, and how the participating agent's policies are subsequently updated. The sharing strategies are specifically designed around manipulating this shared information to improve learning performance. This thesis targets situations where the assumption of a single sharing strategy that is employed by all agents is not valid. This work seeks to address how agents with no predetermined sharing partners can exploit groups of cooperatively learning agents to improve learning performance when compared to Independent learning. Specifically, several intra-agent methods are proposed that do not assume a reciprocating sharing relationship and leverage the pre-existing agent interface associated with Q-Learning to expedite learning. The other agents' functions and their sharing strategies are unknown and inaccessible from the point of view of the agent(s) using the proposed methods. The proposed methods are evaluated on physically embodied agents in the multi-agent cooperative robotics field learning a navigation task via simulation. The experiments conducted focus on the effects of the following factors on the performance of the proposed non-reciprocating methods: scaling the number of agents in the environment, limiting the communication range of the agents, and scaling the size of the environment.en
dc.description.degreeMaster of Scienceen
dc.identifier.otheretd-08172012-110113en
dc.identifier.sourceurlhttp://scholar.lib.vt.edu/theses/available/etd-08172012-110113/en
dc.identifier.urihttp://hdl.handle.net/10919/34610en
dc.publisherVirginia Techen
dc.relation.haspartCunningham_BL_T_2012.pdfen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectInformation Exchanges in Multi-Agent Systemsen
dc.subjectMulti-Agent Reinforcement Learningen
dc.subjectAgent Interaction Protocolsen
dc.subjectCooperative Learningen
dc.titleNon-Reciprocating Sharing Methods in Cooperative Q-Learning Environmentsen
dc.typeThesisen
thesis.degree.disciplineComputer Science and Applicationsen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.levelmastersen
thesis.degree.nameMaster of Scienceen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Cunningham_BL_T_2012.pdf
Size:
1.54 MB
Format:
Adobe Portable Document Format

Collections