Show simple item record

dc.contributor.authorKirke, Alexis John
dc.contributor.otherSchool of Engineering, Computing and Mathematicsen_US
dc.date.accessioned2011-09-26T11:17:16Z
dc.date.available2011-09-26T11:17:16Z
dc.date.issued1997
dc.identifierNot availableen_US
dc.identifier.urihttp://hdl.handle.net/10026.1/657
dc.descriptionMerged with duplicate record 10026.1/1984 on 27.02.2017 by CS (TIS)
dc.description.abstract

This thesis addresses the problem of setting the balance between exploration and exploitation in teams of learning robots who exchange information. Specifically it looks at groups of robots whose tasks include moving between salient points in the environment. To deal with unknown and dynamic environments,such robots need to be able to discover and learn the routes between these points themselves. A natural extension of this scenario is to allow the robots to exchange learned routes so that only one robot needs to learn a route for the whole team to use that route. One contribution of this thesis is to identify a dilemma created by this extension: that once one robot has learned a route between two points, all other robots will follow that route without looking for shorter versions. This trade-off will be labeled the Distributed Exploration vs. Exploitation Dilemma, since increasing distributed exploitation (allowing robots to exchange more routes) means decreasing distributed exploration (reducing robots ability to learn new versions of routes), and vice-versa. At different times, teams may be required with different balances of exploitation and exploration. The main contribution of this thesis is to present a system for setting the balance between exploration and exploitation in a group of robots. This system is demonstrated through experiments involving simulated robot teams. The experiments show that increasing and decreasing the value of a parameter of the novel system will lead to a significant increase and decrease respectively in average exploitation (and an equivalent decrease and increase in average exploration) over a series of team missions. A further set of experiments show that this holds true for a range of team sizes and numbers of goals.

en_US
dc.language.isoenen_US
dc.publisherUniversity of Plymouthen_US
dc.titleLearning and Co-operation in Mobile Multi-Robot Systemsen_US
dc.typeThesis
dc.identifier.doihttp://dx.doi.org/10.24382/4252
dc.identifier.doihttp://dx.doi.org/10.24382/4252


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record


All items in PEARL are protected by copyright law.
Author manuscripts deposited to comply with open access mandates are made available in accordance with publisher policies. Please cite only the published version using the details provided on the item record or document. In the absence of an open licence (e.g. Creative Commons), permissions for further reuse of content should be sought from the publisher or author.
Theme by 
Atmire NV