Show simple item record

dc.contributor.authorSenft, Een
dc.contributor.authorLemaignan, Sen
dc.contributor.authorBaxter, Pen
dc.contributor.authorBelpaeme, Ten
dc.date.accessioned2019-09-02T11:53:00Z
dc.date.available2019-09-02T11:53:00Z
dc.date.issued2017-01-01en
dc.identifier.isbn9781577357940en
dc.identifier.urihttp://hdl.handle.net/10026.1/14855
dc.description.abstract

Social interacting is a complex task for which machine learning holds particular promise. However, as no sufficiently accurate simulator of human interactions exists today, the learning of social interaction strategies has to happen online in the real world. Actions executed by the robot impact on humans, and as such have to be carefully selected, making it impossible to rely on random exploration. Additionally, no clear reward function exists for social interactions. This implies that traditional approaches used for Reinforcement Learning cannot be directly applied for learning how to interact with the social world. As such we argue that robots will profit from human expertise and guidance to learn social interactions. However, as the quantity of input a human can provide is limited, new methods have to be designed to use human input more efficiently. In this paper we describe a setup in which we combine a framework called Supervised Progressively Autonomous Robot Competencies (SPARC), which allows safer online learning with Reinforcement Learning, with the use of partial states radier than full states to accelerate generalisation and obtain a usable action policy more quickly.

en
dc.format.extent109 - 113en
dc.language.isoenen
dc.titleToward supervised reinforcement learning with partial states for social HRIen
dc.typeConference Contribution
plymouth.volumeFS-17-01 - FS-17-05en
plymouth.publication-statusPublisheden
plymouth.journalAAAI Fall Symposium - Technical Reporten
plymouth.organisational-group/Plymouth
plymouth.organisational-group/Plymouth/Faculty of Science and Engineering
plymouth.organisational-group/Plymouth/REF 2021 Researchers by UoA
plymouth.organisational-group/Plymouth/REF 2021 Researchers by UoA/UoA11 Computer Science and Informatics
plymouth.organisational-group/Plymouth/Research Groups
plymouth.organisational-group/Plymouth/Research Groups/Marine Institute
plymouth.organisational-group/Plymouth/Users by role
dc.rights.embargoperiodNot knownen
rioxxterms.licenseref.urihttp://www.rioxx.net/licenses/all-rights-reserveden
rioxxterms.typeConference Paper/Proceeding/Abstracten


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

Show simple item record


All items in PEARL are protected by copyright law.
Author manuscripts deposited to comply with open access mandates are made available in accordance with publisher policies. Please cite only the published version using the details provided on the item record or document. In the absence of an open licence (e.g. Creative Commons), permissions for further reuse of content should be sought from the publisher or author.
Theme by 
Atmire NV