Physical interaction as communication: Learning robot objectives online from human corrections

dc.contributor.authorLosey, Dylan P.en
dc.contributor.authorBajcsy, Andreaen
dc.contributor.authorO'Malley, Marcia K.en
dc.contributor.authorDragan, Anca D.en
dc.date.accessioned2022-02-11T21:15:36Zen
dc.date.available2022-02-11T21:15:36Zen
dc.date.issued2021-10-25en
dc.date.updated2022-02-11T21:15:34Zen
dc.description.abstractWhen a robot performs a task next to a human, physical interaction is inevitable: the human might push, pull, twist, or guide the robot. The state of the art treats these interactions as disturbances that the robot should reject or avoid. At best, these robots respond safely while the human interacts; but after the human lets go, these robots simply return to their original behavior. We recognize that physical human–robot interaction (pHRI) is often intentional: the human intervenes on purpose because the robot is not doing the task correctly. In this article, we argue that when pHRI is intentional it is also informative: the robot can leverage interactions to learn how it should complete the rest of its current task even after the person lets go. We formalize pHRI as a dynamical system, where the human has in mind an objective function they want the robot to optimize, but the robot does not get direct access to the parameters of this objective: they are internal to the human. Within our proposed framework human interactions become observations about the true objective. We introduce approximations to learn from and respond to pHRI in real-time. We recognize that not all human corrections are perfect: often users interact with the robot noisily, and so we improve the efficiency of robot learning from pHRI by reducing unintended learning. Finally, we conduct simulations and user studies on a robotic manipulator to compare our proposed approach with the state of the art. Our results indicate that learning from pHRI leads to better task performance and improved human satisfaction.en
dc.description.versionAccepted versionen
dc.format.extentPages 20-44en
dc.format.extent25 page(s)en
dc.format.mimetypeapplication/pdfen
dc.identifierARTN 02783649211050958 (Article number)en
dc.identifier.doihttps://doi.org/10.1177/02783649211050958en
dc.identifier.eissn1741-3176en
dc.identifier.issn0278-3649en
dc.identifier.issue1en
dc.identifier.urihttp://hdl.handle.net/10919/108316en
dc.identifier.volume41en
dc.language.isoenen
dc.publisherSAGEen
dc.relation.urihttp://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000711561200001&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=930d57c9ac61a043676db62af60056c1en
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectRoboticsen
dc.subjectimpedance controlen
dc.subjectinverse reinforcement learningen
dc.subjectpersonal robotsen
dc.subjectphysical human-robot interactionen
dc.subjectMANIPULATIONen
dc.subjectOPTIMIZATIONen
dc.subjectFRAMEWORKen
dc.subjectBEHAVIORen
dc.subject0801 Artificial Intelligence and Image Processingen
dc.subject0906 Electrical and Electronic Engineeringen
dc.subject0913 Mechanical Engineeringen
dc.subjectIndustrial Engineering & Automationen
dc.titlePhysical interaction as communication: Learning robot objectives online from human correctionsen
dc.title.serialInternational Journal of Robotics Researchen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten
dc.type.otherArticleen
dc.type.otherJournalen
pubs.organisational-group/Virginia Techen
pubs.organisational-group/Virginia Tech/Engineeringen
pubs.organisational-group/Virginia Tech/Engineering/Mechanical Engineeringen
pubs.organisational-group/Virginia Tech/All T&R Facultyen
pubs.organisational-group/Virginia Tech/Engineering/COE T&R Facultyen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
losey_ijrr2021.pdf
Size:
2.35 MB
Format:
Adobe Portable Document Format
Description:
Accepted version