A survey of inverse reinforcement learning
Beling, Peter A.
MetadataShow full item record
Learning from demonstration, or imitation learning, is the process of learning to act in an environment from examples provided by a teacher. Inverse reinforcement learning (IRL) is a specific form of learning from demonstration that attempts to estimate the reward function of a Markov decision process from examples provided by the teacher. The reward function is often considered the most succinct description of a task. In simple applications, the reward function may be known or easily derived from properties of the system and hard coded into the learning process. However, in complex applications, this may not be possible, and it may be easier to learn the reward function by observing the actions of the teacher. This paper provides a comprehensive survey of the literature on IRL. This survey outlines the differences between IRL and two similar methods - apprenticeship learning and inverse optimal control. Further, this survey organizes the IRL literature based on the principal method, describes applications of IRL algorithms, and provides areas of future research.
Showing items related by title, author, creator and subject.
Choi, Jin-Woo (Virginia Tech, 2021-01-07)Recent progress on deep neural networks has shown remarkable action recognition performance from videos. The remarkable performance is often achieved by transfer learning: training a model on a large-scale labeled dataset ...
Klunk, Clare Dvoranchik (Virginia Tech, 1999-04-14)Many successful professionals, recognized for their experience, knowledge, competence and commitment to their field, experience a contradiction when they realize that their contributions are no longer valued by decision-makers ...
Becksford, Lisa; Metko, Stefanie (Taylor & Francis, 2018-09-05)In 2015, in response to the findings of an online learning needs assessment, two librarians and a web developer began creating a library learning objects repository. This repository would ensure that distance learners were ...