What is value—accumulated reward or evidence?

Friston, Karl J.; Adams, Rick; Montague, P. Read

dc.contributor.author	Friston, Karl J.	en
dc.contributor.author	Adams, Rick	en
dc.contributor.author	Montague, P. Read	en
dc.date.accessioned	2017-09-08T14:18:49Z	en
dc.date.available	2017-09-08T14:18:49Z	en
dc.date.issued	2012-11-02	en
dc.description.abstract	Why are you reading this abstract? In some sense, your answer will cast the exercise as valuable, but what is value? In what follows, we suggest that value is evidence or, more exactly, log Bayesian evidence. This implies that a sufficient explanation for valuable behavior is the accumulation of evidence for internal models of our world. This contrasts with normative models of optimal control and reinforcement learning, which assume the existence of a value function that explains behavior, where (somewhat tautologically) behavior maximizes value. In this paper, we consider an alternative formulation, active inference, that replaces policies in normative models with prior beliefs about the (future) states agents should occupy. This enables optimal behavior to be cast purely in terms of inference: where agents sample their sensorium to maximize the evidence for their generative model of hidden states in the world, and minimize their uncertainty about those states. Crucially, this formulation resolves the tautology inherent in normative models and allows one to consider how prior beliefs are themselves optimized in a hierarchical setting. We illustrate these points by showing that any optimal policy can be specified with prior beliefs in the context of Bayesian inference. We then show how these prior beliefs are themselves prescribed by an imperative to minimize uncertainty. This formulation explains the saccadic eye movements required to read this text and defines the value of the visual sensations you are soliciting.	en
dc.description.sponsorship	Wellcome Trust	en
dc.format.extent	25 pages	en
dc.format.mimetype	application/pdf	en
dc.identifier.doi	https://doi.org/10.3389/fnbot.2012.00011	en
dc.identifier.uri	http://hdl.handle.net/10919/78835	en
dc.identifier.volume	6	en
dc.language.iso	en_US	en
dc.publisher	Frontiers	en
dc.rights	Creative Commons Attribution 4.0 International	en
dc.rights.uri	http://creativecommons.org/licenses/by/4.0/	en
dc.subject	free energy	en
dc.subject	active inference	en
dc.subject	value	en
dc.subject	evidence	en
dc.subject	surprise	en
dc.subject	self-organization	en
dc.subject	selection	en
dc.subject	Bayesian	en
dc.title	What is value—accumulated reward or evidence?	en
dc.title.serial	Frontiers in Neurorobotics	en
dc.type	Article - Refereed	en
dc.type.dcmitype	Text	en

Files

Original bundle

Name: MontagueWhatisValue2012.pdf
Size: 1.53 MB
Format: Adobe Portable Document Format
Description:

Collections

Destination Area: Adaptive Brain and Behavior (ABB)
Scholarly Works, Fralin Biomedical Research Institute at VTC