Interplay of approximate planning strategies

Quentin J M Huys; Níall Lally; Paul Faulkner; Neir Eshel; Erich Seifritz; Samuel J Gershman; Peter Dayan; Jonathan P Roiser

doi:10.1073/pnas.1414219112

Proc Natl Acad Sci U S A. 2015 Mar 10;112(10):3098-103. doi: 10.1073/pnas.1414219112. Epub 2015 Feb 9.

Authors

Quentin J M Huys¹, Níall Lally², Paul Faulkner³, Neir Eshel⁴, Erich Seifritz⁵, Samuel J Gershman⁶, Peter Dayan⁷, Jonathan P Roiser⁸

Affiliations

¹ Translational Neuromodeling Unit, Institute of Biomedical Engineering, University of Zürich and Swiss Federal Institute of Technology (ETH) Zürich, 8032 Zurich, Switzerland; Department of Psychiatry, Psychotherapy and Psychosomatics, Hospital of Psychiatry, University of Zürich, 8032 Zurich, Switzerland; qhuys@cantab.net.
² Institute of Cognitive Neuroscience, University College London, London WC1N 3AR, United Kingdom; Experimental Therapeutics & Pathophysiology Branch, Intramural Research Program, National Institute of Mental Health, National Institutes of Health, Bethesda, MD 20892;
³ Semel Institute for Neuroscience and Human Behavior, University of California, Los Angeles, CA 90095;
⁴ Program in Neuroscience and MD-PhD Program, Harvard Medical School, Boston, MA 02115;
⁵ Department of Psychiatry, Psychotherapy and Psychosomatics, Hospital of Psychiatry, University of Zürich, 8032 Zurich, Switzerland;
⁶ Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139;
⁷ Gatsby Computational Neuroscience Unit, University College London, London WC1N 3AR, United Kingdom;
⁸ Institute of Cognitive Neuroscience, University College London, London WC1N 3AR, United Kingdom.

Abstract

Humans routinely formulate plans in domains so complex that even the most powerful computers are taxed. To do so, they seem to avail themselves of many strategies and heuristics that efficiently simplify, approximate, and hierarchically decompose hard tasks into simpler subtasks. Theoretical and cognitive research has revealed several such strategies; however, little is known about their establishment, interaction, and efficiency. Here, we use model-based behavioral analysis to provide a detailed examination of the performance of human subjects in a moderately deep planning task. We find that subjects exploit the structure of the domain to establish subgoals in a way that achieves a nearly maximal reduction in the cost of computing values of choices, but then combine partial searches with greedy local steps to solve subtasks, and maladaptively prune the decision trees of subtasks in a reflexive manner upon encountering salient losses. Subjects come idiosyncratically to favor particular sequences of actions to achieve subgoals, creating novel complex actions or "options."

Keywords: hierarchical reinforcement learning; memoization; planning; pruning.
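
The reflexive pruning mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the task or the model used in the study: the transition table, the reward values, the threshold, and the names `TRANSITIONS`, `plan_value`, and `prune_at` are all assumptions made up for illustration, and pruning is applied deterministically here for simplicity. The sketch compares an exhaustive search of a small decision tree with a search that refuses to look past any transition carrying a large loss.

```python
# Hypothetical toy task (not the one from the paper): each state maps to a
# list of (next_state, immediate_reward) transitions.
TRANSITIONS = {
    "A": [("B", -70), ("C", -20)],
    "B": [("D", 140), ("A", -20)],
    "C": [("A", 20), ("D", -20)],
    "D": [("A", -20), ("C", 20)],
}


def plan_value(state, depth, prune_at=None):
    """Return the best total reward reachable in `depth` steps from `state`.

    If `prune_at` is given, any transition whose immediate reward is at or
    below that threshold is still counted, but the subtree behind it is not
    searched (its future value is treated as 0). This is an assumed,
    deterministic stand-in for reflexive pruning after a salient loss.
    """
    if depth == 0:
        return 0
    best = float("-inf")
    for next_state, reward in TRANSITIONS[state]:
        if prune_at is not None and reward <= prune_at:
            future = 0  # subtree behind the large loss is cut off
        else:
            future = plan_value(next_state, depth - 1, prune_at)
        best = max(best, reward + future)
    return best


full = plan_value("A", 2)                  # exhaustive two-step search
pruned = plan_value("A", 2, prune_at=-70)  # search that stops at large losses
print(full, pruned)
```

In this toy table the best two-step plan from "A" passes through the large loss (-70 followed by +140, for a total of 70). The pruned search never sees the +140 and settles for a total of 0, which is the sense in which such pruning can be maladaptive.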
