Entropy-based metrics for predicting choice behavior based on local response to reward

Ethan Trepka; Mehran Spitmaan; Bilal A. Bari; Vincent D. Costa; Jeremiah Y. Cohen; Alireza Soltani

doi:10.1038/s41467-021-26784-w

Entropy-based metrics for predicting choice behavior based on local response to reward

Ethan Trepka, Mehran Spitmaan, Bilal A. Bari, Vincent D. Costa, Jeremiah Y. Cohen, Alireza Soltani

Oregon National Primate Research Center

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

For decades, behavioral scientists have used the matching law to quantify how animals distribute their choices between multiple options in response to reinforcement they receive. More recently, many reinforcement learning (RL) models have been developed to explain choice by integrating reward feedback over time. Despite reasonable success of RL models in capturing choice on a trial-by-trial basis, these models cannot capture variability in matching behavior. To address this, we developed metrics based on information theory and applied them to choice data from dynamic learning tasks in mice and monkeys. We found that a single entropy-based metric can explain 50% and 41% of variance in matching in mice and monkeys, respectively. We then used limitations of existing RL models in capturing entropy-based metrics to construct more accurate models of choice. Together, our entropy-based metrics provide a model-free tool to predict adaptive choice behavior and reveal underlying neural mechanisms.

Original language	English (US)
Article number	6567
Journal	Nature communications
Volume	12
Issue number	1
DOIs	https://doi.org/10.1038/s41467-021-26784-w
State	Published - Dec 2021

ASJC Scopus subject areas

General Chemistry
General Biochemistry, Genetics and Molecular Biology
General Physics and Astronomy

Access to Document

10.1038/s41467-021-26784-w

Cite this

@article{f3d659173e9d4f20b85fadb2960c74de,

title = "Entropy-based metrics for predicting choice behavior based on local response to reward",

abstract = "For decades, behavioral scientists have used the matching law to quantify how animals distribute their choices between multiple options in response to reinforcement they receive. More recently, many reinforcement learning (RL) models have been developed to explain choice by integrating reward feedback over time. Despite reasonable success of RL models in capturing choice on a trial-by-trial basis, these models cannot capture variability in matching behavior. To address this, we developed metrics based on information theory and applied them to choice data from dynamic learning tasks in mice and monkeys. We found that a single entropy-based metric can explain 50% and 41% of variance in matching in mice and monkeys, respectively. We then used limitations of existing RL models in capturing entropy-based metrics to construct more accurate models of choice. Together, our entropy-based metrics provide a model-free tool to predict adaptive choice behavior and reveal underlying neural mechanisms.",

author = "Ethan Trepka and Mehran Spitmaan and Bari, {Bilal A.} and Costa, {Vincent D.} and Cohen, {Jeremiah Y.} and Alireza Soltani",

note = "Publisher Copyright: {\textcopyright} 2021, The Author(s).",

year = "2021",

month = dec,

doi = "10.1038/s41467-021-26784-w",

language = "English (US)",

volume = "12",

journal = "Nature communications",

issn = "2041-1723",

publisher = "Nature Publishing Group",

number = "1",

}

TY - JOUR

T1 - Entropy-based metrics for predicting choice behavior based on local response to reward

AU - Trepka, Ethan

AU - Spitmaan, Mehran

AU - Bari, Bilal A.

AU - Costa, Vincent D.

AU - Cohen, Jeremiah Y.

AU - Soltani, Alireza

PY - 2021/12

Y1 - 2021/12

N2 - For decades, behavioral scientists have used the matching law to quantify how animals distribute their choices between multiple options in response to reinforcement they receive. More recently, many reinforcement learning (RL) models have been developed to explain choice by integrating reward feedback over time. Despite reasonable success of RL models in capturing choice on a trial-by-trial basis, these models cannot capture variability in matching behavior. To address this, we developed metrics based on information theory and applied them to choice data from dynamic learning tasks in mice and monkeys. We found that a single entropy-based metric can explain 50% and 41% of variance in matching in mice and monkeys, respectively. We then used limitations of existing RL models in capturing entropy-based metrics to construct more accurate models of choice. Together, our entropy-based metrics provide a model-free tool to predict adaptive choice behavior and reveal underlying neural mechanisms.

AB - For decades, behavioral scientists have used the matching law to quantify how animals distribute their choices between multiple options in response to reinforcement they receive. More recently, many reinforcement learning (RL) models have been developed to explain choice by integrating reward feedback over time. Despite reasonable success of RL models in capturing choice on a trial-by-trial basis, these models cannot capture variability in matching behavior. To address this, we developed metrics based on information theory and applied them to choice data from dynamic learning tasks in mice and monkeys. We found that a single entropy-based metric can explain 50% and 41% of variance in matching in mice and monkeys, respectively. We then used limitations of existing RL models in capturing entropy-based metrics to construct more accurate models of choice. Together, our entropy-based metrics provide a model-free tool to predict adaptive choice behavior and reveal underlying neural mechanisms.

UR - http://www.scopus.com/inward/record.url?scp=85119011099&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85119011099&partnerID=8YFLogxK

U2 - 10.1038/s41467-021-26784-w

DO - 10.1038/s41467-021-26784-w

M3 - Article

C2 - 34772943

AN - SCOPUS:85119011099

SN - 2041-1723

VL - 12

JO - Nature communications

JF - Nature communications

IS - 1

M1 - 6567

ER -

Entropy-based metrics for predicting choice behavior based on local response to reward

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this