ダウンロード数: 122

このアイテムのファイル:
ファイル 記述 サイズフォーマット 
s41598-019-53600-9.pdf2.45 MBAdobe PDF見る/開く
完全メタデータレコード
DCフィールド言語
dc.contributor.authorHigashi, Hiroshien
dc.contributor.authorMinami, Tetsutoen
dc.contributor.authorNakauchi, Shigekien
dc.contributor.alternative東, 広志ja
dc.contributor.alternative南, 哲人ja
dc.contributor.alternative中内, 茂樹ja
dc.date.accessioned2019-12-11T00:49:50Z-
dc.date.available2019-12-11T00:49:50Z-
dc.date.issued2019-11-27-
dc.identifier.issn2045-2322-
dc.identifier.urihttp://hdl.handle.net/2433/245045-
dc.description.abstractIt is widely known that reinforcement learning systems in the brain contribute to learning via interactions with the environment. These systems are capable of solving multidimensional problems, in which some dimensions are relevant to a reward, while others are not. To solve these problems, computational models use Bayesian learning, a strategy supported by behavioral and neural evidence in human. Bayesian learning takes into account beliefs, which represent a learner’s confidence in a particular dimension being relevant to the reward. Beliefs are given as a posterior probability of the state-transition (reward) function that maps the optimal actions to the states in each dimension. However, when it comes to implementing this learning strategy, the order in which beliefs and state-transition functions update remains unclear. The present study investigates this update order using a trial-by-trial analysis of human behavior and electroencephalography signals during a task in which learners have to identify the reward-relevant dimension. Our behavioral and neural results reveal a cooperative update—within 300 ms after the outcome feedback, the state-transition functions are updated, followed by the beliefs for each dimension.en
dc.format.mimetypeapplication/pdf-
dc.language.isoeng-
dc.publisherSpringer Natureen
dc.rights© The Author(s) 2019. This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.en
dc.subjectDecisionen
dc.subjectHuman behaviouren
dc.subjectLearning algorithmsen
dc.titleCooperative update of beliefs and state-transition functions in human reinforcement learningen
dc.typejournal article-
dc.type.niitypeJournal Article-
dc.identifier.jtitleScientific Reportsen
dc.identifier.volume9-
dc.relation.doi10.1038/s41598-019-53600-9-
dc.textversionpublisher-
dc.identifier.artnum17704-
dc.identifier.pmid31776353-
dcterms.accessRightsopen access-
datacite.awardNumber15K21079-
datacite.awardNumber26240043-
datacite.awardNumber17H06292-
datacite.awardNumber19K16894-
jpcoar.funderName日本学術振興会ja
jpcoar.funderName日本学術振興会ja
jpcoar.funderName日本学術振興会ja
jpcoar.funderName日本学術振興会ja
jpcoar.funderName.alternativeJapan Society for the Promotion of Science (JSPS)en
jpcoar.funderName.alternativeJapan Society for the Promotion of Science (JSPS)en
jpcoar.funderName.alternativeJapan Society for the Promotion of Science (JSPS)en
jpcoar.funderName.alternativeJapan Society for the Promotion of Science (JSPS)en
出現コレクション:学術雑誌掲載論文等

アイテムの簡略レコードを表示する

Export to RefWorks


出力フォーマット 


このリポジトリに保管されているアイテムはすべて著作権により保護されています。