Files in this item:
File | Description | Size | Format |
---|---|---|---|
s10015-024-00954-7.pdf | | 761.22 kB | Adobe PDF |
Title: | Inferring source of learning by chimpanzees in cognitive tasks using reinforcement learning theory |
Authors: | Hirata, Satoshi; Sakai, Yutaka |
Alternative author names: | 平田, 聡 |
Keywords: | Reinforcement learning; Chimpanzee; Serial learning; Actor-Critic |
Issue date: | Aug-2024 |
Publisher: | Springer Nature |
Journal title: | Artificial Life and Robotics |
Volume: | 29 |
Start page: | 398 |
End page: | 403 |
Abstract: | Reinforcement learning is a mathematical framework for learning better choices through trial-and-error. Recent studies revealed that reinforcement learning is applicable to animal behavior and cognition. However, applying reinforcement learning to animal behavior sometimes encounters difficulties because the information sources utilized by animals to make choices are often unknown, whereas this is identified as the “state” in the reinforcement learning framework. We sought to identify possible state settings including non-standard formulations suitable for explaining data from past chimpanzee studies. Although chimpanzees' performance in a serial learning task was inconsistent with standard reinforcement learning formulations, we found that the combination of state-independent choice making and state-dependent evaluation produced consistent results. Exploration of state settings in reinforcement learning may shed new light on animal learning processes. |
Rights: | This version of the article has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature's AM terms of use, but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: https://doi.org/10.1007/s10015-024-00954-7 The full-text file will be made open to the public on 05 June 2025 in accordance with publisher's 'Terms and Conditions for Self-Archiving'. This is not the published version. Please cite only the published version. |
URI: | http://hdl.handle.net/2433/294473 |
DOI (publisher version): | 10.1007/s10015-024-00954-7 |
Appears in collections: | Journal Articles |

All items stored in this repository are protected by copyright.