Reinforcement learning: Dopamine ramps with fuzzy value estimates

Current Biology (2022)

Abstract
A new study in reinforcement learning theory shows that extending the temporal difference algorithm to unbiased learning under state uncertainty explains the observed ramping behaviour of dopamine neurons.