Department of Computing Science
University of Alberta
Biography
My current research still revolves around actor-critic algorithms. In particular, I’ve been continuing my study of these algorithms from an approximate policy iteration perspective. Although I’m interested in everything actor-critic, my recent research has focused on how actor-critic algorithms are affected by:
New policy improvement operators
Entropy regularization
Policy parameterizations