基本信息
浏览量:48
职业迁徙
个人简介
More specifically, my research has focused on problems that arise at the intersection of Machine Learning and Dynamical Systems/Time Series where one must learn to make a sequence of predictions to achieve a task. I have developed novel learning techniques, based on online learning and interactions with the learner, that leads to efficient learning techniques with good guarantees for these sequence prediction problems. I have applied these techniques in the context of imitation learning (learning from demonstrations), structured prediction in computer vision, model-based reinforcement learning, and currently on list optimization problems (e.g. ad placement, personalized news recommendation, grasp selection and trajectory optimization for robotic manipulation).
In the context of imitation learning, I have demonstrated the efficacy of these methods on two video game problems, making a computer learn how to play a 3D racing game and Mario Bros from input images and corresponding actions taken by a human player, as well as making a UAV learn to fly through forest environments while avoiding trees seen through its camera from pilot demonstrations.
In the context of imitation learning, I have demonstrated the efficacy of these methods on two video game problems, making a computer learn how to play a 3D racing game and Mario Bros from input images and corresponding actions taken by a human player, as well as making a UAV learn to fly through forest environments while avoiding trees seen through its camera from pilot demonstrations.
研究兴趣
论文共 28 篇作者统计合作学者相似作者
按年份排序按引用量排序主题筛选期刊级别筛选合作者筛选合作机构筛选
时间
引用量
主题
期刊级别
合作者
合作机构
CoRR (2011)
引用11浏览0EI引用
11
0
加载更多
作者统计
合作学者
合作机构
D-Core
- 合作者
- 学生
- 导师
数据免责声明
页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果,我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问,可以通过电子邮件方式联系我们:report@aminer.cn