基本信息
浏览量:52
职业迁徙
个人简介
In model-based RL, my work provided a novel insight into probabilistic environment model ensemble, which is commonly used in model-based RL algorithm. Based on the insight, we can substitute the ensemble with a single model and Lipschitz regularized value function to make the learning algorithm much more computationally efficient. For transfer-RL, I have worked on transferring domain knowledge under the drastic change of observation spaces (e.g., from vector-based observation to image-based observation). For adversarial RL, I have worked on observation attacks for the deep RL policy, as well as designing efficient algorithms to improve the agent’s robustness under attack. In addition, I have also worked on communication attacks in multi-agent reinforcement learning and developed a certifiable defense mechanism.
研究兴趣
论文共 32 篇作者统计合作学者相似作者
按年份排序按引用量排序主题筛选期刊级别筛选合作者筛选合作机构筛选
时间
引用量
主题
期刊级别
合作者
合作机构
Ruijie Zheng,Yongyuan Liang,Xiyao Wang,Shuang Ma,Hal Daumé III,Huazhe Xu,John Langford, Praveen Palanisamy, Kalyan Shankar Basu,Furong Huang
CoRR (2024)
引用0浏览0EI引用
0
0
Tianying Ji,Yongyuan Liang,Yan Zeng,Yu Luo, Guowei Xu, Jiawei Guo,Ruijie Zheng,Furong Huang,Fuchun Sun,Huazhe Xu
CoRR (2024)
引用0浏览0EI引用
0
0
CoRR (2023)
引用0浏览0EI引用
0
0
Guowei Xu,Ruijie Zheng,Yongyuan Liang,Xiyao Wang,Zhecheng Yuan,Tianying Ji,Yu Luo,Xiaoyu Liu, Jiaxin Yuan,Pu Hua, Shuzhen Li,Yanjie Ze,
ICLR 2024 (2023)
引用0浏览0EI引用
0
0
CoRR (2023)
引用0浏览0EI引用
0
0
加载更多
作者统计
合作学者
合作机构
D-Core
- 合作者
- 学生
- 导师
数据免责声明
页面数据均来自互联网公开来源、合作出版商和通过AI技术自动分析结果,我们不对页面数据的有效性、准确性、正确性、可靠性、完整性和及时性做出任何承诺和保证。若有疑问,可以通过电子邮件方式联系我们:report@aminer.cn