Basic Information

Bio
Research
My primary research interest is reinforcement learning (RL), in particular enabling RL agents to adapt efficiently to unseen tasks (meta/multi-task RL) by learning good task representations when training rewards are available, and good coverage policies when they are not.
Because the dynamics of control tasks are commonly governed by physical laws, I set out to develop RL agents that explicitly model dynamics with equations. I believe this can enable the injection of prior knowledge and/or inductive biases, gains in sample efficiency, better domain randomization (à la Sim2Real), and risk control thanks to interpretability. In practice, this involves symbolic regression (SR): the search for analytic expressions composed of mathematical operators (e.g. cos, exp), constants and variables. Because existing SR algorithms struggle to infer accurate expressions in reasonable time, I have worked on transformer-based models, trained on synthetically generated datasets, that perform this search in orders of magnitude less time.
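To make the notion of symbolic regression concrete, here is a minimal sketch of the search problem itself. It is not the transformer-based approach mentioned above: it simply enumerates every small expression built from a toy grammar (operators cos, exp, +, *, the variable x and two constants) and keeps the one with the lowest mean squared error. The grammar, the depth limit, and the names `enumerate_expressions` and `symbolic_regression` are assumptions chosen for illustration.

```python
import itertools
import math

# Hypothetical toy grammar: two unary operators, two binary operators,
# one variable (x) and two constants.
CONSTANTS = (1.0, 2.0)
UNARY = {"cos": math.cos, "exp": math.exp}
BINARY = {"+": lambda a, b: a + b, "*": lambda a, b: a * b}


def enumerate_expressions(depth):
    """Yield (text, function) pairs for every expression of depth <= `depth`."""
    if depth == 0:
        yield "x", (lambda x: x)
        for c in CONSTANTS:
            yield str(c), (lambda x, c=c: c)
        return
    smaller = list(enumerate_expressions(depth - 1))
    yield from smaller
    # Wrap each smaller expression in a unary operator.
    for name, op in UNARY.items():
        for text, f in smaller:
            yield f"{name}({text})", (lambda x, op=op, f=f: op(f(x)))
    # Combine every ordered pair of smaller expressions with a binary operator.
    for name, op in BINARY.items():
        for (ta, fa), (tb, fb) in itertools.product(smaller, repeat=2):
            yield (
                f"({ta} {name} {tb})",
                (lambda x, op=op, fa=fa, fb=fb: op(fa(x), fb(x))),
            )


def symbolic_regression(xs, ys, depth=2):
    """Return (text, function, mse) of the best-fitting enumerated expression."""
    best_text, best_fn, best_err = None, None, float("inf")
    for text, fn in enumerate_expressions(depth):
        try:
            err = sum((fn(x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)
        except OverflowError:
            continue  # skip candidates whose exp() overflows on the data
        if err < best_err:
            best_text, best_fn, best_err = text, fn, err
    return best_text, best_fn, best_err


# Synthetic data from a ground-truth law the search should recover.
xs = [i / 4 for i in range(-8, 9)]
ys = [x * math.cos(x) for x in xs]
best_text, best_fn, best_err = symbolic_regression(xs, ys)
print(best_text, best_err)
```

The number of candidates grows very quickly with depth (about 1,500 at depth 2 for this grammar, and millions at depth 3), which illustrates why exhaustive search does not scale and why faster, learned search procedures are attractive.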
Papers (16 in total)
F. O. de Franca, M. Virgolin, M. Kommenda, M. S. Majumder, M. Cranmer, G. Espada, L. Ingelse, A. Fonseca, M. Landajuela, B. Petersen, R. Glatt, N. Mundhenk, C. S. Lee, J. D. Hochhalter, D. L. Randall, P. Kamienny, H. Zhang, G. Dick, A. Simon, B. Burlacu, Jaan Kasak, Meera Machado, Casper Wilstrup, W. G. La Cava
Irina Jurenka, Markus Kunesch, Kevin R. McKee, Daniel Gillick, Shaojian Zhu, Sara Wiltberger, Shubham Milind Phal, Katherine Hermann, Daniel Kasenberg, Avishkar Bhoopchand, Ankit Anand, Miruna Pîslar, Stephanie Chan, Lisa Wang, Jennifer She, Parsa Mahmoudieh, Aliya Rysbek, Wei-Jen Ko, Andrea Huber, Brett Wiltshire, Gal Elidan, Roni Rabin, Jasmin Rubinovitz, Amit Pitaru, Mac McAllister, Julia Wilkowski, David Choi, Roee Engelberg, Lidan Hackmon, Adva Levin, Rachel Griffin, Michael Sears, Filip Bar, Mia Mesar, Mana Jabbour, Arslan Chaudhry, James Cohan, Sridhar Thiagarajan, Nir Levine, Ben Brown, Dilan Gorur, Svetlana Grant, Rachel Hashimshoni, Laura Weidinger, Jieru Hu, Dawn Chen, Kuba Dolecki, Canfer Akbulut, Maxwell Bileschi, Laura Culp, Wen-Xin Dong, Nahema Marchal, Kelsie Van Deman, Hema Bajaj Misra, Michael Duah, Moran Ambar, Avi Caciularu, Sandra Lefdal, Chris Summerfield, James An, Pierre-Alexandre Kamienny, Abhinit Mohdi, Theofilos Strinopoulous, Annie Hale, Wayne Anderson, Luis C. Cobo, Niv Efron, Muktha Ananda, Shakir Mohamed, Maureen Heymans, Zoubin Ghahramani, Yossi Matias, Ben Gomes, Lila Ibrahim
CoRR (2024)
F. O. de Franca, M. Virgolin, M. Kommenda, M. S. Majumder, M. Cranmer, G. Espada, L. Ingelse, A. Fonseca, M. Landajuela, B. Petersen, R. Glatt, N. Mundhenk, C. S. Lee, J. D. Hochhalter, D. L. Randall, P. Kamienny, H. Zhang, G. Dick, A. Simon, B. Burlacu, Jaan Kasak, Meera Machado, Casper Wilstrup, W. G. La Cava
CoRR (2023)
CoRR (2021)
Author Statistics
#Papers: 16
#Citation: 688
H-Index: 8
G-Index: 12
Sociability: 5
Diversity: 1
Activity: 21