Experience
Education
Bio
I am interested in:

Off-policy (counterfactual) Estimation/Learning for Reinforcement Learning and Contextual Bandits.
Stochastic Approximation for large-scale Machine Learning and Deep Learning.

I study applications of these ideas towards recommender systems and user-facing Machine Learning systems.