Optimizing Data Usage via Differentiable Rewards
Abstract:
To acquire a new skill, humans learn better and faster if a tutor, based on their current knowledge level, informs them of how much attention they should pay to particular content or practice problems. Similarly, a machine learning model could potentially be trained better with a scorer that "adapts" to its current learning state and es...