Optimizing Data Usage via Differentiable Rewards

Abstract:

To acquire a new skill, humans learn better and faster if a tutor, based on their current knowledge level, informs them of how much attention they should pay to particular content or practice problems. Similarly, a machine learning model could potentially be trained better with a scorer that "adapts" to its current learning state and es...
