Neural Program Synthesis with Priority Queue Training
arXiv: Artificial Intelligence, Volume abs/1801.03526, 2018.
We consider the task of program synthesis in the presence of a reward function over the output of programs, where the goal is to find programs with maximal rewards. We employ an iterative optimization scheme, where we train an RNN on a dataset of K best programs from a priority queue of the generated programs so far. Then, we synthesize n...More
Full Text (Upload PDF)