Neural Sequence Model Training via α-divergence Minimization

arXiv: Machine Learning (2017)

Abstract
We propose a new training method for neural sequence models in which the objective function is defined by the α-divergence. We show that this objective generalizes the maximum-likelihood (ML) and reinforcement-learning (RL) objectives as special cases: ML corresponds to α → 0 and RL to α → 1. We also show that the gradient of the objective can be interpreted as a mixture of the ML- and RL-based objective gradients. Experimental results on a machine translation task show that minimizing the objective with α > 0 outperforms the α → 0 setting, which corresponds to ML-based methods.
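For reference, one common parameterization of the α-divergence (Amari's convention; the paper's exact form may differ) and its limiting behavior, which is what makes ML and RL special cases:

```latex
% Amari alpha-divergence (one common convention; assumed here, not
% copied from the paper):
D_\alpha(p \,\|\, q) = \frac{1}{\alpha(1-\alpha)}
    \left( 1 - \int p(x)^{\alpha}\, q(x)^{1-\alpha}\, dx \right)
% Its limits recover the two directed KL divergences:
\lim_{\alpha \to 0} D_\alpha(p \,\|\, q) = \mathrm{KL}(q \,\|\, p), \qquad
\lim_{\alpha \to 1} D_\alpha(p \,\|\, q) = \mathrm{KL}(p \,\|\, q)
```

Since the abstract states that the gradient of the objective behaves like a mixture of ML- and RL-based gradients, the following PyTorch sketch illustrates such a mixture. It is not the paper's algorithm: the REINFORCE estimator, the fixed mixing weight `lam` (standing in for the role of α), and the toy data are assumptions made for illustration.

```python
import torch
import torch.nn.functional as F

def mixed_sequence_loss(logits, reference, sampled, reward, lam):
    """Convex mixture of an ML term (cross-entropy against the reference)
    and a REINFORCE-style RL term (reward-weighted log-likelihood of a
    model sample). `lam` in [0, 1] is a hypothetical stand-in for the
    role of alpha; lam = 0 recovers pure ML training.

    logits:    (T, V) per-step vocabulary logits from the model
    reference: (T,)   ground-truth token ids
    sampled:   (T,)   token ids sampled from the model
    reward:    scalar task reward for the sampled sequence (e.g., BLEU)
    """
    log_probs = F.log_softmax(logits, dim=-1)

    # ML gradient source: negative log-likelihood of the reference.
    nll = -log_probs.gather(1, reference.unsqueeze(1)).sum()

    # RL gradient source (REINFORCE): -reward * log p(sampled sequence).
    sample_logp = log_probs.gather(1, sampled.unsqueeze(1)).sum()
    rl = -reward * sample_logp

    return (1.0 - lam) * nll + lam * rl

# Toy usage: random logits stand in for a trained sequence model's output.
T, V = 5, 100
logits = torch.randn(T, V, requires_grad=True)
reference = torch.randint(V, (T,))
sampled = torch.distributions.Categorical(logits=logits.detach()).sample()
loss = mixed_sequence_loss(logits, reference, sampled, reward=0.7, lam=0.5)
loss.backward()  # gradient mixes the ML and RL gradients with weights 1-lam, lam
```

Setting lam = 0 reduces the loss to standard cross-entropy training, while lam = 1 gives pure REINFORCE, mirroring how the paper's α interpolates between the ML (α → 0) and RL (α → 1) endpoints.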