Self-Supervised Reinforcement Learning with dual-reward for knowledge-aware recommendation.

Wei Zhang, Yuanguo Lin, Yong Liu, Huanyu You, Pengcheng Wu, Fan Lin, Xiuze Zhou

Appl. Soft Comput. (2022)

Abstract

To improve recommendation accuracy and offer explanations for recommendations, Reinforcement Learning (RL) has been applied to path reasoning over knowledge graphs. However, in recommendation tasks, most existing RL methods learn the path-finding policy using only a short-term or single reward, which leads to local optima and causes some potential paths to be missed. To address these issues, we propose a Self-Supervised Reinforcement Learning (SSRL) framework combined with a dual-reward for knowledge-aware recommendation reasoning over knowledge graphs. We then improve the Actor–Critic algorithm with a dual-reward driven strategy, which combines a short-term reward with a long-term incremental evaluation. The improved algorithm helps the policy guide path reasoning from a global perspective. In addition, to find the most promising paths, the improved Actor–Critic algorithm uses a per-sample loss constraint as a reinforced signal to update the gradients. Experimental results show improvements over the baselines, demonstrating the effectiveness of our framework.
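
The abstract does not give the formulas behind the dual-reward strategy, so the following is only an illustrative sketch of one way a short-term reward could be blended with a long-term (discounted) return when computing Actor–Critic advantages. The function names, the blending weight `alpha`, and the discount `gamma` are assumptions made for illustration, not definitions taken from the paper.

```python
from typing import List


def discounted_returns(rewards: List[float], gamma: float = 0.99) -> List[float]:
    """Long-term signal: discounted sum of rewards from each step onward."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk the path backwards so each step accumulates its future rewards.
    for t in range(len(rewards) - 1, -1, -1):
        running = rewards[t] + gamma * running
        returns[t] = running
    return returns


def dual_reward_advantages(
    rewards: List[float],
    values: List[float],
    alpha: float = 0.5,   # hypothetical weight on the short-term reward
    gamma: float = 0.99,  # hypothetical discount factor
) -> List[float]:
    """Blend the immediate reward with the discounted return, then subtract
    the critic's value estimate to obtain a per-step advantage."""
    if len(rewards) != len(values):
        raise ValueError("rewards and values must have the same length")
    long_term = discounted_returns(rewards, gamma)
    return [
        alpha * r + (1.0 - alpha) * g - v
        for r, g, v in zip(rewards, long_term, values)
    ]


if __name__ == "__main__":
    # A three-hop reasoning path with a terminal reward of 2.0.
    print(dual_reward_advantages([1.0, 0.0, 2.0], [0.0, 0.0, 0.0], alpha=0.5, gamma=0.5))
```

With `alpha = 1.0` this sketch reduces to a purely short-term signal, and with `alpha = 0.0` to a standard return-based advantage; the paper's actual combination may differ.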

Keywords

Reinforcement learning, Self-Supervised, Recommendation, Knowledge graph, Dual-reward
