Learning to Infer User Hidden States for Online Sequential Advertising

Zhaoqing Peng, Junqi Jin, Lan Luo, Yaodong Yang, Rui Luo, Jun Wang, Weinan Zhang, Haiyang Xu, Miao Xu, Chuan Yu, Tiejian Luo, Han Li, Jian Xu, Kun Gai

CIKM '20: The 29th ACM International Conference on Information and Knowledge Management, Virtual Event, Ireland, October 2020

Abstract

To drive purchases in online advertising, advertisers have a strong interest in optimizing their sequential advertising strategy, for which both performance and interpretability are important. Existing deep reinforcement learning methods lack interpretability, which makes the strategy hard to understand, diagnose, and further optimize. In this paper, we propose the Deep Intents Sequential Advertising (DISA) method to address these issues. The key to interpretability is understanding a consumer's purchase intent, which is unobservable (a hidden state). We model this intent as a latent variable and formulate the problem as a Partially Observable Markov Decision Process (POMDP), in which the underlying intents are inferred from observable behaviors. Large-scale industrial offline and online experiments demonstrate our method's superior performance over several baselines. The inferred hidden states are analyzed, and the results support the rationality of our inference.

Keywords

Partially Observable Markov Decision Process, Online Advertising
