A Chemical Domain Knowledge-Aware Framework for Multi-view Molecular Property Prediction.

Rui Hua,Xinyan Wang, Chuang Cheng,Qiang Zhu,Xuezhong Zhou

CCKS (Evaluation Track)(2022)

引用 0|浏览6
暂无评分
摘要
Molecular property prediction is becoming increasingly important in drug and material discovery, and many research works have demonstrated the great potential of machine learning techniques, especially deep learning. This paper presents our proposed solution for CCKS-2022 task 8, a chemical domain knowledge-aware framework for multi-view molecular property prediction. As a generative self-supervised approach to molecular graph representation learning, the framework is based on Knowledge-guided Pre-training of Graph Transformer (KPGT), which adopts a graph transformer guided by molecular fingerprint and descriptor knowledge. In the fine-tuning stage, combined with practical prediction problems, we fuse functional group information and chemical element knowledge graphs to predict molecular properties. From the perspective of chemical structure, KPGT provides structural information of molecular graphs (especially highlighting chemical bonds), and we further integrate chemical domain knowledge, using functional groups and chemical element knowledge graph, which is the information on physicochemical properties of atoms. From molecular graphs to functional groups, and to atoms, the molecular representation is jointly enhanced by multiple views from coarse to fine. When introducing functional group information and chemical element knowledge graph, we propose a novel BiLSTM-based recurrent module to accumulate domain knowledge. Our framework is able to simultaneously consider molecular graph, functional groups, and atomic physicochemical properties in practical predictions to better predict molecular properties. Finally, without using other external knowledge, the AUC-ROC of the test data reaches 0.88587, ranking second among 140 teams, which validates the performance of our approach.
更多
查看译文
关键词
Molecular property prediction,Chemical domain knowledge,Molecular representation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要