DL-SMILES#: A Novel Encoding Scheme for Predicting Compound Protein Affinity Using Deep Learning

COMBINATORIAL CHEMISTRY & HIGH THROUGHPUT SCREENING(2022)

引用 4|浏览6
暂无评分
摘要
Introduction: Drug repositioning aims to screen drugs and therapeutic goals from approved drugs and abandoned compounds that have been identified as safe. This trend is changing the landscape of drug development and creating a model of drug repositioning for new drug development. In the recent decade, machine learning methods have been applied to predict the binding affinity of compound proteins, while deep learning is recently becoming prominent and achieving significant performances. Among the models, the way of representing the compounds is usually simple, which is the molecular fingerprints, i.e., a single SMILES string. Methods: In this study, we improve previous work by proposing a novel representing manner, named SMILES#, to recode the SMILES string. This approach takes into account the properties of compounds and achieves superior performance. After that, we propose a deep learning model that combines recurrent neural networks with a convolutional neural network with an attention mechanism, using unlabeled data and labeled data to jointly encode molecules and predict binding affinity. Results: Experimental results show that SMILES# with compound properties can effectively improve the accuracy of the model and reduce the RMS error on most data sets. Conclusion: We used the method to verify the related and unrelated compounds with the same target, and the experimental results show the effectiveness of the method.
更多
查看译文
关键词
Deep learning, drug repositioning, drug-target interactions, IC50 value, SMILES string, compound properties
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要