Machine Learning Enables Prediction of Pyrrolysyl-tRNA Synthetase Substrate Specificity

ACS Synthetic Biology (2023)

Abstract
Knowledge about the substrate scope of a given enzyme is informative for elucidating biochemical pathways and for expanding applications of the enzyme. However, no general methods are available to accurately predict the substrate specificity of an enzyme. Pyrrolysyl-tRNA synthetase (PylRS) is a powerful tool for incorporating various noncanonical amino acids (NCAAs) into proteins, which has enabled us to probe, image, rationally engineer, and evolve protein structure and function. However, the incorporation of a new NCAA typically requires selection from large libraries of PylRS with randomized mutations at active sites, and this process requires multiple rounds of selection for each new substrate. Therefore, a single aminoacyl-tRNA synthetase with broad substrate promiscuity is ideal to facilitate widespread applications of the genetic NCAA incorporation technique. Herein, machine learning models were developed to predict the substrate specificity of PylRS, that is, which novel NCAAs could be incorporated into proteins by three PylRS mutants. The models were built from a training set of 285 unique enzyme-substrate pairs of three PylRS mutants (IFRS, BtaRS, and MFRS) against 95 NCAAs. The best BaggingTree (BT) model was then used to virtually screen an NCAA library containing 1474 phenylalanine, tyrosine, tryptophan, and alanine analogues, and 156 NCAAs were predicted to be accepted by at least one of the three PylRS mutants. Then, 27 NCAAs, including 24 positive and 3 negative substrates, were experimentally tested for their activities. Of the 24 positive substrates, 20 showed weak or strong activity and were accepted by at least one PylRS mutant, and 11 of these NCAAs had never before been reported to be incorporated into proteins. The three negative substrates did not show any activity. Experimental results suggested that the BT model provides a three-class classification accuracy of 0.69 and a binary classification accuracy of 0.86. This study expanded the substrate scope of three PylRS variants and provided a framework for developing machine learning models to predict the substrate specificity of other PylRS variants.
Keywords
machine learning, pyrrolysyl-tRNA synthetase, noncanonical amino acids, enzyme engineering, substrate specificity