Multiple conformational states assembly of multidomain proteins using evolutionary algorithm based on structural analogues and sequential homologues

biorxiv(2023)

引用 2|浏览2
暂无评分
摘要
With the breakthrough of AlphaFold2, nearly all single-domain protein structures can be built at experimental resolution. However, accurate modelling of full-chain structures of multidomain proteins, particularly all relevant conformations for those with multiple states remain challenging. In this study, we develop a multidomain protein assembly method, M-SADA, for assembling multiple conformational states. In M-SADA, a multiple population-based evolutionary algorithm is proposed to sample multiple conformational states under the guidance of multiple energy functions constructed by combining homologous and analogous templates with inter-domain distances predicted by deep learning. On a developed benchmark dataset containing 72 multidomain proteins with multiple conformational states, the performance of M-SADA is significantly better than that of AlphaFold2 on multiple conformational states modelling, where 29/72 (40.3%) of proteins can be assembled with a TM-score >0.90 for highly distinct conformational states with M-SADA while AlphaFold2 does so in only 2/72 (2.8%) of proteins. Furthermore, M-SADA is tested on a developed benchmark dataset containing 296 multidomain proteins with single conformational state, and results show that the average TM-score of M-SADA on the best models is 0.913, which is 5.2% higher than that of AlphaFold2 models (0.868). ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
关键词
multiple conformational states assembly,multidomain proteins,evolutionary algorithm,structural analogues
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要