Chemical codes promote selective compartmentalization of proteins.

Henry R Kilgore, Itamar Chinn,Peter G Mikhael, Ilan Mitnikov, Catherine Van Dongen, Guy Zylberberg,Lena Afeyan,Salman Banani, Susana Wilson-Hawken, Tong Ihn Lee, Regina Barzilay,Richard A Young

bioRxiv : the preprint server for biology(2024)

引用 0|浏览0
暂无评分
摘要
Cells have evolved mechanisms to distribute ∼10 billion protein molecules to subcellular compartments where diverse proteins involved in shared functions must efficiently assemble. Such assembly is presumed to unfold as a result of specific interactions between biomolecules; however, recent evidence suggests that distinctive chemical environments within subcellular compartments may also play an important role. Here, we test the hypothesis that protein groups with shared functions also share codes that guide them to compartment destinations. To test our hypothesis, we developed a transformer large language model, called ProtGPS, that predicts with high performance the compartment localization of human proteins excluded from the training set. We then demonstrate ProtGPS can be used for guided generation of novel protein sequences that selectively assemble into specific compartments in cells. Furthermore, ProtGPS predictions were sensitive to disease-associated mutations that produce changes in protein compartmentalization, suggesting that this type of pathogenic dysfunction can be discovered in silico. Our results indicate that protein sequences contain not only a folding code, but also a previously unrecognized chemical code governing their distribution in specific cellular compartments.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要