Classification of bacterial nanowire proteins using Machine Learning and Feature Engineering model.

bioRxiv: the preprint server for biology (2023)

Abstract
Nanowires (NW) have been extensively studied for spp. and spp. and are mostly produced by Type IV pili or multiheme c-type cytochromes. Electron transfer via NW is the most studied mechanism in microbially induced corrosion, with recent interest in applications in bioelectronics and biosensors. In this study, a machine learning (ML) based tool was developed to classify NW proteins. A manually curated collection of 999 proteins was assembled as the NW protein dataset. Gene ontology analysis of the dataset revealed that microbial NW proteins are membrane proteins with metal ion binding motifs and play a central role in electron transfer activity. Random Forest (RF), support vector machine (SVM), and extreme gradient boost (XGBoost) models were implemented in the prediction model and were observed to identify target proteins based on functional, structural, and physicochemical properties with 89.33%, 95.6%, and 99.99% accuracy, respectively. Dipeptide amino acid composition, transition, and distribution features of NW proteins are the key features underlying the model's high performance.
Keywords
bacterial nanowire proteins,machine learning,classification
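
The abstract names dipeptide composition as one of the features driving model performance. As an illustration only, the sketch below computes a 400-dimensional dipeptide composition vector from a protein sequence. The function name `dipeptide_composition`, the restriction to the 20 standard amino acids, and the choice to skip pairs containing non-standard residues are assumptions made for this example; the preprint's actual feature extraction pipeline is not described here and may differ.

```python
from itertools import product

# The 20 standard amino acids (assumption: non-standard residues are ignored).
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# All 400 ordered residue pairs, in a fixed order so vectors are comparable.
DIPEPTIDES = ["".join(pair) for pair in product(AMINO_ACIDS, repeat=2)]


def dipeptide_composition(sequence):
    """Return the relative frequency of each of the 400 dipeptides.

    Hypothetical helper for illustration; not taken from the preprint.
    """
    seq = sequence.upper()
    counts = dict.fromkeys(DIPEPTIDES, 0)
    total = 0
    # Slide a window of width 2 over the sequence (overlapping pairs).
    for i in range(len(seq) - 1):
        pair = seq[i:i + 2]
        if pair in counts:  # skip pairs with non-standard symbols
            counts[pair] += 1
            total += 1
    if total == 0:
        # Sequence too short or no valid pairs: return an all-zero vector.
        return [0.0] * len(DIPEPTIDES)
    return [counts[d] / total for d in DIPEPTIDES]


if __name__ == "__main__":
    vec = dipeptide_composition("ACAC")
    print(len(vec), vec[DIPEPTIDES.index("AC")])
```

Vectors of this kind, one per protein, could then be stacked into a feature matrix and passed to a classifier such as the RF, SVM, or XGBoost models mentioned in the abstract.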