Prediction Of Cyclin Protein Using Two-Step Feature Selection Technique

IEEE Access (2020)

Abstract
Cyclins are a family of proteins that regulate the cell cycle by activating cyclin-dependent kinases or a group of enzymes required for the cell cycle. Building a model to classify Cyclins is important for understanding their function. Because Cyclin sequences share low similarity with one another, a machine-learning-based model is needed to identify them. In this study, a method based on the support vector machine (SVM) is developed to recognize Cyclins using only amino acid sequence information. Eighteen feature descriptors, totaling 13,151 feature dimensions, were extracted, and the feature dimension was reduced to 8 through a feature selection technique. The retained features show that descriptors such as Autocorrelation, AAC, and CTDC are important for identifying Cyclins. Jackknife cross-validation results indicate that our model classifies Cyclins with an accuracy of 91.9%, which is superior to that of a recent study using the same data set. Our work provides an important tool for discriminating Cyclins.
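
As a rough illustration of two ideas named in the abstract, the sketch below computes an amino acid composition (AAC) vector and enumerates jackknife (leave-one-out) splits. It is not the authors' implementation: the function names `amino_acid_composition` and `leave_one_out_splits` are hypothetical, AAC is assumed to be the fraction of each of the 20 standard amino acids in a sequence, and the other 17 descriptors, the feature selection step, and the SVM itself are omitted.

```python
from collections import Counter

# The 20 standard amino acids, in alphabetical one-letter order.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def amino_acid_composition(sequence):
    """Return a 20-dimensional AAC vector (hypothetical helper).

    Each entry is the fraction of one standard amino acid in the
    sequence; non-standard characters are ignored (an assumption).
    """
    seq = sequence.upper()
    counts = Counter(ch for ch in seq if ch in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return [counts[aa] / total for aa in AMINO_ACIDS]


def leave_one_out_splits(n_samples):
    """Yield (train_indices, test_index) pairs for jackknife validation.

    Each sample is held out exactly once while all others form the
    training set.
    """
    for i in range(n_samples):
        train = [j for j in range(n_samples) if j != i]
        yield train, i
```

In a full pipeline, each held-out sample would be predicted by a classifier trained on the remaining samples, and the accuracy would be the fraction of held-out samples predicted correctly.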
Keywords
Amino acids, Feature extraction, Proteins, Correlation, Support vector machines, Cyclin, support vector machine, identification, feature selection, sequence information