Designing template-free predictor for targeting protein-ligand binding sites with classifier ensemble and spatial clustering.

IEEE/ACM Trans. Comput. Biology Bioinform.(2013)

引用 95|浏览46
暂无评分
摘要
Accurately identifying the protein-ligand binding sites or pockets is of significant importance for both protein function analysis and drug design. Although much progress has been made, challenges remain, especially when the 3D structures of target proteins are not available or no homology templates can be found in the library, where the template-based methods are hard to be applied. In this paper, we report a new ligand-specific template-free predictor called TargetS for targeting protein-ligand binding sites from primary sequences. TargetS first predicts the binding residues along the sequence with ligand-specific strategy and then further identifies the binding sites from the predicted binding residues through a recursive spatial clustering algorithm. Protein evolutionary information, predicted protein secondary structure, and ligand-specific binding propensities of residues are combined to construct discriminative features; an improved AdaBoost classifier ensemble scheme based on random undersampling is proposed to deal with the serious imbalance problem between positive (binding) and negative (nonbinding) samples. Experimental results demonstrate that TargetS achieves high performances and outperforms many existing predictors. TargetS web server and data sets are freely available at: http://www.csbio.sjtu.edu.cn/bioinf/TargetS/ for academic use.
更多
查看译文
关键词
protein secondary structure prediction,improved adaboost classifier ensemble scheme,bonds (chemical),protein-ligand binding sites,recursive spatial clustering algorithm,residue ligand-specific binding propensity,template-free,drug design,random undersampling,learning (artificial intelligence),primary sequence,accurate protein-ligand binding site identification,proteins,accurate pocket identification,discriminative feature construction,targets predictor,binding sample-nonbinding sample imbalance problem,targeting protein-ligand binding site,ligand-specific strategy,sequences,ligand-specific template-free predictor,molecular biophysics,molecular configurations,ligand-specific prediction model,protein evolutionary information,sequence binding residue prediction,homology template,template-based method application,protein function analysis,spatial clustering,template-free predictor design,sampling methods,target protein 3d structure,bioinformatics,classifier ensemble,positive sample-negative sample imbalance problem,feature extraction,learning artificial intelligence,protein sequence,metals
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要