Classification And Identification Of Proteins By Means Of Common And Specific Amino Acid N-Tuples In Unaligned Sequences

COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE(1998)

引用 12|浏览20
暂无评分
摘要
Unaligned amino acid sequences can be characterized by their composition of amino acid n-tuples (i.e. doublets, triplets, quadruplets, etc.). In this study we investigated the performance of two statistics, termed commonality and specificity, that are derived from n-tuple counts using a set of G-protein coupled receptor (GPCR) sequences. The commonality of a tuple is defined as its relative occurrence in the sequences that belong to a given GPCR subtype. The specificity of a tuple is derived from its relative occurrence in the sequences of a given GPCR subtype and from its relative non-occurrence in the sequences that do not belong to this subtype. A graphical presentation, termed 'polygram', is described for the visualization of common and specific tuples. The method can be applied to the classification of unknown GPCR sequences. It can also be applied to the identification of fragments of GPCRs, such as may occur in chimeric receptors. The method is generally applicable to other protein families and other types of coding. (C) 1998 Elsevier Science Ireland Ltd. All rights reserved.
更多
查看译文
关键词
amino acid sequences, classification, G-protein coupled receptors, n-tuples, polygram
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要