SubVis: an interactive R package for exploring the effects of multiple substitution matrices on pairwise sequence alignment.

PeerJ (2017)

Abstract
Understanding how proteins mutate is critical to solving a host of biological problems. Mutations occur when an amino acid is substituted for another in a protein sequence. The set of likelihoods for amino acid substitutions is stored in a matrix and input to alignment algorithms. The quality of the resulting alignment is used to assess the similarity of two or more sequences and can vary according to assumptions modeled by the substitution matrix. Substitution strategies with minor parameter variations are often grouped together in families. For example, the BLOSUM and PAM matrix families are commonly used because they provide a standard, predefined way of modeling substitutions. However, researchers often do not know if a given matrix family or any individual matrix within a family is the most suitable. Furthermore, predefined matrix families may inaccurately reflect a particular hypothesis that a researcher wishes to model or otherwise result in unsatisfactory alignments. In these cases, the ability to compare the effects of one or more custom matrices may be needed. This laborious process is often performed manually because the ability to simultaneously load multiple matrices and then compare their effects on alignments is not readily available in current software tools. This paper presents SubVis, an interactive R package for loading and applying multiple substitution matrices to pairwise alignments. Users can simultaneously explore alignments resulting from multiple predefined and custom substitution matrices. SubVis utilizes several of the alignment functions found in R, a common language among protein scientists. Functions are tied together with the Shiny platform, which allows the modification of input parameters. Information regarding alignment quality and individual amino acid substitutions is displayed with the JavaScript language, which provides interactive visualizations for revealing both high-level and low-level alignment information.
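To make the idea of comparing substitution matrices concrete, the following is a minimal Python sketch rather than SubVis itself, which is an R package whose API is not shown here. It scores one pair of sequences under two toy matrices using a Needleman-Wunsch global alignment with a linear gap penalty. The five-letter alphabet, the matrix values, the gap penalty, and the names `make_matrix`, `global_alignment_score`, and `compare_matrices` are all assumptions made for illustration. They are not BLOSUM or PAM values and are not taken from the paper.

```python
from itertools import product


def make_matrix(alphabet, match, mismatch, similar_pairs=(), similar=0):
    """Build a symmetric toy substitution matrix keyed by residue pairs.

    Hypothetical helper: identical residues score `match`, pairs listed in
    `similar_pairs` score `similar`, and every other pair scores `mismatch`.
    """
    matrix = {}
    for a, b in product(alphabet, repeat=2):
        if a == b:
            matrix[(a, b)] = match
        elif (a, b) in similar_pairs or (b, a) in similar_pairs:
            matrix[(a, b)] = similar
        else:
            matrix[(a, b)] = mismatch
    return matrix


def global_alignment_score(seq1, seq2, matrix, gap=-2):
    """Return the optimal Needleman-Wunsch score with a linear gap penalty."""
    # prev holds the previous row of the dynamic-programming table.
    prev = [j * gap for j in range(len(seq2) + 1)]
    for i, a in enumerate(seq1, start=1):
        curr = [i * gap]
        for j, b in enumerate(seq2, start=1):
            curr.append(max(
                prev[j - 1] + matrix[(a, b)],  # align a with b
                prev[j] + gap,                 # gap in seq2
                curr[j - 1] + gap,             # gap in seq1
            ))
        prev = curr
    return prev[-1]


def compare_matrices(seq1, seq2, matrices, gap=-2):
    """Score the same sequence pair under every matrix in `matrices`."""
    return {
        name: global_alignment_score(seq1, seq2, matrix, gap)
        for name, matrix in matrices.items()
    }


# Toy alphabet and matrices (assumed values, for illustration only).
ALPHABET = "ILVKR"
MATRICES = {
    # Rewards only exact matches.
    "identity": make_matrix(ALPHABET, match=2, mismatch=-1),
    # Also gives partial credit to chemically similar residues.
    "conservative": make_matrix(
        ALPHABET, match=2, mismatch=-1,
        similar_pairs={("I", "L"), ("I", "V"), ("L", "V"), ("K", "R")},
        similar=1,
    ),
}

scores = compare_matrices("ILKV", "LLRV", MATRICES)
for name, score in scores.items():
    print(f"{name}: {score}")
```

In this toy case the same pair of sequences scores 2 under the identity-style matrix and 6 under the matrix that credits conservative substitutions, which is the kind of matrix-dependent difference the abstract describes.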
Keywords
Bioinformatics, Substitution matrix, Sequence alignment, R package, Visual analytics