FNphasing: a novel fast heuristic algorithm for haplotype phasing based on flow network model.

IEEE/ACM Trans. Comput. Biology Bioinform.(2013)

引用 1|浏览33
暂无评分
摘要
An enormous amount of sequence data has been generated with the development of new DNA sequencing technologies, which presents great challenges for computational biology problems such as haplotype phasing. Although arduous efforts have been made to address this problem, the current methods still cannot efficiently deal with the incoming flood of large-scale data. In this paper, we propose a flow network model to tackle haplotype phasing problem, and explain some classical haplotype phasing rules based on this model. By incorporating the heuristic knowledge obtained from these classical rules, we design an algorithm FNphasing based on the flow network model. Theoretically, the time complexity of our algorithm is (O(n(2)m+m(2)), which is better than that of 2SNP, one of the most efficient algorithms currently. After testing the performance of FNphasing with several simulated data sets, the experimental results show that when applied on large-scale data sets, our algorithm is significantly faster than the state-of-the-art Beagle algorithm. FNphasing also achieves an equal or superior accuracy compared with other approaches.
更多
查看译文
关键词
large-scale data,beagle algorithm,classical haplotype,sequence data,dna,large-scale data sets,accuracy,dna sequencing,computational biology problem,haplotype phasing,o (n2m+m2),fast heuristic algorithm,heuristic programming,physiological models,biology computing,molecular biophysics,efficient algorithm,algorithm design and analysis,fnphasing,data models,large-scale data set,flow network model,simulated data set,heuristic methods,hidden markov models,state-of-the-art beagle algorithm,classical rule,bioinformatics,computational biology,2snp,novel fast heuristic algorithm,phylogeny,flow network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要