Positional correlation analysis improves reconstruction of full-length transcripts and alternative isoforms from noisy array signals or short reads.

BIOINFORMATICS (2012)

Abstract
Reconstructing full-length transcripts observed by next-generation sequencers or tiling arrays is essential for understanding the full range of transcriptome phenomena. Several reconstruction techniques have been developed. However, high levels of noise and bias remain a problem and interfere with reconstruction. A method is required that is robust against noise and bias and that correctly reconstructs transcripts regardless of the equipment used.

We propose a completely new statistical method that reconstructs full-length transcripts and can be applied to data from both next-generation sequencers and tiling arrays. The method, called ARTADE2, analyzes 'positional correlation', meaning the correlations of expression values for every combination of genomic positions across multiple transcriptional datasets. ARTADE2 then reconstructs full-length transcripts using a logistic model based on the positional correlation and a Markov model. ARTADE2 elucidated 17 591 full-length transcripts from 55 transcriptome datasets and showed notable performance compared with other recent prediction methods. Moreover, 1489 novel transcripts were discovered. We experimentally tested 16 novel transcripts, of which 14 were confirmed by reverse transcription-polymerase chain reaction and sequence mapping. The method also showed notable performance in reconstructing mRNA observed by a next-generation sequencer. Moreover, the positional correlation and factor analysis embedded in ARTADE2 successfully detected regions at which alternative isoforms may exist, and are thus expected to be applicable to discovering transcript biomarkers in a wide range of disciplines, including preemptive medicine.

Availability: http://matome.base.riken.jp
Contact: toyoda@base.riken.jp
Supplementary data are available at Bioinformatics online.
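To make the idea of 'positional correlation' concrete, the following is a minimal sketch, not ARTADE2's actual implementation. It assumes that expression values are arranged as a datasets-by-positions table and that the correlation in question is a plain Pearson coefficient computed across datasets for every pair of genomic positions. The function names `pearson` and `positional_correlation` and the toy numbers are hypothetical, chosen only for illustration.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def positional_correlation(expression):
    """Correlate every pair of genomic positions across datasets.

    expression[d][p] is the expression value of dataset d at position p
    (an assumed layout). Returns a positions x positions matrix.
    """
    n_pos = len(expression[0])
    # One column per genomic position, holding its values across all datasets.
    columns = [[row[p] for row in expression] for p in range(n_pos)]
    return [
        [pearson(columns[i], columns[j]) for j in range(n_pos)]
        for i in range(n_pos)
    ]


# Hypothetical toy data: 4 datasets x 5 positions.
# Positions 0-2 rise and fall together across datasets; positions 3-4 follow
# a different pattern, as might happen at a transcript boundary.
expression = [
    [1.0, 1.1, 0.9, 5.0, 5.2],
    [2.0, 2.1, 1.9, 1.0, 1.1],
    [3.0, 3.2, 2.9, 4.0, 4.1],
    [4.0, 4.1, 3.8, 2.0, 2.2],
]
corr = positional_correlation(expression)
```

In this toy table, positions that co-vary across datasets receive a correlation close to 1, whereas pairs drawn from the two different groups do not. Blocks of mutually high correlation are the kind of signal that a downstream model could use to decide which positions belong to the same transcript. How ARTADE2 combines this signal with its logistic and Markov models is not specified in the abstract and is not reproduced here.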
Keywords
artade2 analyzes,positional correlation analysis,noisy array signal,novel transcript,notable performance,reconstructs full-length transcript,positional correlation,alternative isoforms,recent prediction method,new statistical method,full-length transcript,tiling array,next-generation sequencer,computational biology,algorithms,transcriptome,markov chains,gene expression profiling