P-GRe: an efficient pipeline to maximised pseudogene prediction in plants/eucaryotes

bioRxiv (2023)

Abstract
Formerly considered part of "junk DNA", pseudogenes are now known for their role in the post-transcriptional regulation of functional genes. In addition, their identification allows a better understanding of gene evolution in the context of multigenic families. Despite this, there is, to our knowledge, no fully automatic, user-friendly software for annotating pseudogenes across a whole genome. Here, we present Pseudo-Gene Retriever (P-GRe), a fully automated pseudogene prediction software requiring only a genome sequence and its corresponding GFF annotation file. P-GRe detects pseudogene sequences across a whole genome and returns to the user their genomic sequences and their pseudo-coding sequences. The ability of P-GRe to finely reconstruct the structure of pseudogenes also makes it possible to obtain a set of proteins virtually encoded by the predicted pseudogenes. We show here that in 70% of cases, virtual proteins constructed by P-GRe from the Arabidopsis thaliana proteome and genome aligned better to their parent protein than their annotated counterparts did.

Competing Interest Statement: The authors have declared no competing interest.