Analysis of Insertional Sites of the SIRE1 Retroelement Family from Glycine Max Using GenBank BAC-end Sequences.

In Silico Biology(2008)

引用 24|浏览3
暂无评分
摘要
SIRE1 is a 2000-copy member of the Ty1/copia retroelement family found in the soybean genome and is closely related to sireviruses found in the genomes of other legumes. Although these elements closely resemble typical plant members of the Ty1/copia family, they are unusual in that they possess an envelope-like coding region immediately downstream of the reverse transcriptase gene. Despite its copy number, very few members of the SIRE1 family are currently present in publicly available genomic assemblies or draft contigs. However, fragments of family members are well-represented as BAC-ends in the GenBank Genome Survey Sequence database. This database was queried using the 5' and 3' ends of SIRE1 in order to catalog sequences into which SIRE1 members have integrated. Seven hundred and eighty-one unique SIRE1 insertions were identified and the majority of insertion sites constituted other repetitive elements, including Class I and Class II transposable elements and satellite DNAs. Ninety-four insertions were in single- or low-copy number sequences and three of these were homologous to characterized protein-coding genes. Examination of the ten bases flanking either side of SIRE1 revealed no clear consensus sequence, but the the distributions of A, C, G, and T at most of the positions were biased with strong statistical significance.
更多
查看译文
关键词
sire1 retroelement family,glycine max,bac-end
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要