Evaluation of sequence-based tools to gather more insight into the positioning of rhizogenic agrobacteria within the Agrobacterium tumefaciens species complex
biorxiv(2024)
摘要
Rhizogenic Agrobacterium , the causative agent of hairy root disease (HRD), is known for its high phenotypic and genetic diversity. The taxonomy of rhizogenic agrobacteria has undergone several changes in the past and is still somewhat controversial. While the classification of Agrobacterium strains was initially mainly based on phenotypic properties and the symptoms they induced on plants, more and more genetic information has been used along the years to infer Agrobacterium taxonomy. This has led to the definition of the so-called Agrobacterium tumefaciens species complex (Atsc), which comprises several genomospecies. Interestingly, the rhizogenic Agrobacterium strains are found in several of these genomospecies. Nevertheless, even up until today Agrobacterium strains, and in particular rhizogenic agrobacteria, are prone to misclassification and considerable confusion in literature.
In this study, we evaluated different phylogenetic analysis approaches for their use to improve Agrobacterium taxonomy and tried to gain more insight in the classification of strains into this complex genus, with a particular focus on rhizogenic agrobacteria.
The genome sequence analysis of 579 assemblies, comprising Agrobacterium , Allorhizobium and Rhizobium strains demonstrated that phylogenies based on single marker genes, such as the commonly used 16S rRNA and recA gene, do not provide sufficient resolution for proper delineation of the different genomospecies within the Atsc.
Our results revealed that (in silico) multi-locus sequences analysis (MLSA) in combination with average nucleotide identity (ANIb) at a 94.0% threshold delineates genomospecies accurately and efficiently. Additionally, this latter approach permitted the identification of two new candidate genomospecies.
### Competing Interest Statement
The authors have declared no competing interest.
* AAI
: Average amino acid identity
ANI
: Average nucleotide identity
ANIb
: ANI based on BLAST
ANIm
: ANI based on MUMmer
Atsc
: Agrobacterium tumefaciens species complex
AIC
: Akaike’s information criterion
BIC
: Bayesian information criterion
CDS
: Coding sequence
CGH
: Comparative genome hybridization
DDH
: DNA-DNA hybridization
GBDP
: Genome BLAST distance phylogeny
GC
: Guanine-cytosine
GTRCAT
: CAT General Time Reversible
HRD
: Hairy root disease
ICSP
: International Committee on Systematics of Prokaryotes
IJSEM
: International Journal of Systematic and Evolutionary Microbiology
MLSA
: Multi-locus sequence analysis
ML
: Maximum Likelihood
NCPPB
: National Collection of Plant Pathogenic Bacteria
NMDS
: Nonmetric multidimensional scaling
LPSN
: List of Prokaryotic names with Standing in Nomenclature
OGRI
: Overall genome relatedness indices
pRi
: Root-inducing plasmid
pTi
: Tumor-inducing plasmid
TETRA
: Tetranucleotide frequencies
WGS
: Whole genome sequencing
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要