Virus-Host Interactions Predictor (VHIP): machine learning approach to resolve microbial virus-host interaction networks

G. Eric Bastien,Rachel N Cable,Luis Zaman, Cecelia Batterbee, Anthony J Wing,Melissa Beth Duhaime

bioRxiv (Cold Spring Harbor Laboratory)(2023)

引用 0|浏览3
暂无评分
摘要
Viruses of microbes are ubiquitous biological entities that reprogram their hosts' metabolisms during infection in order to produce viral progeny, impacting the ecology and evolution of microbiomes with broad implications for human and environmental health. Advances in genome sequencing have led to the discovery of millions of novel viruses and an appreciation for the great diversity of viruses on Earth. Yet, with knowledge of only "who is there?" we fall short in our ability to infer the impacts of viruses on microbes at population, community, and ecosystem-scales. To do this, we need a more explicit understanding "who do they infect?" Here, we developed a novel machine learning model (ML), Virus-Host Interaction Predictor (VHIP), to predict virus-host interactions (infection/non-infection) from input virus and host genomes. This ML model was trained and tested on a high-value manually curated set of 8849 virus-host pairs and their corresponding sequence data. The resulting dataset, Virus Host Range network (VHRnet), is core to VHIP functionality. Each data point that underlies the VHIP training and testing represents a lab-tested virus-host pair in VHRnet, from which features of coevolution were computed. VHIP departs from existing virus-host prediction models in its ability to predict multiple interactions rather than predicting a single most likely host or host clade. As a result, VHIP is the first virus-host range prediction tool able to reconstruct the complexity of virus-host networks in natural systems. VHIP has an 87.8% accuracy rate at predicting interactions between virus-host pairs at the species level and can be applied to novel viral and host population genomes reconstructed from metagenomic datasets. Through the reconstruction of complete virus-host networks from novel data, VHIP allows for the integration of multilayer network theory into microbial ecology and opens new opportunities to study ecological complexity in microbial systems. ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
关键词
interactions,interactions,machine learning,virus-host,virus-host
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要