BioNE: Integration of network embeddings for supervised learning

biorxiv(2022)

引用 0|浏览3
暂无评分
摘要
A network embedding approach reduces the analysis complexity of large biological networks by converting them to lowdimensional vector representations (features/embeddings). These lower-dimensional vectors can then be used in machine learning prediction tasks with a wide range of applications in computational biology and bioinformatics. Several network embedding approaches have been proposed with different methods of generating vector representations. These network embedding approaches can be quite diverse in terms of data representation and implementation. Moreover, most were not originally developed for biological networks. Therefore comparing and assessing the performance of these diverse models in practice, in biological contexts, can be challenging. To facilitate such comparisons, we have developed the BioNE framework for integration of different embedding methods in prediction tasks. Using this framework one can easily assess, for instance, whether combined vector representations from multiple embedding methods offer complementary information with regards to the network features and thus better performance on prediction tasks. In this paper, we present the BioNE software suite for embedding integration, which applies network embedding methods following standardised network preparation steps, and integrates the vector representations achieved by these methods using three different techniques. BioNE enables selection of prediction models, oversampling methods, feature selection methods, cross-validation type and cross-validation parameters. Availability and implementation BioNE pipeline and detailed explanation of implementation is freely available on GitHub, at ### Competing Interest Statement The authors have declared no competing interest.
更多
查看译文
关键词
network embeddings,supervised learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要