Multiset partial least squares with rank order of groups for integrating multi-omics data

biorxiv(2024)

引用 0|浏览0
暂无评分
摘要
As more multi-omics data such as metabolome, proteome and transcriptome data are acquired, statistical analyses to integrate multi-omics data has been required. Multivariate analysis for integration of multi-omics data has been proposed such as multiset partial least squares (PLS). However, when we want to extract scores about the rank order of groups such as severity of disease as well as difference of groups, we could not always extract scores with rank order of groups by using multiset PLS. Therefore, we proposed multiset PLS with rank order of groups (multiset PLS-ROG) to integrate multiple omics dataset. We visualized multiple omics data by using multiset PLS-ROG in two studies of multi-omics and multi-organ derived metabolome data. After we visualize data and focus on a specific score with phenotype of interest, it is important to select compounds to identify biomarker candidates or make biological inferences. In ordinary principal component analysis (PCA) or PLS, we could select statistical significantly compounds correlated with PC or PLS score by using PC or PLS loadings. However, multiset PLS loading has not been used because its statistical property has not been clarified. Then, we clarified statistical property of multiset PLS and multiset PLS-ROG loading, and we could identify statistically significant compounds by using statistical hypothesis testing. We implemented multiset PLS-ROG to R loadings package, freely available in CRAN. ### Competing Interest Statement The author is an employee of Human Metabolome Technologies, Inc.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要