Covariances Simultaneous Component Analysis: a new method within a framework for modeling covariances

Journal of Chemometrics (2015)

Abstract
In modern omics research, it is more rule than exception that multiple data sets are collected in a study pertaining to the same biological organism. In such cases, it is worthwhile to analyze all data tables simultaneously to arrive at global information of the biological system. This is the area of data fusion or multi-set analysis, which is a lively research topic in chemometrics, bioinformatics, and biostatistics. Most methods of analyzing such complex data focus on group means, treatment effects, or time courses. There is also information present in the covariances among variables within a group, because this relates directly to individual differences, heterogeneity of responses, and changes of regulation in the biological system.

We present a framework for analyzing covariance matrices and a new method that fits nicely in this framework. This new method is based on combining covariance prototypes using simultaneous components and is, therefore, coined Covariances Simultaneous Component Analysis (COVSCA). We present the framework and our new method in mathematical terms, thereby explaining the (dis)similarities of the methods.

Systems biology models based on differential equations illustrate the type of variation generated in real-life biological systems and how this type of variation can be modeled within the framework and with COVSCA. The method is subsequently applied to two real-life data sets from human and plant metabolomics studies showing biologically meaningful results. Copyright (c) 2015 John Wiley & Sons, Ltd.

In modern omics research, multiple data sets are often collected in a study pertaining to the same biological organism. In such cases, it is worthwhile to analyze all data sets simultaneously. There is information present in the covariances among variables within a data set. We present a framework for analyzing covariance matrices and a new method (Covariances Simultaneous Component Analysis) that fits nicely in this framework. The method is applied to two real-life metabolomics data sets showing biologically meaningful results.
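
The idea of "combining covariance prototypes" can be illustrated with a small numerical sketch. The abstract does not state the model equation, so the form used here is an assumption: each group's covariance matrix S_k is approximated by a weighted sum of low-rank prototypes, S_k ≈ Σ_l c_kl Z_l Z_l^T. The function names (`group_covariances`, `prototype`, `fit_weights`) are hypothetical, and the prototypes are held fixed with weights found by plain least squares, whereas the actual method presumably estimates prototypes and weights jointly and may constrain them.

```python
import numpy as np

def group_covariances(tables):
    """One covariance matrix per data table (rows = samples, columns = shared variables)."""
    return [np.cov(X, rowvar=False) for X in tables]

def prototype(Z):
    """Low-rank covariance prototype Z Z^T built from a loading matrix Z (assumed form)."""
    return Z @ Z.T

def fit_weights(covs, prototypes):
    """Least-squares weights c[k, l] such that covs[k] ~ sum_l c[k, l] * prototypes[l].

    Simplification: prototypes are fixed and no constraint is placed on the weights.
    """
    A = np.stack([P.ravel() for P in prototypes], axis=1)  # shape (J*J, L)
    B = np.stack([S.ravel() for S in covs], axis=1)        # shape (J*J, K)
    C, *_ = np.linalg.lstsq(A, B, rcond=None)              # shape (L, K)
    return C.T                                             # shape (K, L)

# Synthetic illustration: three groups built from two prototypes over J = 5 variables.
rng = np.random.default_rng(0)
J = 5
protos = [prototype(rng.normal(size=(J, 1))),   # rank-1 prototype
          prototype(rng.normal(size=(J, 2)))]   # rank-2 prototype
true_c = np.array([[1.0, 0.0],
                   [0.5, 2.0],
                   [0.0, 1.5]])
covs = [sum(c * P for c, P in zip(row, protos)) for row in true_c]
est_c = fit_weights(covs, protos)  # recovers true_c because the data are noise-free
```

In practice the covariance matrices would come from measured data tables via something like `group_covariances`, and the estimated weights would indicate how strongly each group expresses each prototype.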
Keywords
indirect fitting, derived data, INDSCAL, IDIOSCAL, multiblock data, metabolomics