Selective pivot logratio coordinates for partial least squares discriminant analysis modelling with applications in metabolomics

STAT(2023)

Abstract
Data resulting from high-throughput biological experiments are frequently relative in nature. This implies that the most relevant information lies in the shape of the data distribution across the biological features rather than in the size of the measurements themselves. One well-established way to acknowledge this in statistical processing is through logratio analysis. In the current work, we introduce selective pivot logratio coordinates as a new type of orthonormal logratio coordinate representation for high-dimensional relative (a.k.a. compositional) data. The proposal aims to improve the identification of biomarkers in binary classification problems, a common setting for scientific studies in the field. These logratio coordinates are constructed so that the pivot coordinate representing a given compositional part aggregates all pairwise logratios of that part to the rest but, unlike in the ordinary formulation, excludes those that deviate from the main pattern. For practical application, this novel coordinate system is embedded within a partial least squares discriminant analysis (PLS-DA) model. Using both synthetic and real-world metabolomic data sets, we demonstrate the improved performance of the novel approach compared with other methods used in the area.
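To illustrate the statement that a pivot coordinate aggregates the pairwise logratios of one part to the rest, the following minimal Python sketch computes the ordinary (non-selective) first pivot coordinate in two equivalent ways. It is not the authors' implementation: the function names are hypothetical, the input is assumed to be a single strictly positive composition with the part of interest in the first position, and the rule used by the selective variant to exclude deviating logratios is not given in this abstract, so it is not reproduced here.

```python
import numpy as np


def first_pivot_coordinate(x):
    """Ordinary first pivot coordinate of a strictly positive composition x.

    Computed as sqrt((D-1)/D) * ln(x_1 / geometric mean of x_2..x_D).
    (Hypothetical helper name; standard pivot-coordinate formula.)
    """
    x = np.asarray(x, dtype=float)
    D = x.size
    log_gmean_rest = np.mean(np.log(x[1:]))  # log of the geometric mean of the other parts
    return np.sqrt((D - 1) / D) * (np.log(x[0]) - log_gmean_rest)


def aggregated_pairwise_logratios(x):
    """Same coordinate written as a scaled sum of pairwise logratios ln(x_1 / x_j)."""
    x = np.asarray(x, dtype=float)
    D = x.size
    pairwise = np.log(x[0] / x[1:])  # logratios of the first part to each other part
    return pairwise.sum() / np.sqrt(D * (D - 1))


# Illustrative composition (made-up values, not data from the paper)
x = [0.2, 0.3, 0.5]
z_direct = first_pivot_coordinate(x)
z_pairwise = aggregated_pairwise_logratios(x)
```

Both functions return the same value, and rescaling the input (for example using `[2, 3, 5]` instead of `[0.2, 0.3, 0.5]`) leaves it unchanged, which reflects the relative nature of the data described above. A selective variant would, under the description in the abstract, sum over only a subset of the pairwise logratios.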
Keywords
selective pivot logratio coordinates, partial least squares, discriminant analysis