Team: a multiple testing algorithm on the aggregation tree for flow cytometry analysis

ANNALS OF APPLIED STATISTICS(2023)

引用 1|浏览9
暂无评分
摘要
In immunology studies, flow cytometry is a commonly used multivari-ate single-cell assay. One key goal in flow cytometry analysis is to detect the immune cells responsive to certain stimuli. Statistically, this problem can be translated into comparing two protein expression probability density func-tions (PDFs) before and after the stimulus; the goal is to pinpoint the regions where these two PDFs differ. Further screening of these differential regions can be performed to identify enriched sets of responsive cells. In this paper we model identifying differential density regions as a multiple testing prob-lem. First, we partition the sample space into small bins. In each bin we form a hypothesis to test the existence of differential PDFs. Second, we develop a novel multiple testing method, called TEAM (testing on the aggregation tree method), to identify those bins that harbor differential PDFs while control-ling the false discovery rate (FDR) under the desired level. TEAM embeds the testing procedure into an aggregation tree to test from fine-to coarse -resolution. The procedure achieves the statistical goal of pinpointing density differences to the smallest possible regions. TEAM is computationally effi-cient, capable of analyzing large flow cytometry data sets in much shorter time compared with competing methods. We applied TEAM and competing methods on a flow cytometry data set to identify T cells responsive to the cy-tomegalovirus (CMV)-pp65 antigen stimulation. With additional downstream screening, TEAM successfully identified enriched sets containing monofunc-tional, bifunctional, and polyfunctional T cells. Competing methods either did not finish in a reasonable time frame or provided less interpretable re-sults. Numerical simulations and theoretical justifications demonstrate that TEAM has asymptotically valid, powerful, and robust performance. Overall, TEAM is a computationally efficient and statistically powerful algorithm that can yield meaningful biological insights in flow cytometry studies.
更多
查看译文
关键词
Flow cytometry,multiple testing,aggregation tree,distribution difference,false dis-covery proportion (FDP)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要