Systematic Normalization with Multiple Housekeeping Genes for the Discovery of Genetic Dependencies in Cancer

bioRxiv (Cold Spring Harbor Laboratory)(2020)

引用 0|浏览0
暂无评分
摘要
ABSTRACT Cancer results from complex interactions between genes that are misregulated. Although our understanding of the contribution of single genes to cancer is expansive, the interplay between genes in the context of this devastating disease remains to be understood. Using the Genomic Data Commons Data Portal through National Cancer Institute, we randomly selected ten data sets of breast cancer gene expression, acquired by RNA sequencing to be subjected to a computational method for the exploration of genetic interactions at a large scale. We focused on genes that suppress genome instability (GIS genes) since function or expression of these genes is often altered in cancer. In this paper, we show how to discover pairs of genes whose expressions demonstrate patterns of correlation. To ensure an inter-comparison across data sets, we tested statistical normalization approaches derived from the expression of randomly selected single housekeeping genes, or from the average of three. In addition, we systematically selected ten housekeeping genes for the purpose of normalization. Using normalized expression data, we determined R 2 values from linear models for all possible pairs of GIS genes and presented our results using heatmaps. Despite the heterogeneity of data, we observed that multiple gene normalization revealed more consistent correlations between pairs of genes, compared to using single gene expressions. We also noted that multiple gene normalization using ten genes outperformed normalization using three randomly selected genes. Since this study uses gene expression data from cancer tissues and begins to address the reproducibility of correlation between two genes, it complements other efforts to identify gene pairs that co-express in cancer cell lines. In the future, we plan to define consistent genetic correlations by using gene expression data derived from different types of cancer and multiple gene normalization. CCS CONCEPTS Applied computing → Computational biology . ACM Reference Format Oliver Bonham-Carter and Yee Mon Thu. 2019. Systematic Normalization with Multiple Housekeeping Genes for the Discovery of Genetic Dependencies in Cancer. In Niagara Falls, New York. ACM, New York, NY, USA, 10 pages. https://doi.org/10.1145/nnnnnnn.nnnnnnn
更多
查看译文
关键词
multiple housekeeping genes,genetic dependencies,cancer
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要