Optimizing the Union of Intersections LASSO (UoI_LASSO) and Vector Autoregressive (UoI_VAR) Algorithms for Improved Statistical Estimation at Scale

arXiv: Learning(2018)

引用 23|浏览43
暂无评分
摘要
The analysis of scientific data of increasing size and complexity requires statistical machine learning methods that are both interpretable and predictive. Union of Intersections (UoI), a recently developed framework, is a two-step approach that separates model selection and model estimation. A linear regression algorithm based on UoI, UoI_LASSO, simultaneously achieves low false positives and low false negative feature selection as well as low bias and low variance estimates. Together, these qualities make the results both predictive and interpretable. In this paper, we optimize the UoI_LASSO algorithm for single-node execution on NERSC's Cori Knights Landing, a Xeon Phi based supercomputer. We then scale UoI_LASSO to execute on cores ranging from 68-278,528 cores on a range of dataset sizes demonstrating the weak and strong scaling of the implementation. We also implement a variant of UoI_LASSO, UoI_VAR for vector autoregressive models, to analyze high dimensional time-series data. We perform single node optimization and multi-node scaling experiments for UoI_VAR to demonstrate the effectiveness of the algorithm for weak and strong scaling. Our implementations enable to use estimate the largest VAR model (1000 nodes) we are aware of, and apply it to large neurophysiology data 192 nodes).
更多
查看译文
关键词
intersections lasso,uoi lasso,improved statistical estimation,uoi var,vector autoregressive
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要