A reliable effective terascale linear learning system

Journal of Machine Learning Research(2014)

引用 446|浏览96
暂无评分
摘要
We present a system and a set of techniques for learning linear predictors with convex losses on terascale data sets, with trillions of features, billions of training examples and millions of parameters in an hour using a cluster of 1000 machines. Individually none of the component techniques are new, but the careful synthesis required to obtain an efficient implementation is. The result is, up to our knowledge, the most scalable and efficient linear learning system reported in the literature. We describe and thoroughly evaluate the components of the system, showing the importance of the various design choices.
更多
查看译文
关键词
distributed machine learning,repeated online averaging,allreduce,hadoop,distributed l-bfgs
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要