Large Data Flow Graphs in Limited GPU Memory

2019 IEEE International Conference on Big Data (Big Data)

Abstract
The size of a GPU's memory imposes strict limits both on the complexity of neural networks and on the size of the data samples that can be processed. This paper presents methods for the efficient use of GPU memory by the TensorFlow machine learning framework when processing large data flow graphs of neural networks. The proposed techniques make use of swapping data between GPU and CPU memory, data compression, and serialization of computation. The data flow graph is modified by inserting nodes for data transfer and compression, and by defining control dependencies that serialize the execution of graph nodes. The locations of the additional nodes and control dependencies are determined algorithmically by analyzing the graph's topology and the complexity of the operations implemented by the graph's nodes. Our experiments show the capability to process 3D U-Net [1] on 192³-sized images with batch size 4, and ResNet [2] models with more than a 7-fold increase in the maximum batch size.
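The core mechanism the abstract describes, inserting swap nodes and control dependencies into the data flow graph, can be illustrated with a short TensorFlow 1.x graph-mode sketch. This is only a minimal illustration under assumed tensor names and sizes; the paper's actual algorithm for placing these nodes, which analyzes graph topology and operation cost, is not reproduced here.

```python
import tensorflow as tf  # TensorFlow 1.x graph-mode API

# A minimal sketch of swap-out/swap-in plus serialization via control
# dependencies. Tensor names and layer sizes are illustrative assumptions,
# not the paper's implementation.

graph = tf.Graph()
with graph.as_default():
    # A large 5-D activation, e.g. a 3D U-Net feature map.
    x = tf.random.normal([1, 64, 64, 64, 8])
    feat = tf.layers.conv3d(x, filters=16, kernel_size=3, padding="same")

    # Swap-out node: an identity op pinned to the CPU copies the
    # activation into host memory, freeing the GPU buffer for reuse.
    with tf.device("/cpu:0"):
        feat_host = tf.identity(feat, name="swap_out")

    # Another branch of the graph that runs while `feat` lives on the host.
    branch = tf.layers.conv3d(x, filters=16, kernel_size=3, padding="same")

    # Swap-in node: copy back to the GPU, with a control dependency that
    # serializes it after `branch`, so both activations never occupy GPU
    # memory at the same time.
    with tf.device("/gpu:0"), tf.control_dependencies([branch]):
        feat_gpu = tf.identity(feat_host, name="swap_in")

    out = feat_gpu + branch

# allow_soft_placement lets the sketch also run on CPU-only machines.
config = tf.ConfigProto(allow_soft_placement=True)
with tf.Session(graph=graph, config=config) as sess:
    sess.run(tf.global_variables_initializer())
    sess.run(out)
```

The control dependency is what provides the serialization mentioned in the abstract: TensorFlow may otherwise schedule independent ops concurrently, so without it the swap-in could execute before `branch` has released its GPU buffers.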
Keywords
data flow graph,GPU memory,neural networks,data compression,data transfer,graph nodes,TensorFlow machine learning framework