Memory-Efficient Deep Learning Inference in Trusted Execution Environments
2021 IEEE International Conference on Cloud Engineering (IC2E), 2021
Abstract
This study identifies and proposes techniques to alleviate two key bottlenecks to executing deep neural networks in trusted execution environments (TEEs): page thrashing during the execution of convolutional layers and the decryption of large weight matrices in fully-connected layers. For the former, we propose a novel partitioning scheme, y-plane partitioning, designed to (i) provide consistent e...
Keywords
Deep learning, Privacy, Quantization (signal), Convolution, Memory management, Hardware, Distance measurement