Memory-Efficient Deep Learning Inference in Trusted Execution Environments

2021 IEEE International Conference on Cloud Engineering (IC2E)(2021)

引用 4|浏览3
暂无评分
摘要
This study identifies and proposes techniques to alleviate two key bottlenecks to executing deep neural networks in trusted execution environments (TEEs): page thrashing during the execution of convolutional layers and the decryption of large weight matrices in fully-connected layers. For the former, we propose a novel partitioning scheme, y-plane partitioning, designed to (i) provide consistent e...
更多
查看译文
关键词
Deep learning,Privacy,Quantization (signal),Convolution,Memory management,Hardware,Distance measurement
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要