Guaranteed Quantization Error Computation for Neural Network Model Compression

Wesley Cooke, Zihao Mo, Weiming Xiang

CoRR (2023)

Abstract
Neural network model compression techniques can address the computational cost of running deep neural networks on embedded devices in industrial systems. This paper addresses the problem of computing a guaranteed output error for neural network compression by quantization. A merged neural network is built from a feedforward neural network and its quantized version so that its output is exactly the difference between the outputs of the two networks. Optimization-based methods and reachability analysis methods are then applied to the merged neural network to compute the guaranteed quantization error. Finally, a numerical example is presented to validate the applicability and effectiveness of the proposed approach.
Keywords
quantization error computation, compression, neural network, neural network model
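
The merged-network idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes two ReLU feedforward networks with identical layer sizes, stacks them block-diagonally, and subtracts the two branches in the last layer. The `quantize` helper is a hypothetical uniform rounding scheme, and `interval_bounds` uses plain interval arithmetic as a simple stand-in for the optimization-based and reachability methods the abstract refers to, so the bound it returns is sound but may be loose.

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def forward(layers, x):
    """Evaluate a ReLU network given as a list of (W, b); the last layer is linear."""
    for W, b in layers[:-1]:
        x = relu(W @ x + b)
    W, b = layers[-1]
    return W @ x + b

def quantize(layers, step=0.25):
    """Hypothetical quantizer (assumption): round weights and biases to multiples of step."""
    return [(np.round(W / step) * step, np.round(b / step) * step) for W, b in layers]

def merge(layers, q_layers):
    """Build one network whose output is forward(layers, x) - forward(q_layers, x)."""
    merged = []
    last = len(layers) - 1
    for i, ((W, b), (Wq, bq)) in enumerate(zip(layers, q_layers)):
        if i == 0:
            # Both branches read the same input vector.
            Wm = np.vstack([W, Wq])
        else:
            # Later layers keep the two branches separate (block-diagonal weights).
            Wm = np.block([[W, np.zeros((W.shape[0], Wq.shape[1]))],
                           [np.zeros((Wq.shape[0], W.shape[1])), Wq]])
        bm = np.concatenate([b, bq])
        if i == last:
            # Output layer: original branch minus quantized branch.
            n = W.shape[0]
            D = np.hstack([np.eye(n), -np.eye(n)])
            Wm, bm = D @ Wm, D @ bm
        merged.append((Wm, bm))
    return merged

def interval_bounds(layers, lo, hi):
    """Sound output bounds over the input box lo <= x <= hi via interval arithmetic."""
    last = len(layers) - 1
    for i, (W, b) in enumerate(layers):
        Wp, Wn = np.maximum(W, 0.0), np.minimum(W, 0.0)
        lo, hi = Wp @ lo + Wn @ hi + b, Wp @ hi + Wn @ lo + b
        if i < last:
            lo, hi = relu(lo), relu(hi)
    return lo, hi

# Small random network used purely for illustration (not data from the paper).
rng = np.random.default_rng(0)
layers = [(rng.normal(size=(4, 2)), rng.normal(size=4)),
          (rng.normal(size=(1, 4)), rng.normal(size=1))]
q_layers = quantize(layers)
merged = merge(layers, q_layers)

# Guaranteed bound on |f(x) - f_q(x)| for every x in the box [-1, 1]^2.
lo, hi = interval_bounds(merged, -np.ones(2), np.ones(2))
bound = np.max(np.maximum(np.abs(lo), np.abs(hi)))
```

Because the merged network outputs the difference directly, any sound bound on its output over an input set is a guaranteed bound on the quantization error over that set. Tighter bounds would require a more precise analysis than the interval arithmetic used here.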