Binary Representation and High Efficient Compression of 3D CNN Features for Action Recognition

Peiyin Xing,Peixi Peng,Yongsheng Liang,Tiejun Huang,Yonghong Tian

2020 Data Compression Conference (DCC)（2020）

引用 2|浏览62

暂无评分

摘要

A common framework of the action recognition is to collect the videos from different cameras into a cloud center firstly, and then perform the 3D CNN on the cloud server. Although directly, this framework will bring a huge burden to the cloud server and video transmission. To handle this challenge, the "front-cloud" collaborative processing architecture can be used. The most import issue is to compress the feature from 3D CNN effectively without significant loss of accuracy. We propose logarithmic quantization with a maximum value threshold and HEVC inter encoding for 3D CNN features. Experimental results on ResNet-50 and InceptionV1 show that the features can be represented by only 1 bit without significant loss of accuracy. The compression ratio of the quantized 1 bit features using HEVC inter coding can reach to 5000 times and the loss of accuracy is less than 1%.

查看译文

关键词

3D CNN features,action recognition,cloud center,cloud server,video transmission,front-cloud collaborative processing architecture,compression ratio,quantized 1 bit features,ResNet-50,InceptionV1

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要