Benchmarking deep neural networks for gesture recognition on embedded devices.

IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN) (2022)

Abstract
Gesture is one of the most widely used forms of communication between humans. In recent years, as factories adapt to the Industry 4.0 paradigm, the scientific community has shown growing interest in the design of Gesture Recognition (GR) algorithms for Human-Robot Interaction (HRI) applications. In this context, a GR algorithm needs to work in real time on embedded platforms with limited resources. However, in the available scientific literature, the proposed neural networks (i.e., 2D and 3D) and the modalities used to feed them (i.e., RGB, RGB-D, optical flow) are typically designed to optimize accuracy, with little attention paid to feasibility on low-power hardware devices. Analyzing the trade-off between accuracy and computational burden, for both networks and modalities, is therefore important if GR algorithms are to work in industrial robotics applications. In this paper, we perform a wide benchmark focusing not only on accuracy but also on computational burden, involving two different architectures (2D and 3D), two different backbones (MobileNet, ResNeXt), and four types of input modality (RGB, Depth, Optical Flow, Motion History Image) and their combinations.
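To give a sense of the size of the search space named in the abstract, the following is a minimal sketch that enumerates architecture, backbone, and modality combinations. It assumes every non-empty subset of the four modalities is a candidate input; the abstract does not state which combinations were actually evaluated, so the resulting count is only an upper bound under that assumption. The names `ARCHITECTURES`, `BACKBONES`, `MODALITIES`, `modality_subsets`, and `benchmark_grid` are illustrative and do not come from the paper.

```python
from itertools import combinations, product

# Values taken from the abstract; the variable names are hypothetical.
ARCHITECTURES = ["2D", "3D"]
BACKBONES = ["MobileNet", "ResNeXt"]
MODALITIES = ["RGB", "Depth", "Optical Flow", "Motion History Image"]


def modality_subsets(modalities):
    """Yield every non-empty combination of input modalities."""
    for size in range(1, len(modalities) + 1):
        yield from combinations(modalities, size)


def benchmark_grid():
    """Return (architecture, backbone, modalities) tuples for the full grid.

    Assumption: all non-empty modality subsets are considered. The paper
    may have evaluated only some of them.
    """
    return list(product(ARCHITECTURES, BACKBONES, modality_subsets(MODALITIES)))


grid = benchmark_grid()
# 2 architectures x 2 backbones x 15 non-empty modality subsets
print(len(grid))  # prints 60
```

Each of these configurations would need both an accuracy measurement and a cost measurement (for example latency or memory on the target device) for the trade-off analysis the abstract describes.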
Keywords
industrial robotics applications, wide benchmarking, computational burden, different backbones, input modalities, optical flow, deep neural networks, embedded devices, scientific community, Gesture Recognition algorithms, Human-Robot Interaction applications, GR algorithm, embedded platforms, available scientific literature, different proposed neural networks, RGB-D, low power hardware devices