Lightweight 3D Human Pose Estimation Network Training Using Teacher-Student Learning

2020 IEEE Winter Conference on Applications of Computer Vision (WACV)(2020)

引用 26|浏览45
暂无评分
摘要
We present MoVNect, a lightweight deep neural network to capture 3D human pose using a single RGB camera. To improve the overall performance of the model, we apply the teacher-student learning method based knowledge distillation to 3D human pose estimation. Real-time post-processing makes the CNN output yield temporally stable 3D skeletal information, which can be used in applications directly. We implement a 3D avatar application running on mobile in real-time to demonstrate that our network achieves both high accuracy and fast inference time. Extensive evaluations show the advantages of our lightweight model with the proposed training method over previous 3D pose estimation methods on the Human3.6M dataset and mobile devices.
更多
查看译文
关键词
fast inference time,lightweight model,training method,Human3.6M dataset,MoVNect,lightweight deep neural network,single RGB camera,teacher-student learning method based knowledge distillation,real-time post-processing,3D avatar application,temporally stable 3D skeletal information,CNN output,lightweight 3D human pose estimation network training
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要