SCSampler - Sampling Salient Clips From Video for Efficient Action Recognition
ICCV, pp. 6231-6241, 2019.
While many action recognition datasets consist of collections of brief, trimmed videos each containing a relevant action, videos in the real world (e.g., on YouTube) exhibit very different properties: they are often several minutes long, where brief relevant clips are often interleaved with segments of extended duration containing littl...