Video Jigsaw: Unsupervised Learning of Spatiotemporal Context for Video Action Recognition
2019 IEEE Winter Conference on Applications of Computer Vision (WACV), Volume abs/1808.07507, 2019.
Task analysisTrainingOptical imagingVisualizationTrackingMore(2+)
We propose a self-supervised learning method to jointly reason about spatial and temporal context for video recognition. Recent self-supervised approaches have used spatial context [9, 34] as well as temporal coherency  but a combination of the two requires extensive preprocessing such as tracking objects through millions of video fra...More
Full Text (Upload PDF)
PPT (Upload PPT)