Scenes-Objects-Actions: A Multi-task, Multi-label Video Dataset
ECCV, pp. 660-676, 2018.
This paper introduces a large-scale, multi-label and multi-task video dataset named Scenes-Objects-Actions (SOA). Most prior video datasets are based on a predefined taxonomy, which is used to define the keyword queries issued to search engines. The videos retrieved by the search engines are then verified for correctness by human annotato...More
PPT (Upload PPT)