Publication | Closed Access
Temporal Action Localization with Pyramid of Score Distribution Features
210
Citations
35
References
2016
Year
Unknown Venue
EngineeringMachine LearningVideo SummarizationVideo RetrievalVideo InterpretationImage AnalysisData SciencePattern RecognitionRobot LearningVideo TransformerAction LocalizationDanceMachine VisionVideo UnderstandingTemporal Action LocalizationDeep LearningComputer VisionClassification ArchitecturesHuman MovementArtsActivity Recognition
We investigate the feature design and classification architectures in temporal action localization. This application focuses on detecting and labeling actions in untrimmed videos, which brings more challenge than classifying presegmented videos. The major difficulty for action localization is the uncertainty of action occurrence and utilization of information from different scales. Two innovations are proposed to address this issue. First, we propose a Pyramid of Score Distribution Feature (PSDF) to capture the motion information at multiple resolutions centered at each detection window. This novel feature mitigates the influence of unknown action position and duration, and shows significant performance gain over previous detection approaches. Second, inter-frame consistency is further explored by incorporating PSDF into the state-of-the-art Recurrent Neural Networks, which gives additional performance gain in detecting actions in temporally untrimmed videos. We tested our action localization framework on the THUMOS'15 and MPII Cooking Activities Dataset, both of which show a large performance improvement over previous attempts.
| Year | Citations | |
|---|---|---|
Page 1
Page 1