Publication | Closed Access
Exploring Action Centers for Temporal Action Localization
24
Citations
53
References
2023
Year
Artificial IntelligenceEngineeringMachine LearningIntelligent SystemsAction CentersLocalizationVideo RetrievalVideo InterpretationImage AnalysisData SciencePattern RecognitionRobot LearningVideo TransformerHuman ActionsAction LocalizationDanceMachine VisionAction PatternComputer ScienceVideo UnderstandingTemporal Action LocalizationDeep LearningComputer VisionRoboticsActivity Recognition
Temporal action localization aims at detecting the temporal intervals of human actions in untrimmed videos. Most previous methods rely on locating and matching the start and end times of actions. However, action boundaries are ambiguous and uncertain in nature, which leads to inaccurate action localization and a lot of false positives. In this paper, we introduce a new framework for temporal action localization. It explicitly models temporal action centers to reduce unreliable action detection results caused by ambiguous action boundaries. Since action centers are highly related to semantic actions, they can be detected more reliably than the conventional action boundaries. As a result, our framework can exclude false positives and promote high-quality proposals. Based on action centers, we propose a triplet feature fusion mechanism. It performs neural message passing among the boundaries and the center as well as contextual regions outside of the proposal to enrich its representation. In addition, we introduce a centerness scoring method to suppress proposals deviating from the centers of action instances. Consequently, our network can retrieve high-quality action proposals and locate actions more precisely. Experimental results show our method outperforms state-of-the-art methods on the THUMOS14 and ActivityNet v1.3 datasets.
| Year | Citations | |
|---|---|---|
Page 1
Page 1