Publication | Closed Access
Discriminative figure-centric models for joint action localization and recognition
229
Citations
24
References
2011
Year
EngineeringMachine LearningHuman Pose Estimation3D Pose EstimationVideo InterpretationImage AnalysisData SciencePattern RecognitionTemporal SmoothnessRobot LearningUcf-sports DatasetPerson LocationMachine VisionDanceJoint Action LocalizationVideo UnderstandingDeep LearningComputer VisionEye TrackingActivity RecognitionMotion Analysis
In this paper we develop an algorithm for action recognition and localization in videos. The algorithm uses a figure-centric visual word representation. Different from previous approaches it does not require reliable human detection and tracking as input. Instead, the person location is treated as a latent variable that is inferred simultaneously with action recognition. A spatial model for an action is learned in a discriminative fashion under a figure-centric representation. Temporal smoothness over video sequences is also enforced. We present results on the UCF-Sports dataset, verifying the effectiveness of our model in situations where detection and tracking of individuals is challenging.
| Year | Citations | |
|---|---|---|
Page 1
Page 1