Publication | Closed Access
Discriminative Hierarchical Modeling of Spatio-temporally Composable Human Activities
86
Citations
37
References
2014
Year
Unknown Venue
Physical ActivityEngineeringMachine LearningHuman Pose EstimationVideo InterpretationNatural Language ProcessingHuman ActivitiesImage AnalysisKinesiologyData SciencePattern RecognitionSimple Human ActionsRobot LearningVideo TransformerComplex Human ActivitiesHealth SciencesMachine VisionAction PatternVideo UnderstandingDeep LearningComputer VisionDiscriminative Hierarchical ModelingHuman MovementActivity RecognitionSpatio-temporal Model
This paper proposes a framework for recognizing complex human activities in videos. Our method describes human activities in a hierarchical discriminative model that operates at three semantic levels. At the lower level, body poses are encoded in a representative but discriminative pose dictionary. At the intermediate level, encoded poses span a space where simple human actions are composed. At the highest level, our model captures temporal and spatial compositions of actions into complex human activities. Our human activity classifier simultaneously models which body parts are relevant to the action of interest as well as their appearance and composition using a discriminative approach. By formulating model learning in a max-margin framework, our approach achieves powerful multi-class discrimination while providing useful annotations at the intermediate semantic level. We show how our hierarchical compositional model provides natural handling of occlusions. To evaluate the effectiveness of our proposed framework, we introduce a new dataset of composed human activities. We provide empirical evidence that our method achieves state-of-the-art activity classification performance on several benchmark datasets.
| Year | Citations | |
|---|---|---|
Page 1
Page 1