Publication | Closed Access
MODEC: Multimodal Decomposable Models for Human Pose Estimation
432
Citations
18
References
2013
Year
Unknown Venue
EngineeringMachine LearningHuman Pose Estimation3D Pose EstimationBiometricsHuman ModellingImage AnalysisData SciencePattern RecognitionRobot LearningMachine VisionDecomposable ModelMode SelectionComputer ScienceVideo UnderstandingDeep LearningPose EstimationComputer VisionScene UnderstandingScene Modeling
We propose a multimodal, decomposable model for articulated human pose estimation in monocular images. A typical approach to this problem is to use a linear structured model, which struggles to capture the wide range of appearance present in realistic, unconstrained images. In this paper, we instead propose a model of human pose that explicitly captures a variety of pose modes. Unlike other multimodal models, our approach includes both global and local pose cues and uses a convex objective and joint training for mode selection and pose estimation. We also employ a cascaded mode selection step which controls the trade-off between speed and accuracy, yielding a 5x speedup in inference and learning. Our model outperforms state-of-the-art approaches across the accuracy-speed trade-off curve for several pose datasets. This includes our newly-collected dataset of people in movies, FLIC, which contains an order of magnitude more labeled data for training and testing than existing datasets.
| Year | Citations | |
|---|---|---|
Page 1
Page 1