Publication | Closed Access
First-Person Daily Activity Recognition With Manipulated Object Proposals and Non-Linear Feature Fusion
Citations: 45
References: 38
Year: 2017
Object Proposals, Novel Pipeline, Convolutional Neural Network, Engineering, Machine Learning, Manipulated Object Proposals, Human Pose Estimation, Action Recognition (Movement Science), 3D Pose Estimation, Wearable Technology, Action Recognition (Computer Vision), Video Interpretation, Human-object Interaction, Image Analysis, Kinesiology, Data Science, Non-linear Feature Fusion, Pattern Recognition, Robot Learning, Video Transformer, Health Sciences, Detection Frameworks, Dance, Machine Vision, Object Detection, Computer Science, Video Understanding, Deep Learning, Computer Vision, Motion Detection, Video Analysis, Human Movement, Activity Recognition, Motion Analysis
Most previous work on first-person video recognition measures the similarity of actions using low-level features of the objects that people interact with. However, noisy camera motion and frequent changes in viewpoint and scale prevent these methods from capturing and modeling highly discriminative object features. In this paper, we propose a novel pipeline for first-person daily activity recognition. Our object feature extraction pipeline is inspired by the recent success of object hypotheses and deep convolutional neural network (CNN)-based detection frameworks. Our key contribution is a simple yet effective scheme for generating manipulated object proposals. The scheme leverages motion cues such as motion boundaries and motion magnitude (in contrast, most previous methods treat camera motion as “noise”) to generate a more compact and discriminative set of object proposals that is more closely related to the objects being manipulated. We then learn more discriminative object detectors from these manipulated object proposals using a region-based CNN. We also develop a non-linear feature fusion scheme that better combines object and motion features. Experiments show that the proposed framework significantly outperforms the state of the art on a challenging first-person daily activity benchmark.
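The abstract does not specify how motion cues are turned into proposal scores, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes that candidate boxes and a per-pixel motion-magnitude map are already available, and it simply ranks boxes by the mean motion magnitude they enclose. The function name `rank_proposals_by_motion`, the `top_k` parameter, and the scoring rule are all hypothetical; motion boundaries and camera-motion handling are left out entirely.

```python
import numpy as np


def rank_proposals_by_motion(boxes, flow_mag, top_k=2):
    """Keep the top_k boxes with the highest mean motion magnitude.

    Hypothetical helper for illustration only.
    boxes:    (N, 4) integer array of [x1, y1, x2, y2], with x2/y2 exclusive.
    flow_mag: (H, W) array of per-pixel motion magnitude (assumed precomputed).
    """
    scores = []
    for x1, y1, x2, y2 in boxes:
        region = flow_mag[y1:y2, x1:x2]
        # Empty boxes get a score of zero instead of raising a warning.
        scores.append(region.mean() if region.size else 0.0)
    scores = np.asarray(scores)
    # Sort by descending score and keep the strongest candidates.
    order = np.argsort(-scores)[:top_k]
    return boxes[order], scores[order]


# Toy input: a 100x100 frame where only a 20x20 patch is moving.
flow_mag = np.zeros((100, 100))
flow_mag[40:60, 40:60] = 5.0

boxes = np.array([
    [0, 0, 20, 20],    # static background
    [40, 40, 60, 60],  # tight box around the moving patch
    [30, 30, 70, 70],  # loose box around the moving patch
])

kept, kept_scores = rank_proposals_by_motion(boxes, flow_mag, top_k=2)
print(kept)
print(kept_scores)
```

In this toy setup the tight box around the moving patch ranks first and the static background box is dropped, which mirrors the stated goal of a more compact proposal set focused on manipulated objects. A real first-person pipeline would also need to separate hand and object motion from camera motion, which this sketch does not attempt.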