Publication | Closed Access
NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis
Citations: 2.9K
References: 44
Year: 2016
Unknown Venue
Engineering, Machine Learning, Human Pose Estimation, 3D Pose Estimation, Wearable Technology, Human Activity Analysis, Video Interpretation, 3D Computer Vision, Image Analysis, Kinesiology, Data Science, Motion Capture, Pattern Recognition, NTU RGB+D, Video Transformer, Health Sciences, Machine Vision, Dance, Action Classes, Video Understanding, Deep Learning, Large Scale Dataset, Deep Learning Methods, Computer Vision, Human Movement, Activity Recognition
Recent approaches in depth-based human activity analysis achieved outstanding performance and proved the effectiveness of 3D representation for classification of action classes. Currently available depth-based and RGB+D-based action recognition benchmarks have a number of limitations, including the lack of training samples, distinct class labels, camera views and variety of subjects. In this paper we introduce a large-scale dataset for RGB+D human action recognition with more than 56 thousand video samples and 4 million frames, collected from 40 distinct subjects. Our dataset contains 60 different action classes including daily, mutual, and health-related actions. In addition, we propose a new recurrent neural network structure to model the long-term temporal correlation of the features for each body part, and utilize them for better action classification. Experimental results show the advantages of applying deep learning methods over state-of-the-art handcrafted features on the suggested cross-subject and cross-view evaluation criteria for our dataset. The introduction of this large-scale dataset will enable the community to apply, develop and adapt various data-hungry learning techniques for the task of depth-based and RGB+D-based human activity analysis.
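The cross-subject and cross-view criteria mentioned in the abstract can be read as group-wise splits: no subject (or camera view) contributes samples to both the training and the test set. The following is a minimal sketch of that idea only. The `Sample` record, the `split_by_group` helper, and every subject and camera ID below are hypothetical and chosen for illustration; the abstract does not state which subjects or cameras the authors assign to training.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Sample:
    """Hypothetical record for one video sample (not the dataset's own format)."""
    name: str
    subject: int  # ID of the person performing the action
    camera: int   # ID of the camera view
    action: int   # action class label


def split_by_group(samples, key, train_groups):
    """Split samples so that each group (subject or camera) lands on one side only."""
    train = [s for s in samples if key(s) in train_groups]
    test = [s for s in samples if key(s) not in train_groups]
    return train, test


# Toy data; the IDs are made up for this example.
samples = [
    Sample("a", subject=1, camera=1, action=1),
    Sample("b", subject=1, camera=2, action=2),
    Sample("c", subject=2, camera=1, action=1),
    Sample("d", subject=2, camera=3, action=2),
]

# Cross-subject: hold out all samples of some subjects for testing.
cs_train, cs_test = split_by_group(samples, lambda s: s.subject, {1})

# Cross-view: hold out all samples of some camera views for testing.
cv_train, cv_test = split_by_group(samples, lambda s: s.camera, {2, 3})
```

Splitting on the group key rather than on individual samples is what keeps a model from being scored on a person or viewpoint it has already seen during training.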