Publication | Open Access
Two-Stream Convolution Augmented Transformer for Human Activity Recognition
190
Citations
26
References
2021
Year
EngineeringMachine LearningHuman Pose EstimationWifi SignalsWearable TechnologyVideo InterpretationHuman ActivitiesKinesiologyImage AnalysisData SciencePattern RecognitionHuman Activity RecognitionVideo TransformerHealth SciencesMachine VisionComputer ScienceVideo UnderstandingMobile ComputingDeep LearningComputer VisionMobile SensingActivity Recognition
Recognition of human activities is an important task due to its far-reaching applications such as healthcare system, context-aware applications, and security monitoring. Recently, WiFi based human activity recognition (HAR) is becoming ubiquitous due to its non-invasiveness. Existing WiFi-based HAR methods regard WiFi signals as a temporal sequence of channel state information (CSI), and employ deep sequential models (e.g., RNN, LSTM) to automatically capture channel-over-time features. Although being remarkably effective, they suffer from two major drawbacks. Firstly, the granularity of a single temporal point is blindly elementary for representing meaningful CSI patterns. Secondly, the time-over-channel features are also important, and could be a natural data augmentation. To address the drawbacks, we propose a novel Two-stream Convolution Augmented Human Activity Transformer (THAT) model. Our model proposes to utilize a two-stream structure to capture both time-over-channel and channel-over-time features, and use the multi-scale convolution augmented transformer to capture range-based patterns. Extensive experiments on four real experiment datasets demonstrate that our model outperforms state-of-the-art models in terms of both effectiveness and efficiency.
| Year | Citations | |
|---|---|---|
Page 1
Page 1