Publication | Closed Access
Multi-sensor fusion for sport genre classification of user generated mobile videos
10
Citations
11
References
2013
Year
Unknown Venue
EngineeringMachine LearningBiometricsWearable TechnologySport GenreMultimedia AnalysisRobust Multimodal ApproachVideo RetrievalImage AnalysisData SciencePattern RecognitionMultimodal Sensor FusionVideo Content AnalysisMachine VisionMobile VideosMultimodal Signal ProcessingVideo UnderstandingSport Genre ClassificationDeep LearningMulti-sensor FusionComputer VisionDifferent Modality SetsActivity Recognition
We present a robust multimodal approach for classifying the sport genre in videos recorded by mobile phone users at a sport event. In addition to traditional audio-visual content analysis tools, we propose to analyze auxiliary sensor data (electronic compass data and accelerometer data) captured simultaneously with the video recording. By means of machine learning techniques, we build models of visual appearance, camera motion (from auxiliary sensor data) and audio scene, which are used for classifying the data from each modality. The sport genre is obtained by fusing the information provided by the models. We propose to use the quality of each modality as an indication of its reliability. Extensive experiments were performed on real test data collected at public sport events. We provide comparisons on the use of different modality sets and fusion methods. Finally, we show how the proposed methods achieve robust classification even in the considered unconstrained scenarios.
| Year | Citations | |
|---|---|---|
Page 1
Page 1