Publication | Closed Access
HMM based structuring of tennis videos using visual and audio cues
Citations: 52
References: 6
Year: 2003
Unknown Venue
Engineering, Video Processing, Multimedia Analysis, Video Summarization, Audio Cues, Video Retrieval, Speech Recognition, Image Analysis, Pattern Recognition, Video Content Analysis, Tennis Videos, Dance, Machine Vision, Video Understanding, Computer Vision, Video Analysis, Eye Tracking, Speech Processing, Arts, Hidden Markov Models, Structure Analysis
This paper focuses on the use of hidden Markov models (HMMs) for structure analysis of videos, and demonstrates how they can be efficiently applied to merge audio and visual cues. Our approach is validated in the particular domain of tennis videos. The basic temporal unit is the video shot. Visual features characterize the type of shot view, and audio features describe the audio events within a video shot. Video structure parsing relies on analysis of the temporal interleaving of video shots, with respect to prior information about tennis content and editing rules. As a result, typical tennis scenes are identified. In addition, each shot is assigned to a level in the hierarchy described in terms of point, game and set.
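To make the idea of shot-level HMM parsing concrete, the following is a minimal sketch, not the authors' implementation. It assumes each shot is reduced to one discrete visual label and one discrete audio label, fuses them by multiplying their per-state probabilities (a simplifying independence assumption), and recovers the most likely scene sequence with the standard Viterbi algorithm. The scene names (`rally`, `replay`, `break`), the observation labels, and every probability are hypothetical values chosen for illustration; the point/game/set hierarchy is not modeled here.

```python
import math

# Hypothetical scene types (hidden states); not taken from the paper.
STATES = ["rally", "replay", "break"]

# Illustrative probabilities only. TRANS plays the role of prior knowledge
# about tennis content and editing rules (e.g. a replay often follows a rally).
START = {"rally": 0.5, "replay": 0.1, "break": 0.4}
TRANS = {
    "rally":  {"rally": 0.2, "replay": 0.4, "break": 0.4},
    "replay": {"rally": 0.5, "replay": 0.1, "break": 0.4},
    "break":  {"rally": 0.6, "replay": 0.1, "break": 0.3},
}
# Per-state likelihood of the visual cue (type of shot view).
VISUAL = {
    "rally":  {"global": 0.9, "other": 0.1},
    "replay": {"global": 0.3, "other": 0.7},
    "break":  {"global": 0.1, "other": 0.9},
}
# Per-state likelihood of the audio cue (audio event within the shot).
AUDIO = {
    "rally":  {"ball_hit": 0.8, "applause": 0.1, "none": 0.1},
    "replay": {"ball_hit": 0.1, "applause": 0.3, "none": 0.6},
    "break":  {"ball_hit": 0.05, "applause": 0.45, "none": 0.5},
}


def emission_logp(state, shot):
    """Fuse both cues: log P(view | state) + log P(sound | state)."""
    view, sound = shot
    return math.log(VISUAL[state][view]) + math.log(AUDIO[state][sound])


def viterbi(shots):
    """Return the most likely state sequence for a list of (view, sound) shots."""
    scores = [{s: math.log(START[s]) + emission_logp(s, shots[0]) for s in STATES}]
    back = []
    for shot in shots[1:]:
        prev = scores[-1]
        cur, ptr = {}, {}
        for s in STATES:
            # Best predecessor for state s at this shot.
            best = max(STATES, key=lambda p: prev[p] + math.log(TRANS[p][s]))
            cur[s] = prev[best] + math.log(TRANS[best][s]) + emission_logp(s, shot)
            ptr[s] = best
        scores.append(cur)
        back.append(ptr)
    # Backtrack from the best final state.
    path = [max(STATES, key=lambda s: scores[-1][s])]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    path.reverse()
    return path


shots = [
    ("global", "ball_hit"),
    ("other", "none"),
    ("other", "applause"),
    ("global", "ball_hit"),
]
labels = viterbi(shots)
print(labels)
```

In this sketch the transition table is where editing rules would enter: a non-global shot with no ball hits is ambiguous on its own, and the decoder resolves it from the shots around it. A real system would estimate these tables from annotated video rather than set them by hand.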