Publication | Closed Access
Segmentation of lecture videos based on text: a method combining multiple linguistic features
49
Citations
28
References
2004
Year
Unknown Venue
Lecture VideosEngineeringMultimedia AnalysisVideo SummarizationVideo RetrievalCorpus LinguisticsText MiningSpeech RecognitionNatural Language ProcessingInformation RetrievalPattern RecognitionText SegmentationComputational LinguisticsVideo Content AnalysisTopic SegmentationLanguage StudiesManual SegmentationComputer ScienceVideo UnderstandingMultiple Linguistic FeaturesSegmentation AccuracySpeech ProcessingLinguistics
In multimedia-based e-learning systems, there are strong needs for segmenting lecture videos into topic units in order to organize the videos for browsing and to provide search capability. Automatic segmentation is highly desired because of the high cost of manual segmentation. While a lot of research has been conducted on topic segmentation of transcribed spoken text, most attempts rely on domain-specific cues and formal presentation format, and require extensive training; none of these features exist in lecture videos with unscripted and spontaneous speech. In addition, lecture videos usually have few scene changes, which imply that the visual information that most video segmentation methods rely on is not available. Furthermore, even when there are scene changes, they do not match with the topic transitions. In this paper, we make use of the transcribed speech text extracted from the audio track of video to segment lecture videos into topics. We review related research and propose a segmentation approach. Our approach utilizes features such as noun phrases and combines multiple content-based and discourse-based features. Our preliminary results show that the noun phrases are salient features and the combination of multiple features is promising to improve segmentation accuracy.
| Year | Citations | |
|---|---|---|
Page 1
Page 1