Publication | Open Access
Analysis and processing of lecture audio data
Citations: 88
References: 12
Year: 2004
Venue: Unknown
Music, Speech Corpus, Spoken Language Processing, Language Learning, Corpus Linguistics, Speech Recognition, Natural Language Processing, Computational Linguistics, Language Acquisition, Audio Analysis, Language Studies, Health Sciences, Linguistics, Language Technology, Audio Retrieval, Speech Communication, Speech Analysis, Audio Mining, Language Model Perplexities, Language Recognition, Speech Processing, Speech Perception, Vocabulary Usage Patterns, Lecture Audio Data, Spoken Lecture Material
In this paper, we report on our recent efforts to collect a corpus of spoken lecture material that will enable research directed towards fast, accurate, and easy access to lecture content. Thus far, we have collected a corpus of 270 hours of speech from a variety of undergraduate courses and seminars. We report on an initial analysis of the spontaneous speech phenomena present in these data and of the vocabulary usage patterns across three courses. Finally, we examine the perplexities of language models trained on written and spoken materials, and describe an initial recognition experiment on one course.