Publication | Closed Access
Topic identification from audio recordings using word and phone recognition lattices
43
Citations
13
References
2007
Year
Unknown Venue
MusicEngineeringSpeech CorpusSpoken Language ProcessingCorpus LinguisticsAudio RecordingsText MiningSpeech RecognitionNatural Language ProcessingInformation RetrievalTopic IdentificationData MiningPattern RecognitionComputational LinguisticsPhoneticsAudio AnalysisLanguage StudiesAudio RetrievalComputer SciencePhone Recognition LatticesSpeech CommunicationAudio MiningSpeech Recognition LatticesMusic ClassificationTopic ModelLanguage RecognitionSpeech ProcessingAudio DocumentsLinguisticsSpeaker Recognition
In this paper, we investigate the problem of topic identification from audio documents using features extracted from speech recognition lattices. We are particularly interested in the difficult case where the training material is minimally annotated with only topic labels. Under this scenario, the lexical knowledge that is useful for topic identification may not be available, and automatic methods for extracting linguistic knowledge useful for distinguishing between topics must be relied upon. Towards this goal we investigate the problem of topic identification on conversational telephone speech from the Fisher corpus under a variety of increasingly difficult constraints. We contrast the performance of systems that have knowledge of the lexical units present in the audio data, against systems that rely entirely on phonetic processing.
| Year | Citations | |
|---|---|---|
Page 1
Page 1