Publication | Closed Access
Voice search of structured media data
27
Citations
9
References
2009
Year
Unknown Venue
MusicEngineeringCorpus LinguisticsSpeech RecognitionNatural Language ProcessingInformation RetrievalData ScienceComputational LinguisticsPhoneticsRobust Speech RecognitionVoice RecognitionStructured DatabaseLanguage StudiesVoice SearchVoice Search ApplicationsAudio RetrievalAudio MiningSpeech ProcessingLinguisticsMusic MetadataMultimedia Search
This paper addresses the problem of using unstructured queries to search a structured database in voice search applications. By incorporating structural information in music metadata, the end-to-end search error has been reduced by 15% on text queries and up to 11% on spoken queries. Based on that, an HMM sequential rescoring model has reduced the error rate by 28% on text queries and up to 23% on spoken queries compared to the baseline system. Furthermore, a phonetic similarity model has been introduced to compensate speech recognition errors, which has improved the end-to-end search accuracy consistently across different levels of speech recognition accuracy.
| Year | Citations | |
|---|---|---|
Page 1
Page 1