Publication | Closed Access
Cross-Domain and Cross-Language Portability of Acoustic Features Estimated by Multilayer Perceptrons
109
Citations
18
References
2006
Year
Unknown Venue
Phone ClassificationEngineeringMachine LearningSpoken Language ProcessingMultilayer PerceptronsMlp FeaturesAcoustic ModelingSpeech RecognitionNatural Language ProcessingPhoneticsRobust Speech RecognitionCross-language PortabilityVoice RecognitionLanguage StudiesComputer ScienceDeep LearningDistant Speech RecognitionSpeech CommunicationSpeech TechnologyMulti-speaker Speech RecognitionAcoustic FeaturesSpeech ProcessingSpeech InputSpeech PerceptionLinguistics
Recent results with phone-posterior acoustic features estimated by multilayer perceptrons (MLPs) have shown that such features can effectively improve the accuracy of state-of-the-art large vocabulary speech recognition systems. MLP features are trained discriminatively to perform phone classification and are therefore, like acoustic models, tuned to a particular language and application domain. In this paper we investigate how portable such features are across domains and languages. We show that even without retraining, English-trained MLP features can provide a significant boost to recognition accuracy in new domains within the same language, as well as in entirely different languages such as Mandarin and Arabic. We also show the effectiveness of feature-level adaptation in porting MLP features to new domains.
| Year | Citations | |
|---|---|---|
Page 1
Page 1