Publication | Closed Access
End-To-End Accent Conversion Without Using Native Utterances
34
Citations
22
References
2020
Year
Unknown Venue
EngineeringPhonologySpeech RecognitionNatural Language ProcessingPhoneticsComputational LinguisticsSpeech Synthesis ModelVoice RecognitionLanguage StudiesMachine TranslationSpeech SynthesisNative UtterancesLinguisticsSpeech OutputAccent ConversionSpeech CommunicationSpeech TechnologyOnline ConversionSpeech ProcessingSpeech PerceptionSpeech Translation
Techniques for accent conversion (AC) aim to convert non-native to native accented speech. Conventional AC methods try to convert only the speaker identity of a native speaker's voice to that of the non-native accented target speaker, leaving the underlying content and pronunciations unchanged. This hinders their practical use in real-world applications, because native-accented utterances are required at conversion stage. In this paper, we present an end-to-end framework, which is able to conduct AC from non-native-accented utterances without using any native-accented utterances during online conversion. We achieve this by independently extracting linguistic and speaker representations from non-native accented speech and condition a speech synthesis model on these representations to generate native-accented speech. Experiments on open-source data corpora show that the proposed system can convert Hindi-accented English speech into native American English speech with high naturalness, which is indistinguishable from native-accented recordings in terms of accent.
| Year | Citations | |
|---|---|---|
Page 1
Page 1