Publication | Closed Access
Cross-lingual voice conversion-based polyglot speech synthesizer for indian languages
13
Citations
11
References
2014
Year
Unknown Venue
Speech SciencesIndian LanguagesGmm TopologySpeech CorpusMultilingualismVoice EvaluationCorpus LinguisticsSpeech RecognitionSpeaker IdentificationVoice RecognitionLanguage StudiesAcoustic AnalysisSpoken Language UnderstandingHealth SciencesSpeech SynthesisSpeech OutputSpeech CommunicationSpeech TechnologyVoiceSpeech AcousticsSpeech ProcessingSpeech PerceptionPolyglot Speech CorpusPolyglot Speech SynthesizerLinguistics
A polyglot speech synthesizer, synthesizes speech for any given monolingual or multilingual text, in a single speaker’s voice. In this regard, a polyglot speech corpus is required. It is difficult to find a speaker proficient in multiple languages. Therefore, in the current work, by exploiting the acoustic similarity of phonemes across Indian languages, a polyglot speech corpus is obtained for four Indian languages and Indian English, using GMM-based cross-lingual voice conversion. The optimum target speaker and GMM topology is chosen based on the performance of a speaker identification system. It is observed that, the language that shares the most number of phonemes with the other languages, serves as the best target. A polyglot speech corpus derived in this target speaker’s voice, is further used to develop an HMM-based polyglot speech synthesizer. The performance of this synthesizer is evaluated in terms of speaker identity using ABX listening test, quality using mean opinion score (MOS) and speaker switching using subjective listening test.
| Year | Citations | |
|---|---|---|
Page 1
Page 1