Publication | Closed Access
HMM-based emphatic speech synthesis using unsupervised context labeling
Citations: 19
References: 10
Year: 2011
Venue: Unknown
Engineering, Spoken Language Processing, Corpus Linguistics, Speech Recognition, Natural Language Processing, Unsupervised Context Labeling, Phonetics, Computational Linguistics, Speech Interface, Language Studies, Emphasis Context, Machine Translation, Speech Synthesis, Speech Output, Expressive Speech, Computer Science, Text-to-speech, Speech Communication, Speech Technology, Emphasis Labeling, Speech Processing, Speech Perception, Linguistics
This paper describes an approach to HMM-based expressive speech synthesis that does not require any supervised labeling of emphasis context. We use appealing-style speech whose sentences were taken from real domains. To reduce the cost of labeling speech data with emphasis context for model training, we propose an unsupervised technique for labeling emphasis context based on the difference between the original and generated F0 patterns of the training sentences. Although the criterion for emphasis labeling is quite simple, subjective evaluation results reveal that the unsupervised labeling is comparable to careful manual labeling by a human in terms of speech naturalness and emphasis reproducibility.
Index Terms: HMM-based speech synthesis, expressive speech, emphasis expression, unsupervised labeling, F0 generation
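The labeling idea in the abstract, comparing the original F0 pattern with the F0 pattern generated by a model, can be illustrated with a minimal sketch. The abstract does not give the actual criterion, so everything specific here is an assumption for illustration: the segmentation unit (each inner list stands for one segment, such as a phrase), the use of mean log-F0 over voiced frames, the convention that unvoiced frames are stored as 0, the threshold value, and the hypothetical function names `mean_log_f0` and `label_emphasis`.

```python
import math
from typing import List, Optional, Sequence


def mean_log_f0(f0_hz: Sequence[float]) -> Optional[float]:
    """Mean log-F0 over voiced frames; unvoiced frames are assumed to be 0."""
    voiced = [math.log(f) for f in f0_hz if f > 0]
    if not voiced:
        return None  # no voiced frames in this segment
    return sum(voiced) / len(voiced)


def label_emphasis(
    original: Sequence[Sequence[float]],
    generated: Sequence[Sequence[float]],
    threshold: float = 0.1,  # assumed value, not taken from the paper
) -> List[bool]:
    """Mark a segment as emphasized when the original mean log-F0 exceeds
    the generated mean log-F0 by more than `threshold`.

    This is an illustrative criterion only; the paper's exact rule is not
    stated in the abstract.
    """
    labels = []
    for orig_seg, gen_seg in zip(original, generated):
        orig_mean = mean_log_f0(orig_seg)
        gen_mean = mean_log_f0(gen_seg)
        if orig_mean is None or gen_mean is None:
            labels.append(False)  # cannot compare fully unvoiced segments
        else:
            labels.append(orig_mean - gen_mean > threshold)
    return labels


# Toy F0 values in Hz for three segments (made-up numbers).
original_f0 = [[200.0, 210.0, 0.0], [120.0, 118.0], [0.0, 0.0]]
generated_f0 = [[150.0, 155.0, 150.0], [121.0, 119.0], [100.0, 100.0]]
labels = label_emphasis(original_f0, generated_f0)
```

In this toy input only the first segment is labeled as emphasized, because its original F0 sits well above the generated F0. In a real pipeline the resulting labels would be added as a context feature for retraining, but that step is outside this sketch.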