Publication | Closed Access
Development of Assamese Phonetic Engine: Some issues
21
Citations
4
References
2013
Year
Unknown Venue
Spoken Language ProcessingPhonologySpeech RecognitionLanguage DocumentationPhoneticsLanguage AcquisitionVoice RecognitionLanguage StudiesHealth SciencesSpeech SynthesisPhonetic EngineLinguisticsAssamese Phonetic EngineSpeech SignalSpeech CommunicationSpeech TechnologyPhonology MorphologyLanguage RecognitionSpeech ProcessingSpeech PerceptionSpeech Interface
The phonetic engine is a system that performs speech signal to symbol transformation. This work describes some issues in the development of an Assamese Phonetic Engine (PE). International phonetic alphabet (IPA) is used as the phonetic unit to transcribe the speech database collected in three different modes, namely, reading, lecture and conversation modes. Only reading mode data is used for training and Hidden markov model (HMM) is used to model each phonetic unit without imposing any language or contextual constraint. The trained HMMs are used to derive a sequence of phonetic units from a test speech signal. Accuracy of 47.31%, 45.30% and 36.13% is achieved in reading, lecture and conversation mode, respectively. Confusion among the phonetic units specific to Assamese are discussed. Issues related to different recording modes, language and native speaker dependencies are discussed. The speech data is also collected in Hindi from three different sets of speakers to study speaker, language and native dependancies. Accuracy of 40.5%, 36.10% and 29.61% is achieved in native speaker dependent, native speaker independent and non-native speaker independent cases, respectively.
| Year | Citations | |
|---|---|---|
Page 1
Page 1