Publication | Open Access
Fractal dimensions of speech sounds: Computation and application to automatic speech recognition
123
Citations
11
References
1999
Year
EngineeringSpeech TurbulenceAcoustic ModelingSpeech RecognitionSpeech SegmentationPhoneticsAudio AnalysisRobust Speech RecognitionVoice RecognitionAcoustic Signal ProcessingHealth SciencesSpeech SoundsSignal ProcessingSpeech AnalysisSpeech CommunicationSpeech TechnologyAutomatic Speech RecognitionFractal DimensionsSpeech ProcessingFractal ModelsSpeech Perception
The dynamics of airflow during speech production may often result in some small or large degree of turbulence. In this paper, the geometry of speech turbulence as reflected in the fragmentation of the time signal is quantified by using fractal models. An efficient algorithm for estimating the short-time fractal dimension of speech signals based on multiscale morphological filtering is described, and its potential for speech segmentation and phonetic classification discussed. Also reported are experimental results on using the short-time fractal dimension of speech signals at multiple scales as additional features in an automatic speech-recognition system using hidden Markov models, which provide a modest improvement in speech-recognition performance.
| Year | Citations | |
|---|---|---|
Page 1
Page 1