
Publication | Closed Access

Robust prosodic features for speaker identification

Citations: 86

References: 7

Year: 2002

Abstract

The paper describes the use of prosodic features for speaker identification. Features based on the pitch and energy contours of speech are described, and the relative importance of each feature for speaker identification is investigated. The mean and variance of the pitch period in voiced sections of speech are shown to be particularly useful for discriminating between speakers. Fusing these features with a hidden Markov model speaker identification system gave a marked improvement in figure of merit; a gain of over 30% was achieved on the six NIST 1995 evaluation tests presented. Handset variability is known to have an adverse effect on performance when traditional spectral features, such as cepstra, are used. Results are presented showing that the prosodic features are more robust to handset variability.
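As a rough illustration of the two pitch statistics the abstract highlights, the sketch below computes the mean and variance of the pitch period over voiced frames only. It is not the paper's implementation: the function name `pitch_period_stats`, the per-frame F0 input in Hz, the convention that a value of 0 marks an unvoiced frame, and the choice of milliseconds and population variance are all assumptions made for this example.

```python
from statistics import fmean, pvariance


def pitch_period_stats(f0_hz):
    """Return (mean, variance) of the pitch period in ms over voiced frames.

    f0_hz: per-frame F0 estimates in Hz. A value <= 0 is treated as an
    unvoiced frame (an assumed convention, not taken from the paper).
    """
    # Convert each voiced frame's F0 (Hz) to a pitch period (ms).
    periods_ms = [1000.0 / f for f in f0_hz if f > 0]
    if not periods_ms:
        raise ValueError("no voiced frames in contour")
    # Population variance is used here; the paper's abstract does not
    # say which estimator it uses.
    return fmean(periods_ms), pvariance(periods_ms)


# Hypothetical contour: two unvoiced frames, four voiced frames.
contour = [0.0, 100.0, 125.0, 0.0, 100.0, 125.0]
mean_ms, var_ms2 = pitch_period_stats(contour)
```

In a system like the one described, a pair of values such as this would be one small part of a per-utterance prosodic feature vector; how those features are scored and fused with the hidden Markov model system is not specified in the abstract and is not modelled here.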
