Publication | Closed Access
Detection of human speech in structured noise
72
Citations
8
References
2002
Year
Unknown Venue
EngineeringMachine LearningStructured NoiseSpeech RecognitionData SciencePattern RecognitionNoiseAudio AnalysisRobust Speech RecognitionAudio Signal AnalysisVoice RecognitionAcoustic AnalysisSpeech Signal AnalysisHealth SciencesComputer ScienceAcoustic SignalRadial Basis FunctionDistant Speech RecognitionSignal ProcessingSpeech CommunicationSpeech AcousticsSpeech ProcessingBinary DecisionSpeech InputSpeech PerceptionLinguisticsSpeaker Recognition
This paper describes research to develop an efficient system that provides a binary decision as to the presence of speech in a short (one to three second) time sample of an acoustic signal. A method which is efficient and reliably detects human speech in the presence of structured noise (such as wind, music, traffic sounds, etc.) is described. Two separate algorithms were developed. The first algorithm detects the presence of speech by testing for concave and/or convex formant shapes. The second algorithm is a statistical pattern classifier utilizing radial basis function (RBF) networks with mel-cepstra feature vectors. Classification errors are not consistent across these two different methods. As a consequence, we plan to reduce our error rate by fusion of these methods.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>
| Year | Citations | |
|---|---|---|
Page 1
Page 1