Publication | Closed Access
Audio-visual speech enhancement with AVCDCN (audio-visual codebook dependent cepstral normalization)
35
Citations
6
References
2003
Year
Unknown Venue
EngineeringSpeech CodingHealth SciencesNon-linear Enhancement TechniqueAudio Signal ProcessingAudio-visual Speech EnhancementAudio-visual Speech RecognitionSpeech EnhancementNoiseVisual InformationSpeech ProcessingRobust Speech RecognitionSpeech PerceptionDistant Speech RecognitionSignal ProcessingSpeech CommunicationSpeech TechnologySpeech Recognition
We introduce a non-linear enhancement technique called audio-visual codebook dependent cepstral normalization (AVCDCN) and we consider its use with both audio-only and audio-visual speech recognition. AVCDCN is inspired from CDCN, an audio-only enhancement technique that approximates the nonlinear effect of noise on speech with a piecewise constant function. Our experiments show that the use of visual information in AVCDCN allows significant performance gains over CDCN.
| Year | Citations | |
|---|---|---|
Page 1
Page 1