Publication | Closed Access
Lip detection for audio-visual speech recognition in-car environment
25
Citations
5
References
2010
Year
Unknown Venue
EngineeringBiometricsViola-jones AlgorithmSpeech RecognitionFace DetectionFacial Recognition SystemImage AnalysisPattern RecognitionRobust Speech RecognitionVoice RecognitionHealth SciencesMachine VisionLip DetectionDistant Speech RecognitionComputer VisionSpeech CommunicationVoiceVisual ModalityEye TrackingSpeech ProcessingSpeech InputCar Cabins
Acoustically, car cabins are extremely noisy and as a consequence audio-only, in-car voice recognition systems perform poorly. As the visual modality is immune to acoustic noise, using the visual lip information from the driver is seen as a viable strategy in circumventing this problem by using audio visual automatic speech recognition (AVASR). However, implementing AVASR requires a system being able to accurately locate and track the drivers face and lip area in real-time. In this paper we present such an approach using the Viola-Jones algorithm. Using the AVICAR [1] in-car database, we show that the Viola- Jones approach is a suitable method of locating and tracking the driver's lips despite the visual variability of illumination and head pose for audio-visual speech recognition system.
| Year | Citations | |
|---|---|---|
Page 1
Page 1