Publication | Closed Access
Prosody based co-analysis for continuous recognition of coverbal gestures
25
Citations
14
References
2003
Year
Unknown Venue
EngineeringMachine LearningSpeech ArticulationCorpus LinguisticsSpeech RecognitionNatural Language ProcessingImage AnalysisData SciencePattern RecognitionPhoneticsContinuous RecognitionMultimodal InteractionAutomatic RecognitionLanguage StudiesGesture ProcessingMultimodal Human Computer InterfaceAmerican Sign LanguageNatural SpeechComputer VisionSpeech CommunicationGesture RecognitionSpeech ProcessingSpeech PerceptionActivity RecognitionLinguistics
Although recognition of natural speech and gestures have been studied extensively, previous attempts at combining them in a unified framework to boost classification were mostly semantically motivated, e.g., keyword-gesture co-occurrence. Such formulations inherit the complexity of natural language processing. This paper presents a Bayesian formulation that uses a phenomenon of gesture and speech articulation for improving accuracy of automatic recognition of continuous coverbal gestures. The prosodic features from the speech signal were co-analyzed with the visual signal to learn the prior probability of co-occurrence of the prominent spoken segments with the particular kinematical phases of gestures. It was found that the above co-analysis helps in detecting and disambiguating small hand movements, which subsequently improves the rate of continuous gesture recognition. The efficacy of the proposed approach was demonstrated on a large database collected front the weather channel broadcast. This formulation opens new avenues for bottom-up frameworks of multimodal integration.
| Year | Citations | |
|---|---|---|
Page 1
Page 1