Publication | Closed Access
Multi-objective adversarial gesture generation
82
Citations
22
References
2019
Year
Unknown Venue
Artificial IntelligenceEngineeringMachine LearningSpeech RecognitionData ScienceMotion CaptureVirtual RealityGesture MotionRobot LearningGesture ProcessingHealth SciencesDanceMotion SynthesisComputer ScienceHuman Image SynthesisDeep LearningSpeech CommunicationGesture RecognitionConversational Virtual AgentsRealistic Gesture DynamicsHuman-computer InteractionSpeech Processing
Applications for conversational virtual agents are on the rise, but producing realistic non-verbal behavior for spoken utterances remains an unsolved problem. We explore the use of a generative adversarial training paradigm to map speech to 3D gesture motion. We define the gesture generation problem as a series of smaller sub-problems, including plausible gesture dynamics, realistic joint configurations, and diverse and smooth motion. Each sub-problem is monitored by separate adversaries. For the problem of enforcing realistic gesture dynamics in our output, we train a classifier to automatically detect gesture phases. We find adversarial training to be superior to the use of a standard regression loss and discuss the benefit of each of our training objectives. We recorded a dataset of over 6 hours of natural, unrehearsed speech with high-quality motion capture, as well as audio and video recording.
| Year | Citations | |
|---|---|---|
Page 1
Page 1