DeepDance: Music-to-Dance Motion Choreography With Adversarial Learning

TLDR

The creation of improvised dance choreographies is a key research area, with a central challenge being the effective probabilistic one‑to‑many mapping between music and dance to generate realistic, genre‑diverse performances. DeepDance is a GAN‑based cross‑modal framework designed to correlate music and dance motion, enabling the generation of dance sequences that match input music. Its generator predicts dance movements that fit the current music while a discriminator evaluates the performance, and motion‑consistency constraints in the loss function allow the model to produce long, realistic dance sequences; additionally, a large‑scale YouTube‑Dance3D dataset is constructed to reduce data collection costs. Experiments on existing music‑dance datasets and the new YouTube‑Dance3D dataset demonstrate that DeepDance accurately captures music‑dance correlations and can choreograph appropriate dance sequences.

Abstract

The creation of improvised dancing choreographies is an important research field of cross-modal analysis. A key point of this task is how to effectively create and correlate music and dance with a probabilistic one-to-many mapping, which is essential to create realistic dances of various genres. To address this issue, we propose a GAN-based cross-modal association framework, DeepDance, which correlates two different modalities (dance motion and music) together, aiming at creating the desired dance sequence in terms of the input music. Its generator is to predictively produce the dance movements best-fit to current music piece by learning from examples. In another hand, its discriminator acts as an external evaluation from the audience and judges the whole performance. The generated dance movements and the corresponding input music are considered to be well-matched if the discriminator cannot distinguish the generated movements from the training samples according to the estimated probability. By adding motion consistency constraints in our loss function, the proposed framework is able to create long realistic dance sequences. To alleviate the problem of expensive and inefficient data collection, we propose an effective approach to create a large-scale dataset, YouTube-Dance3D, from open data source. Extensive experiments on currently available music-dance datasets and our YouTube-Dance3D dataset demonstrate that our approach effectively captures the correlation between music and dance and can be used to choreograph appropriate dance sequences.

References

Page 1

	Year	Citations

Page 1