Concepedia

Publication | Closed Access

Multimodal Motion Prediction with Stacked Transformers

412

Citations

25

References

2021

Year

TLDR

Predicting multiple plausible future trajectories of nearby vehicles is essential for autonomous driving safety, yet existing methods struggle because latent features bias toward the most frequent mode and proposal‑based approaches depend heavily on prior knowledge. We propose mmTransformer, a novel transformer framework for multimodal motion prediction. mmTransformer uses stacked transformers to model multimodality at the feature level with fixed independent proposals, and a region‑based training strategy induces diversity among the generated trajectories. On the Argoverse dataset, mmTransformer achieves state‑of‑the‑art performance, substantially improving trajectory diversity and accuracy. Demo video and code are available at https://decisionforce.github.io/mmTransformer.

Abstract

Predicting multiple plausible future trajectories of the nearby vehicles is crucial for the safety of autonomous driving. Recent motion prediction approaches attempt to achieve such multimodal motion prediction by implicitly regularizing the feature or explicitly generating multiple candidate proposals. However, it remains challenging since the latent features may concentrate on the most frequent mode of the data while the proposal-based methods depend largely on the prior knowledge to generate and select the proposals. In this work, we propose a novel transformer framework for multimodal motion prediction, termed as mmTransformer. A novel network architecture based on stacked transformers is designed to model the multimodality at feature level with a set of fixed independent proposals. A region-based training strategy is then developed to induce the multimodality of the generated proposals. Experiments on Argoverse dataset show that the proposed model achieves the state-of-the-art performance on motion prediction, substantially improving the diversity and the accuracy of the predicted trajectories. Demo video and code are available at https://decisionforce.github.io/mmTransformer.

References

YearCitations

Page 1