Publication | Closed Access
The Animation Transformer: Visual Correspondence via Segment Matching
29
Citations
32
References
2021
Year
Animation TransformerScene AnalysisEngineeringVisual CorrespondenceVisual CorrespondencesImage AnalysisPattern RecognitionRobot LearningSynthetic Image GenerationGeometric ModelingMachine VisionStructure From MotionHuman Image SynthesisDeep LearningComputer VisionNatural SciencesComputer Stereo VisionScene UnderstandingMulti-view GeometryScene Modeling
Visual correspondence is a fundamental building block on the way to building assistive tools for hand-drawn animation. However, while a large body of work has focused on learning visual correspondences at the pixel-level, few approaches have emerged to learn correspondence at the level of line enclosures (segments) that naturally occur in hand-drawn animation. Exploiting this structure in animation has numerous benefits: it avoids the memory complexity of pixel attention over high resolution images and enables the use of real-world animation datasets that contain correspondence information at the level of per-segment colors. To that end, we propose the Animation Transformer (AnT) which uses a Transformer-based architecture to learn the spatial and visual relationships between segments across a sequence of images. By leveraging a forward match loss and a cycle consistency loss our approach attains excellent results compared to state-of-the-art pixel approaches on challenging datasets from real animation productions that lack ground-truth correspondence labels.
| Year | Citations | |
|---|---|---|
Page 1
Page 1