Publication | Closed Access
Learning Pixel Trajectories with Multiscale Contrastive Random Walks
34
Citations
51
References
2022
Year
Scene AnalysisEngineeringMachine LearningAction Recognition (Movement Science)Video ProcessingAction Recognition (Computer Vision)Optical FlowVideo InterpretationImage Sequence AnalysisImage AnalysisData SciencePattern RecognitionStochastic GeometryMachine VisionComputer ScienceVideo UnderstandingVideo Modeling TasksComputer VisionVideo AnalysisPixel TrajectoriesSpace-time CorrespondenceMotion Analysis
A range of video modeling tasks, from optical flow to multiple object tracking, share the same fundamental challenge: establishing space-time correspondence. Yet, approaches that dominate each space differ. We take a step to-wards bridging this gap by extending the recent contrastive random walk formulation to much denser, pixel-level spacetime graphs. The main contribution is introducing hierarchy into the search problem by computing the transition matrix between two frames in a coarse-to-fine manner, forming a multiscale contrastive random walk when ex-tended in time. This establishes a unified technique for self-supervised learning of optical flow, keypoint tracking, and video object segmentation. Experiments demonstrate that, for each of these tasks, the unified model achieves performance competitive with strong self-supervised approaches specific to that task. <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup> <sup xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">1</sup> Project page at https://jasonbian97.github.io/flowwalk
| Year | Citations | |
|---|---|---|
Page 1
Page 1