Publication | Closed Access
<i>AnyLoc</i>: Towards Universal Visual Place Recognition
136
Citations
33
References
2023
Year
Artificial IntelligenceEngineeringMachine LearningRobot LocalizationField RoboticsIntelligent RoboticsAutonomous SystemsVisual Place RecognitionLocalizationImage AnalysisData SciencePattern RecognitionAutonomous VehiclesRobot LearningUniversal Vpr SolutionVision RecognitionRobotics PerceptionMachine VisionVision RoboticsComputer Science3D Object RecognitionComputer VisionScene UnderstandingRobotics
Visual Place Recognition (VPR) is vital for robot localization. To date, the most performant VPR approaches are <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">environment- and task-specific:</i> while they exhibit strong performance in structured environments (predominantly urban driving), their performance degrades severely in unstructured environments, rendering most approaches brittle to robust real-world deployment. In this work, we develop a <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">universal</i> solution to VPR – a technique that works across a broad range of structured and unstructured environments (urban, outdoors, indoors, aerial, underwater, and subterranean environments) without any re-training or finetuning. We demonstrate that general-purpose feature representations derived from off-the-shelf self-supervised models <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">with no VPR-specific training</i> are the right substrate upon which to build such a universal VPR solution. Combining these derived features with <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">unsupervised feature aggregation</i> enables our suite of methods, <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">AnyLoc</i> , to achieve up to <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><tex-math notation="LaTeX">$4\times$</tex-math></inline-formula> significantly higher performance than existing approaches. We further obtain a 6% improvement in performance by characterizing the semantic properties of these features, uncovering unique <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">domains</i> which encapsulate datasets from similar environments. Our detailed experiments and analysis lay a foundation for building VPR solutions that may be deployed <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">anywhere</i> , <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">anytime</i> , and across <italic xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">anyview</i> .
| Year | Citations | |
|---|---|---|
Page 1
Page 1