Publication | Closed Access
Robust Object Recognition with Cortex-Like Mechanisms
1.7K
Citations
61
References
2007
Year
EngineeringObject CategorizationSocial SciencesImage ClassificationImage AnalysisPattern RecognitionVision RecognitionCognitive ScienceMachine VisionComplex Visual ScenesRobust Object RecognitionDeep LearningMedical Image ComputingComputer VisionComplex SceneCategorizationObject RecognitionNeuroscienceNew General Framework
The framework relies on a universal, redundant dictionary of features capable of recognizing most object categories. The study introduces a biologically motivated hierarchical system that alternates template matching and maximum pooling to build invariant feature representations for complex visual scene recognition. The authors demonstrate the system on diverse tasks, from cluttered single‑object recognition to multiclass categorization and complex scene understanding that requires both shape‑ and texture‑based object recognition. The approach learns from few examples, competes with state‑of‑the‑art systems, and provides a plausibility proof for feedforward cortical object‑recognition models.
We introduce a new general framework for the recognition of complex visual scenes, which is motivated by biology: We describe a hierarchical system that closely follows the organization of visual cortex and builds an increasingly complex and invariant feature representation by alternating between a template matching and a maximum pooling operation. We demonstrate the strength of the approach on a range of recognition tasks: From invariant single object recognition in clutter to multiclass categorization problems and complex scene understanding tasks that rely on the recognition of both shape-based as well as texture-based objects. Given the biological constraints that the system had to satisfy, the approach performs surprisingly well: It has the capability of learning from only a few training examples and competes with state-of-the-art systems. We also discuss the existence of a universal, redundant dictionary of features that could handle the recognition of most object categories. In addition to its relevance for computer vision, the success of this approach suggests a plausibility proof for a class of feedforward models of object recognition in cortex.
| Year | Citations | |
|---|---|---|
1998 | 56.5K | |
2005 | 31.6K | |
1999 | 16.1K | |
1962 | 13.7K | |
1980 | 12.3K | |
2002 | 6.3K | |
1996 | 5.8K | |
1946 | 4.5K | |
2005 | 3.9K | |
1996 | 3.5K |
Page 1
Page 1