Publication | Closed Access
A deep learning pipeline for semantic facade segmentation
19
Citations
0
References
2017
Year
Unknown Venue
Convolutional Neural NetworkScene AnalysisMachine VisionImage AnalysisMachine LearningData SciencePattern RecognitionDeep Learning PipelineBuilding FacadesPixel-wise ClassificationRestricted Boltzmann MachinesScene UnderstandingScene InterpretationComputer ScienceEngineeringDeep LearningScene ModelingComputer Vision
We propose an algorithm that provides a pixel-wise classification of building facades. Building facades provide a rich environment for testing semantic segmentation techniques. They come in a variety of styles that reflect both appearance and layout characteristics. On the other hand, they exhibit a degree of stability in the arrangement of structures across different instances. We integrate appearance and layout cues in a single framework. The most likely label based on appearance is obtained through applying the state-of-the-art deep convolution networks. This is further optimized through Restricted Boltzmann Machines (RBM), applied on vertical and horizontal scanlines of facade models. Learning the probability distributions of the models via the RBMs is utilized in two settings. Firstly, we use them in learning from pre-seen facade samples, in the traditional training sense. Secondly, we learn from the test image at hand, in a way the allows the transfer of visual knowledge of the scene from correctly classified areas to others. Experimentally, we are on par with the reported performance results. However, we do not explicitly specify any hand-engineered features that are architectural scene dependent, nor do we include any dataset specific heuristics/thresholds.