Concepedia

Publication | Closed Access

Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network

1.7K

Citations

31

References

2017

Year

TLDR

Recent network designs stack small filters for efficiency, yet large kernels are crucial for dense per‑pixel prediction in semantic segmentation. The authors propose a Global Convolutional Network to jointly address classification and localization in semantic segmentation. The architecture incorporates a residual‑based boundary refinement module to sharpen object boundaries. The method achieves state‑of‑the‑art results, reaching 82.2 % on PASCAL VOC 2012 and 76.9 % on Cityscapes, surpassing prior work.

Abstract

One of recent trends [31, 32, 14] in network architecture design is stacking small filters (e.g., 1×1 or 3×3) in the entire network because the stacked small filters is more efficient than a large kernel, given the same computational complexity. However, in the field of semantic segmentation, where we need to perform dense per-pixel prediction, we find that the large kernel (and effective receptive field) plays an important role when we have to perform the classification and localization tasks simultaneously. Following our design principle, we propose a Global Convolutional Network to address both the classification and localization issues for the semantic segmentation. We also suggest a residual-based boundary refinement to further refine the object boundaries. Our approach achieves state-of-art performance on two public benchmarks and significantly outperforms previous results, 82.2% (vs 80.2%) on PASCAL VOC 2012 dataset and 76.9% (vs 71.8%) on Cityscapes dataset.

References

YearCitations

Page 1