Publication | Closed Access
Encoder–decoder network for pixel‐level road crack detection in black‐box images
413
Citations
52
References
2019
Year
Timely monitoring of pavement cracks is essential for road maintenance, yet black‑box cameras—widely used and affordable—make crack detection difficult because images often contain non‑road objects. The authors propose a pixel‑level detection method for identifying road cracks in black‑box images using a deep convolutional encoder–decoder network. The encoder–decoder network uses residual‑network convolutional layers to extract crack features and deconvolutional layers to localize them, and was trained on 427 of 527 black‑box images and tested on the remaining 100. ResNet‑152 with transfer learning achieved the best performance, with recall 71.98 %, precision 77.68 %, and IoU 59.65 %, proving the method optimal for pixel‑level crack detection in black‑box images.
Abstract Timely monitoring of pavement cracks is essential for successful maintenance of road infrastructure. Accurate information concerning crack location and severity enables proactive management of the infrastructure. Black‐box cameras, which are becoming increasingly widespread at an affordable price, can be used as efficient road‐image collectors over a wide area. However, the cracks in these images are difficult to detect, because the images containing them often include objects other than roads. Thus, we propose a pixel‐level detection method for identifying road cracks in black‐box images using a deep convolutional encoder–decoder network. The encoder consists of convolutional layers of the residual network for extracting crack features, and the decoder consists of deconvolutional layers for localizing the cracks in an input image. The proposed network was trained on 427 out of 527 images extracted from black‐box videos and tested on the remaining 100 images. Compared with VGG‐16, ResNet‐50, ResNet‐101, ResNet‐200 with transfer learning, and ResNet‐152 without transfer learning, ResNet‐152 with transfer learning exhibited the best performance, achieving recall, precision, and intersection of union of 71.98%, 77.68%, and 59.65%, respectively. The experimental results prove that the proposed method is optimal for detecting cracks in black‐box images at the pixel level.
| Year | Citations | |
|---|---|---|
Page 1
Page 1