Publication | Closed Access
PageNet
16
Citations
19
References
2017
Year
Unknown Venue
Document ProcessingConvolutional Neural NetworkMachine VisionMachine LearningDeep LearningImage AnalysisPattern RecognitionBorder RegionEngineeringText RecognitionScene UnderstandingDeep Learning SystemComputer ScienceMedical Image ComputingPixel-wise SegmentationImage SegmentationComputer Vision
When digitizing a document into an image, it is common to include a surrounding border region to visually indicate that the entire document is present in the image. However, this border should be removed prior to automated processing. In this work, we present a deep learning system, PageNet, which identifies the main page region in an image in order to segment content from both textual and non-textual border noise. In PageNet, a Fully Convolutional Network obtains a pixel-wise segmentation which is post-processed into a quadrilateral region. We evaluate PageNet on 4 collections of historical handwritten documents and obtain over 94% mean intersection over union on all datasets and approach human performance on 2 collections. Additionally, we show that PageNet can segment documents that are overlayed on top of other documents.
| Year | Citations | |
|---|---|---|
Page 1
Page 1