Concepedia

Publication | Closed Access

A hybrid page segmentation method

16

Citations

5

References

2002

Year

Abstract

A method of page segmentation using field separators and white streams is described and applied to the layout analysis of various types of printed pages which may have horizontal and vertical textlines. In complex page layouts, text columns which are printed closely together are often separated by thin black lines (field separators) or long white spaces (white streams). These separators are first extracted by horizontal and vertical scanning of a page, and then a global partitioning of the page into blocks is performed. Next in each block, black connected components are merged into textlines along the directions of separators horizontally or vertically. In experimental trials on various types of page layouts, such techniques produced robust and fast results.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">&gt;</ETX>

References

YearCitations

Page 1