Publication | Closed Access
Layout Analysis for Historical Manuscripts Using Sift Features
41
Citations
13
References
2011
Year
Unknown Venue
EngineeringComputer-aided DesignLayout EntityCorpus LinguisticsLayout AnalysisText MiningLayout Analysis MethodImage AnalysisInformation RetrievalPattern RecognitionText RecognitionDocument EngineeringText SegmentationCharacter RecognitionComputational GeometryGeometric ModelingOptical Character RecognitionComputer ScienceLayout EntitiesNatural SciencesDocument Processing
We propose a layout analysis method for historical manuscripts that relies on the part-based identification of layout entities. A layout entity -- such as letters of the text, initials or headings -- is composed of a set of characteristic segments or structures, which is dissimilar for distinct classes in the manuscripts under consideration. This fact is exploited in order to segment a manuscript page into homogeneous regions. Historical documents traditionally involve challenges such as uneven writing support and varying shapes of characters, fluctuating text lines, changing scripts and writing styles, and variance in the layout itself. Hence, a part-based detection of layout entities is proposed using a multi-stage algorithm for the localization of the entities, based on interest points. Results show that the proposed method is able to locate initials, headings and text areas in ancient manuscripts containing stains, tears and partially faded-out ink sufficiently well.
| Year | Citations | |
|---|---|---|
Page 1
Page 1