Publication | Closed Access
A Combined System for Text Line Extraction and Handwriting Recognition in Historical Documents
Citations: 16
References: 26
Year: 2014
Unknown Venue
Handwriting Recognition, Engineering, Corpus Linguistics, Text Mining, Natural Language Processing, Medieval Parzival Database, Language Documentation, Information Retrieval, Pattern Recognition, Text Recognition, Text Segmentation, Automated Reading, Character Recognition, Text Line Extraction, Machine Translation, Optical Character Recognition, Text Localization, Combined System, Text Processing, Document Processing
Automated reading of historical handwriting is needed to search and browse ancient manuscripts in digital libraries based on their textual content. In this paper, we present a combined system for text localization and transcription in page images. It includes flexible learning-based methods for layout analysis and handwriting recognition, which were developed in the context of the Swiss research project HisDoc. A comprehensive experimental evaluation is provided for the medieval Parzival database, demonstrating a promising word recognition accuracy of 93.0% with closed vocabulary. In order to harmonize the evaluation of the two document analysis tasks, we introduce a novel evaluation measure for text line extraction that takes substitution, deletion, as well as insertion errors into account.
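The abstract does not define the new text line extraction measure, so the following is only a minimal sketch of the conventional accuracy used in handwriting recognition, which likewise counts substitutions, deletions, and insertions. The function names `edit_counts` and `accuracy`, and the formula (N - S - D - I) / N, are assumptions for illustration and are not taken from the paper.

```python
def edit_counts(reference, hypothesis):
    """Return (substitutions, deletions, insertions) of a minimum-cost alignment."""
    n, m = len(reference), len(hypothesis)
    # cost[i][j] = (total_errors, S, D, I) for reference[:i] vs hypothesis[:j]
    cost = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = (0, 0, 0, 0)
    for i in range(1, n + 1):
        cost[i][0] = (i, 0, i, 0)  # only deletions
    for j in range(1, m + 1):
        cost[0][j] = (j, 0, 0, j)  # only insertions
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            t, s, d, ins = cost[i - 1][j - 1]
            if reference[i - 1] == hypothesis[j - 1]:
                diag = (t, s, d, ins)          # match, no error
            else:
                diag = (t + 1, s + 1, d, ins)  # substitution
            t, s, d, ins = cost[i - 1][j]
            delete = (t + 1, s, d + 1, ins)    # reference item missing
            t, s, d, ins = cost[i][j - 1]
            insert = (t + 1, s, d, ins + 1)    # extra hypothesis item
            # Tuples compare by total error count first.
            cost[i][j] = min(diag, delete, insert)
    _, s, d, ins = cost[n][m]
    return s, d, ins


def accuracy(reference, hypothesis):
    """Assumed accuracy definition: (N - S - D - I) / N, N = len(reference)."""
    s, d, ins = edit_counts(reference, hypothesis)
    return (len(reference) - s - d - ins) / len(reference)
```

The same functions work on any sequence of comparable items, so they could be applied to word labels in a transcription or, under a suitable matching criterion, to identifiers of extracted text lines. Because insertions are penalized, this accuracy can become negative when the hypothesis contains many spurious items.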