Publication | Closed Access
An Information Extraction Model for Unconstrained Handwritten Documents
21
Citations
10
References
2010
Year
Unknown Venue
EngineeringMachine LearningStatistical Shallow ParsingInformation Extraction ModelCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalData SciencePattern RecognitionText RecognitionComputational LinguisticsText SegmentationCharacter RecognitionText Line ShallowOptical Character RecognitionKnowledge DiscoveryComputer ScienceUnconstrained Handwritten DocumentsData ExtractionDocument Processing
In this paper, a new information extraction system by statistical shallow parsing in unconstrained handwritten documents is introduced. Unlike classical approaches found in the literature as keyword spotting or full document recognition, our approach relies on a strong and powerful global handwriting model. A entire text line is considered as an indivisible entity and is modeled with Hidden Markov Models. In this way, text line shallow parsing allows fast extraction of the relevant information in any document while rejecting at the same time irrelevant information. First results are promising and show the interest of the approach.
| Year | Citations | |
|---|---|---|
Page 1
Page 1