Publication | Closed Access
End-to-End Text Recognition Using Local Ternary Patterns, MSER and Deep Convolutional Nets
35
Citations
22
References
2014
Year
Unknown Venue
Convolutional Neural NetworkEngineeringMachine LearningSpeech RecognitionNatural Language ProcessingImage ClassificationImage AnalysisText-to-image RetrievalPattern RecognitionText RecognitionCharacter RecognitionIcdar 2003Machine VisionOptical Character RecognitionObject DetectionDeep LearningNatural Scene ImagesComputer VisionText ProcessingDeep Convolutional Nets
Text recognition in natural scene images is an application for several computer vision applications like licence plate recognition, automated translation of street signs, help for visually impaired people or image retrieval. In this work an end-to-end text recognition system is presented. For detection an AdaBoost ensemble with a modified Local Ternary Pattern (LTP) feature-set with a post-processing stage build upon Maximally Stable Extremely Region (MSER) is used. The text recognition is done using a deep Convolution Neural Network (CNN) trained with backpropagation. The system presented outperforms state of the art methods on the ICDAR 2003 dataset in the text-detection (F-Score: 74.2%), dictionary-driven cropped-word recognition (F-Score: 87.1%) and dictionary-driven end-to-end recognition (F-Score: 72.6%) tasks.
| Year | Citations | |
|---|---|---|
Page 1
Page 1