Publication | Closed Access
Explore Faster Localization Learning For Scene Text Detection
17
Citations
24
References
2023
Year
Unknown Venue
Convolutional Neural NetworkEngineeringMachine LearningLocalizationFaster Localization LearningText ProposalsImage AnalysisText-to-image RetrievalAccurate Text LocalizationPattern RecognitionText RecognitionText SegmentationVideo TransformerMachine VisionObject DetectionComputer ScienceDeep LearningComputer VisionProposed Fanet
Generally, pre-training and long-time training computation are necessary for obtaining a good-performance text detector based on deep networks. In this paper, we present a new scene text detection network (called FANet) with a Fast convergence speed and Accurate text localization. The proposed FANet is an end-to-end text detector based on transformer feature learning and normalized Fourier descriptor modeling, where the Fourier Descriptor Proposal Network and Iterative Text Decoding Network are designed to efficiently and accurately identify text proposals. Additionally, a Dense Matching Strategy and a well-designed loss function are also proposed for optimizing the network performance. Extensive experiments are carried out to demonstrate that the proposed FANet can achieve the SOTA performance with fewer training epochs and no pretraining. When we introduce additional data for pre-training, the proposed FANet can achieve SOTA performance on MSRA-TD500, CTW1500, and TotalText. The ablation experiments also verify the effectiveness of our contributions. Code is available at https://github.com/callsys/FANet.
| Year | Citations | |
|---|---|---|
Page 1
Page 1