Publication | Closed Access
ICDAR2017 Robust Reading Challenge on Omnidirectional Video
Citations: 20
References: 13
Year: 2017
Unknown Venue
Engineering, Video Processing, Video Retrieval, Corpus Linguistics, Natural Language Processing, Multimodal LLM, Image Analysis, Pattern Recognition, Text Recognition, Computational Imaging, Text Localisation Task, Machine Translation, Machine Vision, Optical Character Recognition, Computer Science, Multimodal Translation, Computer Vision, Robust Reading Challenge, Scene Understanding, ICDAR 2017, Linguistics, Omnidirectional Video
Results of the ICDAR 2017 Robust Reading Challenge on Omnidirectional Video are presented. This competition uses the Downtown Osaka Scene Text (DOST) dataset, which was captured in Osaka, Japan with an omnidirectional camera. Hence, it consists of sequential images (videos) taken at different view angles. Regarding the sequential images as videos (video mode), two tasks are prepared: localisation and end-to-end recognition. Regarding them as a set of still images (still image mode), three tasks are prepared: localisation, cropped word recognition and end-to-end recognition. As the dataset was captured in Japan, it contains Japanese text but also includes text consisting of alphanumeric characters (Latin text). Hence, a submitted result for each task is evaluated in three ways: using Japanese-only ground truth (GT), using Latin-only GT, and using the combined GT of both. By the submission deadline, we had received two submissions in the text localisation task of the still image mode. We intend to continue the competition in open mode. Expecting further submissions, in this report we provide baseline results for all tasks in addition to the submissions from the community.
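The three-way evaluation described in the abstract can be illustrated with a minimal sketch that partitions ground-truth words into Japanese-only, Latin-only and combined sets. This is only an assumption about how such a split might be made: the function names `script_of` and `split_ground_truth` are hypothetical, and the rule used here (a word counts as Japanese if it contains any kana or CJK character) is not taken from the competition's own annotation or evaluation protocol.

```python
import unicodedata


def script_of(word):
    """Return 'japanese' if any character is kana or CJK, else 'latin'.

    Assumption: this character-name heuristic stands in for the
    dataset's real script labels, which are not described here.
    """
    for ch in word:
        name = unicodedata.name(ch, "")
        if name.startswith(("HIRAGANA", "KATAKANA", "CJK")):
            return "japanese"
    return "latin"


def split_ground_truth(words):
    """Build the three ground-truth sets used for evaluation."""
    japanese = [w for w in words if script_of(w) == "japanese"]
    latin = [w for w in words if script_of(w) == "latin"]
    # The combined set is simply every annotated word.
    return {"japanese": japanese, "latin": latin, "combined": list(words)}


if __name__ == "__main__":
    gt = split_ground_truth(["大阪", "Osaka", "カフェ", "24"])
    for subset, words in gt.items():
        print(subset, words)
```

A submission would then be scored once against each of the three sets, so that a method tuned for Latin text is not penalised for missing Japanese words in the Latin-only evaluation.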