Publication | Open Access
SLAKE: A Semantically-Labeled Knowledge-Enhanced Dataset for Medical Visual Question Answering
10
Citations
13
References
2021
Year
Artificial IntelligenceEngineeringMachine LearningCorpus LinguisticsNatural Language ProcessingMultimodal LlmText-to-image RetrievalInformation RetrievalData ScienceVisual GroundingComputational LinguisticsVisual Question AnsweringLanguage StudiesMachine TranslationMedical Visual QuestionQuestion AnsweringVision Language ModelDeep LearningSemantically-labeled Knowledge-enhanced DatasetComputer VisionAvailable DatasetVisual ReasoningLarge Bilingual DatasetLinguisticsHealth Informatics
Medical visual question answering (Med-VQA) has tremendous potential in healthcare. However, the development of this technology is hindered by the lacking of publicly-available and high-quality labeled datasets for training and evaluation. In this paper, we present a large bilingual dataset, SLAKE, with comprehensive semantic labels annotated by experienced physicians and a new structural medical knowledge base for Med-VQA. Besides, SLAKE includes richer modalities and covers more human body parts than the currently available dataset. We show that SLAKE can be used to facilitate the development and evaluation of Med-VQA systems. The dataset can be downloaded from http://www.med-vqa.com/slake.
| Year | Citations | |
|---|---|---|
Page 1
Page 1