Concepedia

TLDR

SemEval‑2023 Task 2 focused on fine‑grained multilingual named‑entity recognition and was a popular challenge within the SemEval community. The task evaluated methods for detecting complex entity types such as WRITTENWORK, VEHICLE, and MUSICALGRP across 12 languages in monolingual, multilingual, and noisy settings. Evaluation was conducted on the MultiCoNER V2 dataset, which contains 2.2 million instances in Bangla, Chinese, English, Farsi, French, German, Hindi, Italian, Portuguese, Spanish, Swedish, and Ukrainian. The competition received 842 submissions from 47 teams, 34 of which submitted system papers. Results revealed that media titles and product names were the most difficult entity types, transformer models augmented with external knowledge performed best, noisy data caused an average 10% performance drop, and further research is needed to improve robustness on complex entities.

Abstract

We present the findings of SemEval-2023 Task 2 on Fine-grained Multilingual Named Entity Recognition (MultiCoNER 2). Divided into 13 tracks, the task focused on methods to identify complex fine-grained named entities (like WRITTENWORK, VEHICLE, MUSICALGRP) across 12 languages, in both monolingual and multilingual scenarios, as well as noisy settings. The task used the MultiCoNER V2 dataset, composed of 2.2 million instances in Bangla, Chinese, English, Farsi, French, German, Hindi, Italian, Portuguese, Spanish, Swedish, and Ukrainian. MultiCoNER 2 was one of the most popular tasks of SemEval-2023. It attracted 842 submissions from 47 teams, and 34 teams submitted system papers. Results showed that complex entity types such as media titles and product names were the most challenging. Methods fusing external knowledge into transformer models achieved the best performance, and the largest gains were on the Creative Work and Group classes, which remain challenging even with external knowledge. Some fine-grained classes proved more challenging than others, such as SCIENTIST, ARTWORK, and PRIVATECORP. We also observed that noisy data has a significant impact on model performance, with an average drop of 10% on the noisy subset. The task highlights the need for future research on improving NER robustness on noisy data containing complex entities.
