Publication | Open Access
CoNLL-X shared task on multilingual dependency parsing
976
Citations
56
References
2006
Year
Unknown Venue
Syntactic ParsingEngineeringDependency LinguisticsLanguage LearningCorpus LinguisticsText MiningNatural Language ProcessingApplied LinguisticsSyntaxData ScienceComputational LinguisticsGrammarLanguage StudiesMulti-lingual ParsingMachine TranslationSemantic ParsingShallow ParsingParsingTreebanksMultilingual DependencyTenth ConllParsing PerformanceLinguistics
The CoNLL shared task provides a common benchmark for evaluating multilingual dependency parsers, and the tenth iteration focused on parsing across 13 languages, summarizing participants’ approaches and results. This study converts 13 language treebanks into a unified dependency format, measures parsing performance, and seeks to identify factors that influence multilingual parsing difficulty. The authors standardize treebanks into a common dependency format and evaluate parsing accuracy across languages.
Each year the Conference on Computational Natural Language Learning (CoNLL) features a shared task, in which participants train and test their systems on exactly the same data sets, in order to better compare systems. The tenth CoNLL (CoNLL-X) saw a shared task on Multilingual Dependency Parsing. In this paper, we describe how treebanks for 13 languages were converted into the same dependency format and how parsing performance was measured. We also give an overview of the parsing approaches that participants took and the results that they achieved. Finally, we try to draw general conclusions about multi-lingual parsing: What makes a particular language, treebank or annotation scheme easier or harder to parse and which phenomena are challenging for any dependency parser?
| Year | Citations | |
|---|---|---|
Page 1
Page 1