Publication | Closed Access
Multi-criteria-based active learning for named entity recognition
233
Citations
19
References
2004
Year
Unknown Venue
Data AnnotationEngineeringMachine LearningAutomatic Annotation ToolCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalData ScienceData MiningComputational LinguisticsEntity RecognitionLanguage StudiesNamed-entity RecognitionMachine TranslationHuman Annotation EffortsMultiple CriteriaEntity DisambiguationKnowledge DiscoveryComputer ScienceInformation ExtractionAnnotation ToolLinguisticsAutomatic Annotation
In this paper, we propose a multi-criteria-based active learning approach and effectively apply it to named entity recognition. Active learning targets to minimize the human annotation efforts by selecting examples for labeling. To maximize the contribution of the selected examples, we consider the multiple criteria: informativeness, representativeness and diversity and propose measures to quantify them. More comprehensively, we incorporate all the criteria using two selection strategies, both of which result in less labeling cost than single-criterion-based method. The results of the named entity recognition in both MUC-6 and GENIA show that the labeling cost can be reduced by at least 80% without degrading the performance.
| Year | Citations | |
|---|---|---|
Page 1
Page 1