Publication | Open Access
Stopping criteria for active learning of named entity recognition
78
Citations
14
References
2008
Year
Unknown Venue
EngineeringMachine LearningStatistical NlpCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalData ScienceComputational LinguisticsEntity RecognitionLanguage EngineeringLanguage StudiesNamed-entity RecognitionMachine TranslationEntity DisambiguationNlp TaskKnowledge DiscoveryTerminology ExtractionInformation ExtractionActive LearningLinguisticsPo Tagging
Active learning is a proven method for reducing the cost of creating the training sets that are necessary for statistical NLP. However, there has been little work on stopping criteria for active learning. An operational stopping criterion is necessary to be able to use active learning in NLP applications. We investigate three different stopping criteria for active learning of named entity recognition (NER) and show that one of them, gradient-based stopping, (i) reliably stops active learning, (ii) achieves nearoptimal NER performance, (iii) and needs only about 20% as much training data as exhaustive labeling.
| Year | Citations | |
|---|---|---|
Page 1
Page 1