Publication | Open Access
Analysing Wikipedia and gold-standard corpora for NER training
60
Citations
15
References
2009
Year
Unknown Venue
EngineeringMassive CorpusSemantic WebCorpus LinguisticsNer TrainingText MiningNatural Language ProcessingInformation RetrievalComputational LinguisticsEntity RecognitionLanguage EngineeringLanguage StudiesNamed-entity RecognitionMachine TranslationNlp TaskTerminology ExtractionInformation ExtractionCostly Manual AnnotationLanguage CorpusAnnotationLinguistics
Named entity recognition (ner) for English typically involves one of three gold standards: muc, conll, or bbn, all created by costly manual annotation. Recent work has used Wikipedia to automatically create a massive corpus of named entity annotated text.
| Year | Citations | |
|---|---|---|
Page 1
Page 1