Analysing Wikipedia and gold-standard corpora for NER training

Abstract

Named entity recognition (NER) for English typically involves one of three gold standards: MUC, CoNLL, or BBN, all created by costly manual annotation. Recent work has used Wikipedia to automatically create a massive corpus of named-entity-annotated text.
