Publication | Open Access
Detecting errors in part-of-speech annotation
120
Citations
12
References
2003
Year
Unknown Venue
EngineeringPart-of-speech TaggingCorpus LinguisticsText MiningSpeech RecognitionNatural Language ProcessingLanguage DocumentationData ScienceComputational LinguisticsGrammarLanguage StudiesMachine TranslationPart-of-speech AnnotationNlp TaskPenn Tree-bankTreebanksAnnotation ToolSpeech ProcessingMultiple TaggingsLinguisticsPo Tagging
We propose a new method for detecting errors in "gold-standard" part-of-speech annotation. The approach locates errors with high precision based on n-grams occurring in the corpus with multiple taggings. Two further techniques, closed-class analysis and finite-state tagging guide patterns, are discussed. The success of the three approaches is illustrated for the Wall Street Journal corpus as part of the Penn Tree-bank.
| Year | Citations | |
|---|---|---|
Page 1
Page 1