Publication | Open Access
Estimation of conditional probabilities with decision trees and an application to fine-grained POS tagging
180
Citations
14
References
2008
Year
Unknown Venue
EngineeringTaggingPart-of-speech TaggingCorpus LinguisticsText MiningSpeech RecognitionNatural Language ProcessingInformation RetrievalData ScienceData MiningDecision TreeComputational LinguisticsAttribute ProbabilitiesDecision Tree LearningLanguage StudiesStatisticsMachine TranslationPos TagsetsConditional ProbabilitiesNlp TaskKnowledge DiscoveryContextual ProbabilitiesComputer ScienceTreebanksSemantic TaggingDecision TreesFine-grained Pos TaggingLinguisticsPo Tagging
We present a HMM part-of-speech tagging method which is particularly suited for POS tagsets with a large number of fine-grained tags. It is based on three ideas: (1) splitting of the POS tags into attribute vectors and decomposition of the contextual POS probabilities of the HMM into a product of attribute probabilities, (2) estimation of the contextual probabilities with decision trees, and (3) use of high-order HMMs. In experiments on German and Czech data, our tagger outperformed state-of-the-art POS taggers.
| Year | Citations | |
|---|---|---|
Page 1
Page 1