Publication | Closed Access
Text chunking by combining hand-crafted rules and memory-based learning
43
Citations
9
References
2003
Year
Unknown Venue
Syntactic ParsingEngineeringCorpus LinguisticsText MiningNatural Language ProcessingSyntaxComputational LinguisticsLanguage EngineeringGrammarLanguage StudiesMachine TranslationMachine Learning MethodInformation ExtractionShallow ParsingResidual ErrorsHand-crafted RulesText ProcessingLinguisticsChunking
This paper proposes a hybrid of hand-crafted rules and a machine learning method for chunking Korean. In the partially free word-order languages such as Korean and Japanese, a small number of rules dominate the performance due to their well-developed postpositions and endings. Thus, the proposed method is primarily based on the rules, and then the residual errors are corrected by adopting a memory-based machine learning method. Since the memory-based learning is an efficient method to handle exceptions in natural language processing, it is good at checking whether the estimates are exceptional cases of the rules and revising them. An evaluation of the method yields the improvement in F-score over the rules or various machine learning methods alone.
| Year | Citations | |
|---|---|---|
Page 1
Page 1