Publication | Open Access
Self-training PCFG grammars with latent annotations across languages
87
Citations
21
References
2009
Year
Unknown Venue
Syntactic ParsingEngineeringPcfg-la ParserLatent AnnotationsLanguage LearningText MiningNatural Language ProcessingApplied LinguisticsSyntaxComputational LinguisticsGrammarLanguage StudiesMachine TranslationSelf-training Pcfg GrammarsGrammar InductionSemantic ParsingShallow ParsingParsingTreebanksLinguisticsPo Tagging
We investigate the effectiveness of self-training PCFG grammars with latent annotations (PCFG-LA) for parsing languages with different amounts of labeled training data. Compared to Charniak's lexicalized parser, the PCFG-LA parser was more effectively adapted to a language for which parsing has been less well developed (i.e., Chinese) and benefited more from self-training. We show for the first time that self-training is able to significantly improve the performance of the PCFG-LA parser, a single generative parser, on both small and large amounts of labeled training data. Our approach achieves state-of-the-art parsing accuracies for a single parser on both English (91.5%) and Chinese (85.2%).
| Year | Citations | |
|---|---|---|
Page 1
Page 1