Publication | Open Access
Extracting and evaluating general world knowledge from the Brown corpus
68
Citations
10
References
2003
Year
Unknown Venue
EngineeringKnowledge ExtractionTextual EntailmentSemanticsCorpus LinguisticsText MiningNatural Language ProcessingLanguage DocumentationData ScienceComputational LinguisticsLanguage EngineeringLanguage StudiesBrown CorpusMachine TranslationNlp TaskKnowledge DiscoveryTerminology ExtractionApproximate InterpretationInformation ExtractionSemantic ParsingAutomated ReasoningLanguage CorpusGeneral World KnowledgeLinguisticsComputational Semantics
We have been developing techniques for extracting general world knowledge from miscellaneous texts by a process of approximate interpretation and abstraction, focusing initially on the Brown corpus. We apply interpretive rules to clausal patterns and patterns of modification, and concurrently abstract general "possibilistic" propositions from the resulting formulas. Two examples are "A person may believe a proposition", and "Children may live with relatives". Our methods currently yield over 117,000 such propositions (of variable quality) for the Brown corpus (more than 2 per sentence). We report here on our efforts to evaluate these results with a judging scheme aimed at determining how many of these propositions pass muster as "reasonable general claims" about the world in the opinion of human judges. We find that nearly 60% of the extracted propositions are favorably judged according to our scheme by any given judge. The percentage unanimously judged to be reasonable claims by multiple judges is lower, but still sufficiently high to suggest that our techniques may be of some use in tackling the long-standing "knowledge acquisition bottleneck" in AI.
| Year | Citations | |
|---|---|---|
1996 | 2.1K | |
1999 | 683 | |
1999 | 464 | |
1992 | 108 | |
2002 | 100 | |
1993 | 76 | |
2001 | 76 | |
1992 | 60 | |
1990 | 33 | |
1999 | 15 |
Page 1
Page 1