Publication | Open Access
Using machine learning to maintain rule-based named-entity recognition and classification systems
79
Citations
11
References
2001
Year
Unknown Venue
EngineeringMachine LearningTaggingPart-of-speech TaggingSemantic WebCorpus LinguisticsText MiningNatural Language ProcessingInformation RetrievalData ScienceData MiningManual TaggingComputational LinguisticsGrammarLanguage StudiesNamed-entity RecognitionMachine TranslationClassification SystemKnowledge DiscoveryRule-based Named-entity RecognitionTerminology ExtractionIntelligent ClassificationComputer ScienceInformation ExtractionRule InductionClassification SystemsClassificationLinguistics
This paper presents a method that assists in maintaining a rule-based named-entity recognition and classification system. The underlying idea is to use a separate system, constructed with the use of machine learning, to monitor the performance of the rule-based system. The training data for the second system is generated with the use of the rule-based system, thus avoiding the need for manual tagging. The disagreement of the two systems acts as a signal for updating the rule-based system. The generality of the approach is illustrated by applying it to large corpora in two different languages: Greek and French. The results are very encouraging, showing that this alternative use of machine learning can assist significantly in the maintenance of rule-based systems.
| Year | Citations | |
|---|---|---|
Page 1
Page 1