Publication | Closed Access
Generalization and decision tree induction: efficient classification in data mining
118
Citations
27
References
2002
Year
Unknown Venue
EngineeringMachine LearningAttribute Oriented InductionScalability IssuesText MiningOptimization-based Data MiningInformation RetrievalData ScienceData MiningPattern RecognitionDecision TreeManagementData IntegrationDecision Tree LearningKnowledge Discovery ProcessStatisticsPredictive AnalyticsKnowledge DiscoveryIntelligent ClassificationComputer ScienceDecision Tree InductionEvolutionary Data MiningRule InductionEfficient InductionClassificationBig Data
Efficiency and scalability are fundamental issues concerning data mining in large databases. Although classification has been studied extensively, few of the known methods take serious consideration of efficient induction in large databases and the analysis of data at multiple abstraction levels. The paper addresses the efficiency and scalability issues by proposing a data classification method which integrates attribute oriented induction, relevance analysis, and the induction of decision trees. Such an integration leads to efficient, high quality, multiple level classification of large amounts of data, the relaxation of the requirement of perfect training sets, and the elegant handling of continuous and noisy data.
| Year | Citations | |
|---|---|---|
Page 1
Page 1