Publication | Closed Access
Mining top-K covering rule groups for gene expression data
146
Citations
32
References
2005
Year
Unknown Venue
EngineeringMachine LearningPattern DiscoveryPattern MiningGene Expression ProfilingData ScienceData MiningPattern RecognitionBiostatisticsPublic HealthGene Expression DataRcbt ClassifierRule GroupsKnowledge DiscoveryStatistical GeneticsFunctional GenomicsBioinformaticsFrequent Pattern MiningAssociation RuleRule InductionComputational BiologySystems BiologyBuilding Rcbt
In this paper, we propose a novel algorithm to discover the top-k covering rule groups for each row of gene expression profiles. Several experiments on real bioinformatics datasets show that the new top-k covering rule mining algorithm is orders of magnitude faster than previous association rule mining algorithms.Furthermore, we propose a new classification method RCBT. RCBT classifier is constructed from the top-k covering rule groups. The rule groups generated for building RCBT are bounded in number. This is in contrast to existing rule-based classification methods like CBA [19] which despite generating excessive number of redundant rules, is still unable to cover some training data with the discovered rules. Experiments show that the RCBT classifier can match or outperform other state-of-the-art classifiers on several benchmark gene expression datasets. In addition, the top-k covering rule groups themselves provide insights into the mechanisms responsible for diseases directly.
| Year | Citations | |
|---|---|---|
Page 1
Page 1