Publication | Closed Access
A high-performance distributed algorithm for mining association rules
28
Citations
15
References
2004
Year
Unknown Venue
Cluster ComputingEngineeringPattern DiscoveryNetwork AnalysisPattern MiningAssociation Rule MiningSuperlinear SpeedupText MiningData ScienceData MiningParallel ComputingAssociation RulesHigh-performance Data AnalyticsKnowledge DiscoveryComputer EngineeringComputer ScienceFrequent Pattern MiningAssociation RuleError ProbabilityParallel Programming
We present a new distributed association rule mining (D-ARM) algorithm that demonstrates superlinear speedup with the number of computing nodes. The algorithm is the first D-ARM algorithm to perform a single scan over the database. As such, its performance is unmatched by any previous algorithm. Scale-up experiments over standard synthetic benchmarks demonstrate stable run time regardless of the number of computers. Theoretical analysis reveals a tighter bound on error probability than the one shown in the corresponding sequential algorithm.
| Year | Citations | |
|---|---|---|
Page 1
Page 1