Publication | Closed Access
Balanced parallel FP-Growth with MapReduce
98
Citations
10
References
2010
Year
Unknown Venue
Cluster ComputingBalanced Parallel Fp-growthEngineeringPattern DiscoveryPattern MiningMap-reduceMining MethodsData ScienceData MiningFrequent Itemset MiningPfp AlgorithmParallel ComputingKnowledge DiscoveryComputer EngineeringComputer ScienceScalable ComputingFim AlgorithmsFrequent Pattern MiningAssociation RuleParallel ProcessingCloud ComputingParallel ProgrammingBig Data
Frequent itemset mining (FIM) plays an essential role in mining associations, correlations and many other important data mining tasks. Unfortunately, as the volume of dataset gets larger day by day, most of the FIM algorithms in literature become ineffective due to either too huge resource requirement or too much communication cost. In this paper, we propose a balanced parallel FP-Growth algorithm BPFP, based on the PFP algorithm [1], which parallelizes FP-Growth in the MapReduce approach. BPFP adds into PFP load balance feature, which improves parallelization and thereby improves performance. Through empirical study, BPFP outperformed the PFP which uses some simple grouping strategy.
| Year | Citations | |
|---|---|---|
Page 1
Page 1