Publication | Closed Access
Fast-mRMR: Fast Minimum Redundancy Maximum Relevance Algorithm for High-Dimensional Big Data
191
Citations
17
References
2016
Year
Cluster ComputingEngineeringMachine LearningBig Data IndexingFeature SelectionMap-reduceText MiningInformation RetrievalData ScienceData MiningPattern RecognitionHigh-dimensional Big DataParallel ComputingHigh-performance Data AnalyticsFeature EngineeringKnowledge DiscoveryComputer EngineeringComputer ScienceBig Data SearchDimensionality ReductionData IndexingComputational ScienceParallel ProgrammingIndexing TechniqueSimilarity SearchMassive Data ProcessingBig Data
With the advent of large-scale problems, feature selection has become a fundamental preprocessing step to reduce input dimensionality. The minimum-redundancy-maximum-relevance (mRMR) selector is considered one of the most relevant methods for dimensionality reduction due to its high accuracy. However, it is a computationally expensive technique, sharply affected by the number of features. This paper presents fast-mRMR, an extension of mRMR, which tries to overcome this computational burden. Associated with fast-mRMR, we include a package with three implementations of this algorithm in several platforms, namely, CPU for sequential execution, GPU (graphics processing units) for parallel computing, and Apache Spark for distributed computing using big data technologies.
| Year | Citations | |
|---|---|---|
Page 1
Page 1