Publication | Closed Access
Consistent Feature Selection for Pattern Recognition in Polynomial Time
169
Citations
23
References
2007
Year
Unknown Venue
EngineeringMachine LearningBiometricsFeature SelectionMachine Learning TasksOptimization-based Data MiningClassification MethodImage AnalysisData ScienceData MiningPattern RecognitionBiostatisticsConsistent Feature SelectionMachine VisionFeature EngineeringMinimal FeatureKnowledge DiscoveryComputer ScienceStatistical Pattern RecognitionBioinformaticsFeature ConstructionComputer VisionComputational BiologyPattern Recognition Application
We analyze two different feature selection problems: finding a minimal feature set optimal for classification (MINIMAL-OPTIMAL) vs. finding all features relevant to the target variable (ALL-RELEVANT). The latter problem is motivated by recent applications within bioinformatics, particularly gene expression analysis. For both problems, we identify classes of data distributions for which there exist consistent, polynomial-time algorithms. We also prove that ALL-RELEVANT is much harder than MINIMAL-OPTIMAL and propose two consistent, polynomial-time algorithms. We argue that the distribution classes considered are reasonable in many practical cases, so that our results simplify feature selection in a wide range of machine learning tasks.
| Year | Citations | |
|---|---|---|
Page 1
Page 1