Publication | Closed Access
Training Genetic Programming on Half a Million Patterns: An Example From Anomaly Detection
159
Citations
23
References
2005
Year
Artificial IntelligenceEngineeringMachine LearningData StructureTraining Genetic ProgrammingData ScienceData MiningPattern RecognitionGenetic AlgorithmMillion PatternsEvolution-based MethodIntrusion Detection SystemKnowledge DiscoveryComputer EngineeringGenetic Improvement ProgrammingComputer ScienceHierarchical Rss-dss AlgorithmBioinformaticsEvolutionary ProgrammingEvolutionary Data MiningComputational BiologyLearning Classifier SystemBig Data
The hierarchical RSS-DSS algorithm is introduced for dynamically filtering large datasets based on the concepts of training pattern age and difficulty, while utilizing a data structure to facilitate the efficient use of memory hierarchies. Such a scheme provides the basis for training genetic programming (GP) on a data set of half a million patterns in 15 min. The method is generic, thus, not specific to a particular GP structure, computing platform, or application context. The method is demonstrated on the real-world KDD-99 intrusion detection data set, resulting in solutions competitive with those identified in the original KDD-99 competition, while only using a fraction of the original features. Parameters of the RSS-DSS algorithm are demonstrated to be effective over a wide range of values. An analysis of different cost functions indicates that hierarchical fitness functions provide the most effective solutions.
| Year | Citations | |
|---|---|---|
Page 1
Page 1