Publication | Closed Access
Efficiently Mining Gene Expression Data via a Novel Parameterless Clustering Method
Citations: 125 | References: 23 | Year: 2005
Clustering Process, Machine Learning, Engineering, Gene Recognition, Gene Expression Profiling, Unsupervised Machine Learning, Optimization-based Data Mining, Correlation Search Technique, Data Science, Data Mining, Pattern Recognition, Biostatistics, Public Health, Microarray Data Analysis, Gene Expression Data, Knowledge Discovery, Functional Genomics, Bioinformatics, Functional Data Analysis, Computational Biology, Systems Biology, Fuzzy Clustering
Clustering analysis has long been an important research topic in machine learning because of its wide range of applications. In recent years, it has also become a valuable tool for in-silico analysis of microarray and other gene expression data. Although a number of clustering methods have been proposed, they have difficulty meeting the requirements of automation, high quality, and high efficiency at the same time. In this paper, we propose a novel, parameterless, and efficient clustering algorithm, the Correlation Search Technique (CST), which is well suited to the analysis of gene expression data. The unique feature of CST is that it incorporates validation techniques into the clustering process, so that high-quality clustering results can be produced on the fly. Experimental evaluation on both synthetic and real data sets shows that CST substantially outperforms other clustering methods in terms of clustering quality, efficiency, and automation.