Publication | Closed Access
DBSCAN: Past, present and future
569
Citations
29
References
2014
Year
Unknown Venue
EngineeringSpatial Data MiningSpatial ClusteringUnsupervised Machine LearningOptimization-based Data MiningDatabase SystemData ScienceData MiningPattern RecognitionDatabase ProcessingData IntegrationData ManagementDocument ClusteringData Analysis TechniquesGeographyKnowledge DiscoveryComputer ScienceDatabase TechnologyFuzzy Clustering
Data mining uses clustering to extract patterns from large spatial datasets; DBSCAN is a pioneering density‑based algorithm that can find arbitrarily shaped clusters amid noise but suffers from parameter sensitivity, density variation issues, and computational cost, prompting variants such as VDBSCAN, FDBSCAN, DD_DBSCAN, and IDBSCAN. The study surveys all existing DBSCAN variants that have been proposed. The authors critically evaluate each DBSCAN variant and list their limitations. The survey reveals that each variant has specific shortcomings, highlighting the need for further improvements.
Data Mining is all about data analysis techniques. It is useful for extracting hidden and interesting patterns from large datasets. Clustering techniques are important when it comes to extracting knowledge from large amount of spatial data collected from various applications including GIS, satellite images, X-ray crystallography, remote sensing and environmental assessment and planning etc. To extract useful pattern from these complex data sources several popular spatial data clustering techniques have been proposed. DBSCAN (Density Based Spatial Clustering of Applications with Noise) is a pioneer density based algorithm. It can discover clusters of any arbitrary shape and size in databases containing even noise and outliers. DBSCAN however are known to have a number of problems such as: (a) it requires user's input to specify parameter values for executing the algorithm; (b) it is prone to dilemma in deciding meaningful clusters from datasets with varying densities; (c) and it incurs certain computational complexity. Many researchers attempted to enhance the basic DBSCAN algorithm, in order to overcome these drawbacks, such as VDBSCAN, FDBSCAN, DD_DBSCAN, and IDBSCAN. In this study, we survey over different variations of DBSCAN algorithms that were proposed so far. These variations are critically evaluated and their limitations are also listed.
| Year | Citations | |
|---|---|---|
Page 1
Page 1