Concepedia

Publication | Closed Access

DBSCAN: Past, present and future

569

Citations

29

References

2014

Year

Unknown Author(s)

Unknown Venue

TLDR

Data mining uses clustering to extract patterns from large spatial datasets; DBSCAN is a pioneering density‑based algorithm that can find arbitrarily shaped clusters amid noise but suffers from parameter sensitivity, density variation issues, and computational cost, prompting variants such as VDBSCAN, FDBSCAN, DD_DBSCAN, and IDBSCAN. The study surveys all existing DBSCAN variants that have been proposed. The authors critically evaluate each DBSCAN variant and list their limitations. The survey reveals that each variant has specific shortcomings, highlighting the need for further improvements.

Abstract

Data Mining is all about data analysis techniques. It is useful for extracting hidden and interesting patterns from large datasets. Clustering techniques are important when it comes to extracting knowledge from large amount of spatial data collected from various applications including GIS, satellite images, X-ray crystallography, remote sensing and environmental assessment and planning etc. To extract useful pattern from these complex data sources several popular spatial data clustering techniques have been proposed. DBSCAN (Density Based Spatial Clustering of Applications with Noise) is a pioneer density based algorithm. It can discover clusters of any arbitrary shape and size in databases containing even noise and outliers. DBSCAN however are known to have a number of problems such as: (a) it requires user's input to specify parameter values for executing the algorithm; (b) it is prone to dilemma in deciding meaningful clusters from datasets with varying densities; (c) and it incurs certain computational complexity. Many researchers attempted to enhance the basic DBSCAN algorithm, in order to overcome these drawbacks, such as VDBSCAN, FDBSCAN, DD_DBSCAN, and IDBSCAN. In this study, we survey over different variations of DBSCAN algorithms that were proposed so far. These variations are critically evaluated and their limitations are also listed.

References

YearCitations

Page 1