Publication | Closed Access
A bootstrap testing procedure for investigating the number of subpopulations
22
Citations
16
References
1985
Year
EngineeringPopulation DynamicPopulation EcologyCombinatorial Data AnalysisData ScienceData MiningBiostatisticsPublic HealthStatisticsClustering (Nuclear Physics)Sampling (Statistics)Population StudySubpopulations CorrespondBootstrap ResamplingKth Nearest NeighborTest StatisticsPopulation DevelopmentStatistical InferenceDemographyClustering (Data Mining)
Determining the number of subpopulations from sample data is a major problem in cluster analysis. We assume in this study that the subpopulations correspond to modes of the population density function f. We then propose using test statistics based on the kth nearest neighbor clustering method to investigate the modality of f. A modified bootstrap procedure for estimating the sample significance levels of these statistics in the univariate case is described. The performance of this procedure in determining the number of subpopulations will be illustrated by generated and real data sets.
| Year | Citations | |
|---|---|---|
Page 1
Page 1