Publication | Open Access
A high-dimension two-sample test for the mean using cluster subspaces
20
Citations
15
References
2015
Year
Cluster SubspacesQuantitative GeneticsNew Test StatisticModified TestEngineeringHigh-dimensional MethodMultidimensional AnalysisStatistical GeneticsBiostatisticsStatistical InferenceDimensionality ReductionPublic HealthPopulation GeneticsStatisticsRandom Subspaces
A common problem in modern genetic research is that of comparing the mean vectors of two populations–typically in settings in which the data dimension is larger than the sample size–where Hotelling's test cannot be applied. Recently, a test using random subspaces was proposed, in which the data are randomly projected into several lower-dimensional subspaces, and Hotelling's test is well defined. Superior performance with competing tests was demonstrated when the variables were correlated. Following the research of random subspaces, a modified test was proposed that might make more efficient use of covariance structure at high dimension. Hierarchical clustering is performed first such that highly correlated variables are clustered together. Next, Hotelling's statistics are computed for every cluster-subspace and summed as the new test statistic. High performance was demonstrated via simulations and real data analysis.
| Year | Citations | |
|---|---|---|
Page 1
Page 1