Publication | Closed Access
A New Approach for Detecting Multivariate Outliers
14
Citations
14
References
2005
Year
Anomaly DetectionEngineeringGeneticsMax-eigen DifferenceGenomicsSequence AlignmentGene RecognitionData ScienceData MiningMolecular EcologyPattern RecognitionComputational GenomicsBiostatisticsNew ProcedureRobust DistanceStatisticsSequence AnalysisOutlier DetectionKnowledge DiscoveryStatistical GeneticsMultivariate OutliersBioinformaticsFunctional GenomicsComputational BiologyMedicineMultivariate Analysis
ABSTRACT This article proposes a new procedure named Max-Eigen difference (MED) for identifying outliers in multivariate data sets. Theoretical aspects of the procedure are briefly discussed. The proposed procedure is compared with the Mahalanobis distance (MD) and robust distance (RD) via two examples. It is indicated that the MED works better than MD and is comparable with RD. Finally, this procedure is applied during constructing a quadratic discriminant analysis which is used to splicing sites prediction for DNA sequences. Through the results of rice and human genome data sets, it can be seen that the robustified discriminant provides higher prediction accuracy than the usual discrimination method. Keywords: DonorMahalanobis distanceMEDOutlierRobust distanceMathematics Subject Classification: 62H0762H99 Acknowledgments We thank an anonymous referee for useful comments on an earlier version of the article. This work is partly supported by NSFC, grant 10371126.
| Year | Citations | |
|---|---|---|
Page 1
Page 1