Publication | Closed Access
Breaking up is Hard to Do: The Heartbreak of Dichotomizing Continuous Data
419
Citations
7
References
2002
Year
EngineeringContinuous VariableGeneralizability TheorySocial CategorizationPsychologyData ScienceData MiningBiasManagementData IntegrationCognitive Bias MitigationContent AnalysisDecision TheoryData ManagementStatisticsDichotomizing Continuous DataSelection BiasKnowledge DiscoveryExperimental PsychologyDataset CreationOutcome MeasureData SetQuantitative Social Science ResearchType Ii ErrorData HeterogeneityData Modeling
Researchers often take variables that are measured on a continuum and then break them into categories (for example, above or below some cut-point), either to place subjects into groups or as an outcome measure. In this article, we show that the rationales given for this practice are weak and that categorization results in lost information, reduced power of statistical tests, and increased probability of a Type II error. Dichotomizing a continuous variable is justified only when the distribution of that variable is highly skewed or its relation with another variable is nonlinear.
| Year | Citations | |
|---|---|---|
Page 1
Page 1