Concepedia

Publication | Closed Access

Scaling regression inputs by dividing by two standard deviations

2.4K

Citations

14

References

2007

Year

TLDR

Regression coefficient interpretation depends on the scale of the inputs. The study proposes scaling numeric variables by two standard deviations to enable direct comparison of coefficients. The authors implemented an R function that rescales numeric predictors by two standard deviations and applied it to a linear regression on election data and a multilevel logistic regression on NYC rodent prevalence. Rescaling by two standard deviations makes coefficients directly comparable, especially for binary predictors, and is recommended as a default practice. © 2007 John Wiley & Sons, Ltd.

Abstract

Abstract Interpretation of regression coefficients is sensitive to the scale of the inputs. One method often used to place input variables on a common scale is to divide each numeric variable by its standard deviation. Here we propose dividing each numeric variable by two times its standard deviation, so that the generic comparison is with inputs equal to the mean ±1 standard deviation. The resulting coefficients are then directly comparable for untransformed binary predictors. We have implemented the procedure as a function in R. We illustrate the method with two simple analyses that are typical of applied modeling: a linear regression of data from the National Election Study and a multilevel logistic regression of data on the prevalence of rodents in New York City apartments. We recommend our rescaling as a default option—an improvement upon the usual approach of including variables in whatever way they are coded in the data file—so that the magnitudes of coefficients can be directly compared as a matter of routine statistical practice. Copyright © 2007 John Wiley & Sons, Ltd.

References

YearCitations

Page 1