Publication | Closed Access
The Impact of Model Selection on Inference in Linear Regression
276
Citations
11
References
1990
Year
Conditional Coverage RatesEngineeringHigh-dimensional MethodData ScienceSelection BiasPredictive AnalyticsManagementPredictive ModelingEconometricsBiostatisticsStatistical InferenceRegression AnalysisData SplittingModel ComparisonStatisticsAbstract Model SelectionRegression Testing
Abstract Model selection and inference are usually treated as separate stages of regression analysis, even though both tasks are performed on the same set of data. Once a model has been selected, one typically proceeds as though one has a fresh data set generated by the selected model. Here, we present Monte Carlo results on the coverage rates of confidence regions for the regression parameters, conditional on the selected model order. The conditional coverage rates are much smaller than the nominal coverage rates, obtained by assuming that the model was known in advance. Furthermore, the overall coverage rate is much smaller than the nominal value. A possible remedy based on data splitting is suggested.
| Year | Citations | |
|---|---|---|
Page 1
Page 1