After this variable reduction
step, a more limited number of covariables will remain for predictive
modeling. A frequently applied method to achieve variable reduction
is stepwise selection of covariables.
Stepwise
selection is usually applied in a forward or backward way.
If stepwise
selection is applied, it is generally agreed upon that backward
selection is preferable to forward selection (Harrell
et al., 1996). The stopping rule for inclusion or
exclusion usually applies the standard significance
level for testing of hypotheses (alpha=0.05). It has,
however, been demonstrated that alpha=0.05 is too small in relatively
small data sets. The power
is too low to identify important predictors as statistically
significant. Therefore, if stepwise selection is used in a small
data set, it should be used in a backward manner with a high
p-value (e.g. 0.20 or 0.50) (Steyerberg
et al., 2000a).
QUESTION
7.3
Stepwise
selection is a method aimed at:
The one
advantage of stepwise selection is that a small, readily
interpretable model arises, which contains the most important
predictors in a prediction problem.
The numerous
disadvantages of stepwise selection are, however, known
from separate studies (Chatfield,
1995) (Harrell
et al., 1996) (Steyerberg
et al., 1999). Disadvantages include: