The Linear
Predictor
In the regression
models that we consider, the regression formula is a linear
combination of regression coefficients and predictors. The values
that result from the regression formula may also be referred
to as the "linear predictor" or "prognostic index."
The linear
predictor (LP) summarizes the prognosis for every patient. The
LP can be calculated for the patients in the data set under
study as: LP = a + b1x1
+ b2x2
+ … + bixi,
with a denoting the intercept, and b1
to bi regression coefficients
for i covariables x1
to xi. In the data set under
study, the LP by definition has a perfect fit when we include
the LP in a regression model:
Y
~ a + b LP,
Where the
slope b is unity and the intercept a is zero.
The LP can
also be calculated for patients not included in the modeling
process:
LPmodel
= amodel + b1,
model x1,
new + b2,
model x2,
new + … + bI,
model xI,
new, with a denoting the intercept
from the model, b1,
model to bi,
model the regression coefficients
from the model, and xi, new
to xi, new
the covariables for the new patients. We can include this LP
in a regression model:
Ynew
~ anew + bnew
LPmodel,
Where, if
the model provides perfectly valid predictions, the slope bnew
is unity and the intercept anew
zero.
In practice,
however, the slope of the LP is often less than unity in new
patients (Spiegelhalter,
1986) (Van
Houwelingen and Le Cessie, 1990) (Harrell
et al., 1996): high predictions are too high and
low predictions too low. Hence, the linear predictor is an important
concept to assess model performance (see: Development of Regression
Models, Evaluation
of Performance) and, more specifically, calibration
of predictions (see: Development of Regression Models, Evaluation
of Performance).