The standardize function returns a list of class
standardized, which has a print method,
and which can additionally be used to place new data into the same
standardized space as the data passed in the call to standardize
using the predict function.
The standardized list contains the following elements.
The call to standardize which created the
object.
The scale argument to standardize.
The regression formula in standardized space (with new
names) which can be used along with the data element to fit
regressions. It has an attribute standardized.scale which is the
same as the scale element of the object (this allows users and
package developers to write regression-fitting functions which can tell
if the input is from a standardized object).
The regression family.
A data frame containing the regression variables in a
standardized space (renamed to have valid variable names corresponding
to those in the formula element).
The offset passed through the offset argument to
standardize (scaled if family = gaussian), or
NULL if the offset argument was not used.
A list containing unevaluated calls which allow the
predict method to work.
A data frame with the name of the original variable, the corresponding name in the standardized data frame and formula, and the class of the variable in the standardized data frame.
A named list of contrasts for all factors included as
predictors, or NULL if no predictors are factors.
A named list of levels for random effects grouping factors,
or NULL if there are no random effects.
In the variables data frame, the Variable column contains the
name of the variable in the original formula passed to standardize.
The Standardized Name column contains the name of the variable in the standardized
formula and data frame. The original variable name is altered such that the
original name is still recoverable but is also a valid variable name for
regressions run using the formula and data elements of the
standardized object. For example, exp(x) would become
exp_x and log(x + 1) would become log_x.p.1. If
the indicator function is used, this can lead to a long and possibly
difficult to interpret name; e.g. I(x1 > 0 & x2 < 0) would become
I_x1.g.0.a.x2.l.0. In such cases, it is better to create the variable
explicitly in the data frame and give it a meaningful name; in this case,
something like mydata$x1Pos_x2Neg <- mydata$x1 > 0 & mydata$x2 < 0,
and then use x1Pos_x2Neg in the call to standardize.
The Class column in the variables data frame takes the
following values (except for non-gaussian responses, which are left
unaltered, and so may have a different class; the class for the response is
always preceded by response.).
A numeric vector.
A numeric matrix resulting from a call to
poly.
A numeric vector resulting from a call to
scale_by.
A numeric matrix resulting from a call to
poly nested within a call to
scale_by.
An unordered factor.
An ordered factor.
A random effects grouping factor.
If the offset function was used within the formula passed to
standardize, then the variable is numeric and labeled as
offset. The formula element of the standardize
object contains offset calls to ensure regression fitting functions use
them properly. If the offset argument was used in the call to
standardize (rather than putting offset calls in the
formula), then the offset is not in the variables data frame (it
is in the offset element of the standardized object).
The standardized object has a printing method which displays the call,
formula, and variable frame along with an explanation of the
standardization. The is.standardized function returns
TRUE if an object is the result of a call to standardize
and FALSE otherwise. The predict
method places new data into the same standardized space as the data
passed to the original standardize call.
Christopher D. Eager <eager.stats@gmail.com>
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.