dblm: Adjusting distance-based linear regression model
In geosptdb: Spatio-Temporal Radial Basis Functions with Distance-Based Methods (Optimization, Prediction and Cross Validation)

dblm	R Documentation

Adjusting distance-based linear regression model

Description

dblm is a linear model variety where explanatory information is coded as distances among individuals so these distances can also be computed from observed explanatory variables (a mix of continuous, qualitative explanatory variables or from more general quantities). The response is a continuous variable as in the classic linear model.

lm is used internally to adjust a distance-based linear regression model. The method considers the Gower's distance for mixed covariates (numeric, ordered, or factor), for explanation on the meaning of distance-based linear regression model and distance of Gower see the bibliography references below.

Usage

dblm(data,y,sc,ev.min, ...)

Arguments

`data`	matrix or data frame containing the explanatory variables. These variables can be numeric, ordered, or factor. Symmetric or asymmetric binary variables should be numeric and only contain 0 and 1. character variables will be converted to factor. NAs are tolerated. With these variables are built, the principal coordinates which later become the regressors in the linear model.
`y`	the response variable used to fit the model
`sc`	the value of the correlation squared to select the principal coordinates more related to the response variable. The default value is 0.003.
`ev.min`	the minimum value to select the eigenvalues. These eigenvalues must be positive, the default value is 0.007
`...`	further parameters to be passed to the `gowdis` function (see `gowdis`) of low level.

Details

The dblm model builds; principal coordinates matrix, eigenvalues, and a linear regression model. gowdis function used in dblm compute the Gower (1971) similarity coefficient exactly as described by Podani (1999), then converts it to a dissimilarity coefficient by using D = 1 - S. It integrates variable weights as described by Legendre and Legendre (1998).

Value

A list containing the following components:

`table`	table with eigenvalues, correlations squared, and percentages of inertia associated with the most statistically significant principal coordinates (5%) with the response variable.
`ev`	the `n` eigenvalues computed during the scaling process (see `cmdscale`).
`cp`	the `k` most statistically significant principal coordinates (5%) with the response variable.
`dbmodel`	returns a list of summary statistics of the fitted linear model.

References

Cuadras, CM., Arenas C. and Fortiana, J. (1996). Some computational aspects of a distance-based model for prediction. Communications in Statistics B - Simulation and Computation 25, 593-609.

Cuadras, CM. and Arenas, C. (1990).A distance-based regression model for prediction with mixed data. Communications in Statistics A - Theory and Methods 19, 2261-2279

Gower, J. C. (1971). A general coefficient of similarity and some of its properties. Biometrics 27:857-871.

Legendre, P. and Legendre, L. (1998). Numerical Ecology. 2nd English edition. Amsterdam: Elsevier.

Melo, C. E. (2012). Analisis geoestadistico espacio tiempo basado en distancias y splines con aplicaciones. PhD. Thesis. Universitat de Barcelona. 276 p. [link]

Podani, J. (1999). Extending Gower's general coefficient of similarity to ordinal characters. Taxon 48:331-340.

Examples

# considering 10 principal coordinates (constructed from a distance-based linear 
# regression model)
## Not run: 
data(croatia.temp)
data(croatiadb)
croatia.temp[,7] <- as.factor(croatia.temp[,7])
dblm1 <- dblm(data=croatia.temp,y=croatiadb$MTEMP) 
str(dblm1)

## End(Not run)

geosptdb documentation built on June 22, 2025, 1:06 a.m.