robdist | R Documentation |
Scrubbing with robust distance.
robdist(
X,
RD_cutoff = 4,
RD_quantile = 0.99,
trans = c("none", "robust-YJ", "SHASH"),
bootstrap_n = 1000,
bootstrap_alpha = 0.01,
projection = c("ICA", "PCA"),
nuisance = "DCT4",
center = TRUE,
scale = TRUE,
comps_mean_dt = FALSE,
comps_var_dt = FALSE,
PESEL = TRUE,
kurt_quantile = 0.99,
get_dirs = FALSE,
full_PCA = FALSE,
get_outliers = TRUE,
cutoff = 4,
seed = 0,
ICA_method = c("C", "R"),
skip_dimred = FALSE,
verbose = FALSE
)
X |
Wide numeric data matrix ( |
RD_cutoff |
Default: |
RD_quantile |
Quantile cutoff...? |
trans |
Apply a transformation prior to univariate outlier detection?
Three options: |
bootstrap_n |
Use bootstrapping to estimate the robust distance null
distribution? If so, set this to the number of bootstraps. Default:
|
bootstrap_alpha |
If using bootstrap ( |
projection |
One of the following: |
nuisance |
Nuisance signals to regress from each column of Detrending is highly recommended for time-series data, especially if there are many time points or evolving circumstances affecting the data. Additionally, if kurtosis is being used to select the projection directions, trends can induce positive or negative kurtosis, contaminating the connection between high kurtosis and outlier presence. Detrending should not be used with non-time-series data because the observations are not temporally related. Additional nuisance regressors can be specified like so:
|
center, scale |
Center the columns of the data by their medians, and scale the
columns of the data by their median absolute deviations (MADs)? Default: Note that centering and scaling occur after nuisance regression, so even if
|
comps_mean_dt, comps_var_dt |
Stabilize the mean and variance of each
projection component's timecourse prior to computing kurtosis and leverage?
These arguments should be Slow-moving mean and variance patterns in the components will interfere with
the roles of kurtosis and leverage in identifying outliers. While
Overall, for fMRI we recommend enabling |
PESEL |
Use |
kurt_quantile |
What quantile cutoff should be used to select the
components? Default: We model each component as a length |
get_dirs |
Should the projection directions be returned? This is the
|
full_PCA |
Only applies to the PCA projection. Return the full SVD?
Default: |
get_outliers |
Should outliers be flagged based on |
cutoff |
Median leverage cutoff value. Default: |
seed |
Set a seed right before the call to |
ICA_method |
The |
skip_dimred |
Skip dimension reduction? Default: |
verbose |
Should occasional updates be printed? Default: |
A "robdist"
object, i.e. a list with components
...
...
...
library(fastICA)
rdx = robdist(Dat1[seq(70),seq(800,950)])
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.