Description Usage Arguments Details Value Examples

Conduct PCA on variables with biggest variance in high dimensional data matrix

1 | ```
pcv(x, cols=5, sites=5000)
``` |

`x` |
name of data matrix |

`cols` |
number of principal components to extract |

`sites` |
number of attributes to consider |

pcv assumes data in a numeric matrix and variable major format, i.e. every line corresponds to to a variable, while the columns correspond to the individual observations. This is commonly the case for data in high throughput experiments where the number of data points per individuals is high (> 10,000), while the size of batches is comparably small (dozens to hundreds). Variables with missing values are disregarded for the selection.

Use t() to transpose individual major data sets beforehand.

pcv selects the attributes with the highest variance up to the numbers provided, but takes considerations to limit these to the actual size of the present data set.

This is often used as first step in high throughput measurements to detect global effects of known batch variables.

matrix with rows corresponding to observations and columns to extracted components. Values denote the scores on the extracted components for the respective observations.

1 2 |

Embedding an R snippet on your website

Add the following code to your website.

For more information on customizing the embed code, read Embedding Snippets.