Description Usage Arguments Value Author(s) Examples
Search for the optimal sample order by using the Extended Nearest Insertion
1 |
Data |
gene-by-sample matrix or isoform-by-sample matrix.It should be rescaled to values bwteen [-1,1]. |
Seq |
NULL or a vector indicates the sample order. if specified, the samples will be first reordered by this vector. |
NChun |
number of starting points for polynomial fitting. |
RdmStart |
whether the start points are randomly selected. |
Ndg |
degree of polynomial. |
This function performs the extended nearest insertion (ENI). The ENI algorithm searchs for the optimal sample order which minimizes the MSE of sliding polynomial regression (SPR). This function will call PipeShiftCDF() function, which fits SPR to each row of the data. For each gene/isoform, SPR fits NChun polynomial curves with different starting points (samples). The samples with smaller order than the start point will be appended to follow the last sample when fitting. So each fitting consider same number of samples. If RdmStart = TRUE, the start points are randomly selected. Otherwise they are evenly sampled along the sample order. The aggregated MSE of a fit (using a specific start point) is defined as the summation of the MSEs of all genes/isoforms considered here. The MSE of the SPR is defined as the largest aggregated MSE across fits using different start points. The output returns the optimal order which provides the smallest SPR MSE.
Ning Leng
1 2 3 4 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.