Description Usage Arguments Value Author(s) References
Performs the second step of the ISOpure purification algorithm, taking tumor data and normal profiles and returning the a list, ISOpureS2model, with all the updated parameters.
1  ISOpure.step2.PPE(tumordata, BB, ISOpureS1model, MIN_KAPPA, logging.level)

tumordata 
(same as for ISOpureS1) a GxD matrix representing gene expression profiles of heterogeneous (mixed) tumor samples, where G is the number of genes, D is the number of tumor samples. 
BB 
(same as for ISOpureS1) represents B = [b_1 ... b_(K1)] matrix (from Genome Medicine paper) a Gx(K1) matrix, where (K1) is the number of normal profiles (β_1,...,β_(K1)), G is the number of genes. These are the normal profiles representing normal cells that contaminate the tumor samples (i.e. normal samples from the same tissue location as the tumor). The minimum element of BB must be greater than 0 – i.e. every gene/transcript must be observed on some level in each normal sample. 
ISOpureS1model 
output model list from ISOpureS1 code 
MIN_KAPPA 
(optional) The minimum value allowed for the strength parameter kappa placed over the reference cancer profile m (see Quon et al, 2013). By default, this is set to 1/min(BB), such that the log likelihood of the model is always finite. However, when the min(BB) is very small, this forces MIN_KAPPA to be very large, and can sometimes cause the reference profile m to look too much like a 'normal profile' (and therefore you may observe the tumor samples having low % cancer content estimates). If this is the case, you can try setting MIN_KAPPA=1, or some other small value. For reference, for the data presented in Quon et al., 2013, MIN_KAPPA is on the order of 10^5. 
logging.level 
(optional) A string that gives the logging threshold for futile.logger. The possible options are 'TRACE', 'DEBUG', 'INFO', 'WARN', 'ERROR', 'FATAL'. Currently the messages in ISOpureR are only in the categories 'INFO', 'WARN', and 'FATAL', and the default setting is 'INFO'. Setting a setting for the entire package will override the setting for a particular function. 
ISOpureS2model, a list with the following important fields:
theta 
a DxK matrix, giving the fractional composition of each tumor sample. Each row represents a tumor sample that was part of the input, and the first K1 columns correspond to the fractional composition with respect to the Source Panel contaminants. The last column represents the fractional composition of the pure cancer cells. In other words, each row sums to 1, and element (i,j) of the matrix denotes the fraction of tumor i attributable to component j (where the last column refers to cancer cells, and the first K1 columns refer to different 'normal cell' components). The 'cancer', or tumor purity, estimate of each tumor is simply the last column of theta. 
alphapurities 
(same as ISOpureS1) tumor purities (alpha_i in paper), same as the last column of the theta variable, pulled out for user convenience  not changed in step 2 
cc_cancerprofiles 
purified cancer profiles. This matrix is of the same dimensionality as tumordata, and is also on the same scale (i.e. although ISOpureS2 treats purified cancer profiles as parameters of a multinomial distribution, we rescale them to be on the same scale as the input tumor profiles – see Genome Medicine paper). Column i of cc_cancerprofiles corresponds to column i of tumordata. 
total_loglikelihood 
log likelihood of the model 
omega 
(internal parameter, same as ISOpureS1) prior over the reference cancer profile  not changed in step 2 
vv 
(internal parameter) hyperparameters from Dirichlet distribution, representing both mean and strength of a Dirichlet distribution over theta 
kappa 
(internal parameter) the strength parameter over the Dirichlet distribution over cc, given the reference cancer parameter, mm 
mm_weights, theta_weights, omega_weights 
(internal parameters) used in the optimization of mm, theta, and omega (instead of performing constrained optimization on these positively constrained variables directly, we optimize their logs in an unconstrained fashion.) 
log_BBtranspose, PPtranspose, log_all_rates: 
(internal parameters) used in the calculations of loglikelihood 
MIN_KAPPA 
(internal parameter) as described in the Arguments section 
Gerald Quon, Catalina Anghel, Francis Nguyen
G Quon, S Haider, AG Deshwar, A Cui, PC Boutros, QD Morris. Computational purification of individual tumor gene expression profiles. Genome Medicine (2013) 5:29, http://genomemedicine.com/content/5/3/29.
G Quon, QD Morris. ISOLATE: a computational strategy for identifying the primary origin of cancers using highthoroughput sequencing. Bioinformatics 2009, 25:28822889 http://bioinformatics.oxfordjournals.org/content/25/21/2882.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.