Cell_filtering | R Documentation |
gRNA targets Cas9 to a specific gene locus, but only in 70%-80% will generate true loss-of-function of the targeted gene(Sternberg and Doudna, 2015). Therefore, to estimate the ranking of impact of different perturbation, it is necessary to filter cells with invalid edits.
Cell_filtering(expression_profile, perturb_information, cpu_num = 4, cell_num_threshold = 30, umi = 0.01, pvalue = 0.05, vargene_min_num = 5, filtered_rate = 0.9, plot = FALSE, plot_path = "./invalid_rate.pdf")
expression_profile |
A dataframe showing the expression profile after performing the function of "Cell_qc()" and "Data_imputation()". |
perturb_information |
A character vector showing the information of sample after performing the function of "Cell_qc()" and "Data_imputation()". |
cpu_num |
The cpu number for parallel computation. The default is 4. Parallel computation is strongly recommeneded to use because this step may take long time without parallel computation. |
cell_num_threshold |
A cutoff, the minimal perturbed cell number for each perturbation. The default is 30. |
umi |
The cutoff of average umi to select the differentially expressed genes. The default is 0.01. |
pvalue |
The p value to select the differentially expressed genes. The default is 0.05. |
vargene_min_num |
The minimal number of differentially expressed genes. The default is 5. For a perturbation, if the number of differentially expressed genes are less than 5, this perturbation will be filtered directory. |
filtered_rate |
The default is 0.9. For a specific perturbation, if the influenced cells filtered amount to 90% or higher among all, then such a perturbation was filtered. |
plot |
FALSE by default. If TRUE, plot the graph to show the ratio of filtered cells for each perturbation. |
plot_path |
The path of the graph you plot. It works only when the parameter "plot" is TRUE. |
expression_profile |
The expression profile after performing these filtering steps. |
perturb_information |
The information (perturbation names and sample names) of cells retained after performing these filtering steps. |
perturb_information_abandon |
The information (perturbation names and sample names) abandoned after performing these filtering steps. |
filter_record |
The summary of filtering by these steps. |
zero_rate |
The proportion of zero expression value in all cells for each perturbation. |
Bin Duan
1. Sternberg, S.H. & Doudna, J.A. Expanding the Biologist's Toolkit with CRISPR-Cas9. Mol Cell 58, 568-574 (2015). 2. Lappalainen, T. et al. Transcriptome and genome sequencing uncovers functional variation in humans. Nature 501, 506-511 (2013).
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.