In VPetukhov/dropEstAnalysis: Analysis to produce figures from dropEst paper

Content by themes

Dataset annotations:
Origin of background cells in Human/mouse datasets:
Filtration of low-quality cells
UMI corrections:
1. Correction effect on 10x BMMCs dataset
2. UMI trimming on 10x AML035 Post-transplant dataset
Cell barcode merge validation
1. Merge of human/mouse mixtures
2. Number of molecules per cell
Runtimes
1. inDrop BMCs
2. inDrop ESCs

Content by figures

Some figures were created with Python code and they are not published here. Please, write personally to request the code.

Main figures

Figure 1. Skewed distribution of UMIs leads to increased number of UMI collisions.
- ~~A. Skewness of UMI distribution~~
- ~~B. Uneven nucleotide frequencies~~
- C. Estimated number of collisions
Figure 2. Comparison of UMI collision and sequencing error correction methods.
Figure 3. Correcting for Cellular Barcode errors.
- A. Origin of background cells
- ~~B. Number of equidistant adjacent CBs~~
- C. Increase in #molecules per cell
Figure 4. Selection of the optimal size threshold for 10x BMMCs dataset.
Figure 5. Filtration of low-quality cells for the 10x 8k PBMCs dataset.
Figure 6. Filtration of low-quality cells for the inDrop mouse BMCs dataset.

Supplementary figures

~~S1. Skewness of UMI distributions.~~
S2. Simulation of UMI collision frequencies
~~S3. Probability of observing adjacent UMIs in small genes.~~
~~S4. Recognition of UMI errors by base calling quality.~~
S5. Impact of non-uniform distribution on UMI collisions
S6. UMI collisions on trimmed data
S7. Magnitude of UMI correction
S8. Comparison of UMI correction algorithms on trimmed data
S9. Initial labeling of high-quality cells based on cell size distributions
S10. Human and mouse cell mixture dataset by 10x
S11. Robustness of different classifiers to training errors
S12. Annotation of the 10x Frozen BMMCs dataset
S13. Annotation of the 10x 8k PBMCs dataset
S14. Annotation of the inDrop BMCs dataset
S15. Classification of low- and high-quality cells on 10x data
S16. Comparison of the initial label assignments with the cell quality score predicted by the algorithm
S17. Classification of low- and high-quality cells on inDrop mouse pancreatic cells data

Content by tables

Main tables

Table 1. Analysis of merge targets on human/mouse mixture datasets
- 10x
- Drop-seq
Table 2. 5-fold CV results (mean ± sd)

Supplementary tables

VPetukhov/dropEstAnalysis documentation built on Dec. 28, 2019, 8:16 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Tweet to @rdrrHQ

GitHub issue tracker

ian@mutexlabs.com