Description Usage Format Details Source
A dataset containing alternative and reference allele read counts per cell and heterozygous variant, derived from a single-cell RNA-seq dataset of the lymphoblastoid cell-line of HapMap individual NA12878.
1 |
An acset with four elements, featdata, refcount, altcount and
phenodata, see new_acset
for a description of these elements.
The acset contains data for 2809 variants and 28 single cells.
Allele counts were generated by alignment of the RNA-seq data to the haplo-
genomes of NA12878 and subsequently running samtools mpileup
using variants called as heterozygous within the DNA-seq data of the
individual. Variants were filtered on being within RefSeq genes, in the dbSNP
database and successfully phased (via transmission from the parental genome
data). Variants were further filtered using the allele count data to not
monoallelically express the same allele across cells (see
filter_homovars
) and on having imbalanced allelic expression in
at least 3 cells (see filter_var_gt
. Features were filtered on
having at least two such variants (see filter_feat_nminvar
).
For further details on the generation of the allele count data see the
Supplemental Data in Edsgard et al, scphaser: Haplotype Inference Using
Single-Cell RNA-Seq Data, Bioinformatics, 2016.
RNA-seq fastq files ftp://ftp-trace.ncbi.nlm.nih.gov/sra/sra-instant/reads/ByStudy/sra/SRP/SRP018/SRP018838/
RNA-seq meta-infoftp://ftp.ncbi.nlm.nih.gov/geo/series/GSE44nnn/GSE44618
Haplo-genomes http://sv.gersteinlab.org/NA12878_diploid
DNA-seq variants http://sv.gersteinlab.org/NA12878_diploid/NA12878_diploid_2012_dec16/CEUTrio.HiSeq.WGS.b37.bestPractices.phased.hg19.vcf.gz
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.