| tcga | R Documentation |
A subset of the publicly available NIH's the Cancer Genome Atlas (TCGA)
whole exome sequencing data containing non-synonymous SNV mutations.
More specifically,
the subset of the TCGA data with
Variant_Type == "SNP" and tumors from the following 10 cancer sites:
BREAST, COLORECTAL, ESOPHAGEAL, KIDNEY, LIVER,
LUNG, OVARIAN, PANCREATIC, PROSTATE, and SKIN
tcga
A dataset with 1,395,665 rows and 14 columns:
the patient (tumor) label. Obtained by extracting the first 16 characters of the column'Tumor_Sample_Barcode' in the original TCGA data.
the gene label
Chromosome label
the variant label. Obtained by concatenating the columns labeled 'Hugo_Symbol', "HGVSp_Short', 'Chromosome', 'Start_Position', 'Tumor_Seq_Allele1', and 'Tumor_Seq_Allele2' in the original TCGA data
Start Position of alteration on the chromosome
End Position of alteration on the chromosome
.
.
.
Protein code of alteration
Type of alteration
.
Histological subtype for each tumor
Cancer site of origin of each tumor
https://github.com/cBioPortal/datahub
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.