knitr::opts_chunk$set(echo = TRUE)

Instruction to generate a COSMIC dataset for filtering function of the TMBtool package

COSMIC datasets can no longer be download without a registered account. Moreover their consistent size makes it more suitable as an external source to be downloaded seperately. Here we provide the instructure to download and format the file for usage with the TMBleR package.

Cosmic data is required for one of the mutation filters that removes known "cancer" mutation. This is achieved by setting remove.cancer = TRUE when calling applyFilter(). If you are planning to use the remove.cancer filter, please follow the intructions to retrieve the COSMIC data and import it into TMBleR.

Download COSMIC database:

If genome GRCh37

gzcat CosmicCodingMuts.vcf.gz > CosmicCodingMuts_hg19.vcf

If genome GRCh38

gzcat CosmicCodingMuts.vcf.gz > CosmicCodingMuts_hg38.vcf

Add dataset to TMBleR:

formatCOSMIC( input_file = "~/Downloads/CosmicCodingMuts.vcf"
              , "hg19"
              , output_file = "~/Downloads/COSMIC_hg19")

load the file into the memory using the load() function

load("~/Downloads/COSMIC_hg19")

COSMIC_hg19 should now appear in the environment ls().



acc-bioinfo/TMBleR documentation built on Dec. 18, 2021, 10:21 p.m.