The human genome reference used here is RefSeq transcripts in version hg19 from UCSC Genome Browser. The transcripts with NM marker ID, which are protein-codeing, were selected to be our reference database and provided as hg19DBNM.rda.
A data frame with 39997 rows and 7 variables:
RefSeq name with its corrsponding gene symbol
1-22, X and Y
starting position, in basepair number
ending position, in basepair number
positive or negative strand, in + or - symbols
This reference provides region information, including chromosome number, starting position, ending position, strand and gene symbols, for converting copy number alteration data into human genes.
UCSC Genome Browser: http://hgdownload.cse.ucsc.edu/downloads.html