Milestone1.md

Section 1: Update

Now it is able to: in the "scripts" folder, there is a script to automatically download data and unzip in the "R" folder, final_project.R now can do simple data processing and comparing and output chart. I compare the portions of smokers and non-smokers among patients of pancreatic cancer and output a chart.

Section 2: Next Steps Once figured out how to use HTSeq-count files, I am going to compare genes of smoker and non-smoker.

Section 3: Data.

Data can be retrieved at here. These are clinicial and htseq-count files downloaded at GDC

Section 4: Known Issues.

Right now don't know how to use HTSeq-count files. I thought the file name is the case id number used by clinical and exposure data as I was planing comparing genes of smoker and non-smoker, but turns out these are totally unrelated.



XiaoyunZhouusc/final_project documentation built on Dec. 18, 2021, 7:23 p.m.