QC.GUI: Generate interactive plot for Quality Control of a dataset

Description Usage Arguments Value Note Author(s) Examples

Description

A Shiny, Plotly and Web Brower based analysis interface. QC.GUI() is aimed to provide mdsplot, densityPlot, Type-I&Type-II densityplot, dendrogram(no interactable) and heatmap for top 1000 variale CpGs. In the first tab,mdsplot are plotted based on the distance calculated by top 1000 variable CpGs. For dendrogram, if there are only less than 10 samples, the distance between samples are calculated by all CpGs, if there are more than 10 samples, QC.GUI() would apply SVD doconvolution on the dataset first then extract top significant components as latent variabls and calculate distance between samples. For the heatmap, if your dataset contains less than 1000 CpGs, all CpGs would be plotted, but if your dataset contains more than 1000 CpGs, the top 1000 variable CpGs would be selected and plot.

Usage

1
2
3
    QC.GUI(beta=myLoad$beta,
           pheno=myLoad$pd$Sample_Group,
           arraytype="450K")

Arguments

beta

A matrix of values representing the methylation scores for each sample (M or B). Better to be imputed and normalized data. (default = myNorm)

pheno

This is a categorical vector representing phenotype of factor wish to be analysed, for example "Cancer", "Normal"... Tow or even more phenotypes are allowed. (default = myLoad$pd$Sample_Group)

arraytype

Choose microarray type is 450K or EPIC. (default = "450K")

Value

Totally five tabs would be generated on opened webpage.

mdsplot

A mdsplot used to see the clustering result and similarity between sampels.

TypeDensity

A two-line density Plot indicate Type-I CpGs and Type-II CpGs.

QCplot

Beta distribution of each sample. You may use it to check samples with low qualities.

Dendrogram

Dendrogram of all samples. If there are only less than 10 samples, the distance between samples are calculated by all CpGs, if there are more than 10 samples, QC.GUI() would apply SVD doconvolution on the dataset first then extract top significant components as latent variabls and calculate distance between samples.

heatmap

Heatmap for top 1000 variale CpGs.

Note

Please make sure you are running R locally or connected with local graph software(X11) remotely.

Author(s)

Yuan Tian

Examples

1
2
3
4
5
    ## Not run: 
        myLoad <- champ.load(directory=system.file("extdata",package="ChAMPdata"))
        QC.GUI()
        
## End(Not run)

ucl-medical-genomics/ChAMP documentation built on June 26, 2019, 12:11 a.m.