View source: R/high_level_functions.R
isoformSwitchAnalysisPart2 | R Documentation |
This high-level function adds the results of the external sequence analyses supplied (if any) and then analyzes alternative splicing. Next, the functional consequences of the isoform switches are identified and isoform switch analysis plots are created for the top n isoform switches. Lastly, a plot summarizing the functional consequences is created. This function is meant to be used after isoformSwitchAnalysisPart1 has been used.
isoformSwitchAnalysisPart2(
    ### Core arguments
    switchAnalyzeRlist,
    ### External annotation arguments
    codingCutoff = NULL,
    removeNoncodinORFs,
    pathToCPATresultFile = NULL,
    pathToCPC2resultFile = NULL,
    pathToPFAMresultFile = NULL,
    pathToIUPred2AresultFile = NULL,
    pathToNetSurfP2resultFile = NULL,
    pathToSignalPresultFile = NULL,
    pathToDeepLoc2resultFile = NULL,
    pathToDeepTMHMMresultFile = NULL,
    ### Analysis and output arguments
    n = Inf,
    consequencesToAnalyze = c(
        'intron_retention',
        'coding_potential',
        'ORF_seq_similarity',
        'NMD_status',
        'domains_identified',
        'domain_isotype',
        'IDR_identified',
        'IDR_type',
        'signal_peptide_identified'
    ),
    pathToOutput = getwd(),
    fileType = 'pdf',
    outputPlots = TRUE,
    ### Other arguments
    quiet = FALSE
)
switchAnalyzeRlist |
The |
codingCutoff |
Numeric indicating the cutoff used by CPAT/CPC2 for distinguishing between coding and non-coding transcripts.
|
removeNoncodinORFs |
A logical indicating whether to remove ORF information from the isoforms which the CPAT analysis classifies as non-coding. This can be particularly useful if the isoform (and ORF) was predicted de novo, but is not recommended if ORFs were imported from a GTF file. This will affect all downstream analyses and plots, as the analysis of both domains and signal peptides requires that ORFs are annotated (e.g. analyzeSwitchConsequences will not consider the domains (potentially) found by Pfam if the ORF has been removed). |
pathToCPATresultFile |
Path to the CPAT result file. If the webserver is used please download the tab-delimited file from the bottom of the result page and give that as input, else simply supply the result file. See analyzeCPAT for details. |
pathToCPC2resultFile |
Path to the CPC2 result file. If the webserver is used please download the tab-delimited file from the bottom of the result page and give that as input, else simply supply the result file. See analyzeCPC2 for details. |
pathToPFAMresultFile |
A string indicating the full path to the Pfam result file(s). If multiple result files were created (multiple web-server runs) just supply all the paths as a vector of strings. If the webserver is used you need to copy-paste the result part of the mail you get into an empty plain text document (Notepad, Sublime Text, TextEdit or similar, i.e. not Word) and save that. See analyzePFAM for details. |
pathToIUPred2AresultFile |
A string indicating the full path to the IUPred2A result file. See analyzeIUPred2A for details. |
pathToNetSurfP2resultFile |
A string indicating the full path to the NetSurfP-2 result csv file. See analyzeNetSurfP3 for details. |
pathToSignalPresultFile |
A string indicating the full path to the SignalP result file(s). If multiple result files were created (multiple web-server runs) just supply all the paths as a vector of strings. If using the web-server, the results should be copy-pasted into an empty plain text document (Notepad, Sublime Text, TextEdit or similar, i.e. not Word) and saved. See analyzeSignalP for details. |
pathToDeepLoc2resultFile |
A string indicating the full path to the DeepLoc2 result file(s). If multiple result files were created (multiple web-server runs) just supply all the paths as a vector of strings. See |
pathToDeepTMHMMresultFile |
A string indicating the full path to the DeepTMHMM result file. Can be gzipped. If multiple result files were created (multiple web-server runs) just supply all the paths as a vector of strings. |
n |
The number of top genes (after filtering and sorted according to |
consequencesToAnalyze |
A vector of strings indicating what type of functional consequences to analyze. Do note that there are bound to be some differences between transcripts (else they would be identical). See details in analyzeSwitchConsequences for the full list of usable strings and their meaning. Default is c('intron_retention','coding_potential','ORF_seq_similarity','NMD_status','domains_identified','domain_isotype','IDR_identified','IDR_type','signal_peptide_identified') (corresponding to analyzing: intron retention, coding potential (CPAT/CPC2 result), ORF AA sequence similarity, NMD status, Pfam domains annotated, domain isotype, IDRs annotated, IDR type and signal peptides annotated by SignalP). |
pathToOutput |
A path to the folder in which the plots should be made. Default is working directory ( getwd() ). |
fileType |
A string indicating which file type is generated. Available options are 'pdf' and 'png'. Default is 'pdf'. |
outputPlots |
A logical indicating whether all isoform switches as well as the summary of functional consequences should be saved in the directory specified by |
quiet |
A logical indicating whether to avoid printing progress messages (incl. progress bar). Default is FALSE. |
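Several of the path arguments above accept a vector of paths when multiple web-server runs were made. The following is a minimal sketch, in base R only, of checking such a vector before passing it on. Note that checkResultFiles is a hypothetical helper and not part of IsoformSwitchAnalyzeR, and the two temporary files are empty placeholders rather than real Pfam output.

```r
# Hypothetical helper (NOT part of IsoformSwitchAnalyzeR): stop early if any
# of the supplied result files is missing, otherwise return absolute paths.
checkResultFiles <- function(paths) {
    missingFiles <- paths[!file.exists(paths)]
    if (length(missingFiles) > 0) {
        stop("Result file(s) not found: ", paste(missingFiles, collapse = ", "))
    }
    normalizePath(paths)
}

# Stand-ins for the result files of two separate web-server runs
# (empty placeholder files created in the session's temporary directory)
pfamRuns <- file.path(tempdir(), c("pfam_run1.txt", "pfam_run2.txt"))
file.create(pfamRuns)

# A character vector with one path per run
pfamPaths <- checkResultFiles(pfamRuns)
```

With real result files, the checked vector could then be supplied as pathToPFAMresultFile = pfamPaths.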
This function performs the second part of an isoform switch analysis workflow by:
Adding external sequence analysis (see analyzeCPAT, analyzeCPC2, analyzePFAM and analyzeSignalP)
Predicting functional consequences of switching (see analyzeSwitchConsequences)
Outputting isoform switch consequence plots for all genes where there is a significant isoform switch (see switchPlot)
Outputting a visualization of the general consequences of isoform switches.
This function
Returns the supplied switchAnalyzeRlist, now annotated with all the analyses described above
Generates one folder per comparison of conditions containing the isoform switch analysis plot of all genes with significant isoform switches
Saves 3 plots summarizing the overall consequences of all the isoform switches.
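As an illustration of working with the returned object, the sketch below summarizes an annotated switchAnalyzeRlist with extractSwitchSummary (listed under the related functions). It assumes the IsoformSwitchAnalyzeR package is installed and uses the packaged example object exampleSwitchListAnalyzed as a stand-in for the output of this function. The requireNamespace guard is only there so the snippet does nothing when the package is unavailable.

```r
# Sketch only: summarize an already annotated switchAnalyzeRlist.
# switchSummary stays NULL if IsoformSwitchAnalyzeR is not installed.
switchSummary <- NULL
if (requireNamespace("IsoformSwitchAnalyzeR", quietly = TRUE)) {
    library(IsoformSwitchAnalyzeR)

    # Packaged example object, used here in place of real Part 2 output
    data("exampleSwitchListAnalyzed")

    # Count switching isoforms and genes per comparison, keeping only
    # switches with predicted functional consequences
    switchSummary <- extractSwitchSummary(
        exampleSwitchListAnalyzed,
        filterForConsequences = TRUE
    )
    print(switchSummary)
}
```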
Kristoffer Vitting-Seerup
Vitting-Seerup et al. The Landscape of Isoform Switches in Human Cancers. Mol. Cancer Res. (2017).
analyzeCPAT
analyzeCPC2
analyzeIUPred2A
analyzeNetSurfP3
analyzePFAM
analyzeSignalP
analyzeAlternativeSplicing
extractSwitchSummary
analyzeSwitchConsequences
switchPlotTopSwitches
### Please note
# The way of importing files in the following example with
# system.file("pathToFile", package = "IsoformSwitchAnalyzeR") is a
# specialized way of accessing the example data in the IsoformSwitchAnalyzeR package
# and not something you need to do - just supply the string e.g.
# "/path/to/externalAnalysis/toolResult.txt" pointing to the result file.
### Load example data
data("exampleSwitchListIntermediary")
### Subset for quick runtime
exampleSwitchListIntermediary <- subsetSwitchAnalyzeRlist(
    exampleSwitchListIntermediary,
    abs(exampleSwitchListIntermediary$isoformFeatures$dIF) > 0.4
)
### Run part 2
exampleSwitchListAnalyzed <- isoformSwitchAnalysisPart2(
    switchAnalyzeRlist = exampleSwitchListIntermediary,
    pathToCPC2resultFile = system.file("extdata/cpc2_result.txt", package = "IsoformSwitchAnalyzeR"),
    pathToPFAMresultFile = system.file("extdata/pfam_results.txt", package = "IsoformSwitchAnalyzeR"),
    pathToIUPred2AresultFile = system.file("extdata/iupred2a_result.txt.gz", package = "IsoformSwitchAnalyzeR"),
    pathToSignalPresultFile = system.file("extdata/signalP_results.txt", package = "IsoformSwitchAnalyzeR"),
    pathToDeepLoc2resultFile = system.file("extdata/deeploc2.csv", package = "IsoformSwitchAnalyzeR"),
    pathToDeepTMHMMresultFile = system.file("extdata/DeepTMHMM.gff3", package = "IsoformSwitchAnalyzeR"),
    codingCutoff = 0.5,        # since we are using CPC2
    removeNoncodinORFs = TRUE, # because the ORFs were predicted de novo
    outputPlots = FALSE        # keeps the function from outputting the plots from this example code
)