predictTFBS | R Documentation |
Predicts the binding sites of studied transcription factors in a studied condition by applying a predictive model on potential binding sites located by a prior pattern-matching analysis and annotated with genomic features extracted at their location.
predictTFBS(
TFBSmodel,
TFBSdata,
studiedTFs = NULL,
show_annotations = FALSE,
score_threshold = 0.86
)
TFBSmodel |
A |
TFBSdata |
A named character vector as output by the |
studiedTFs |
A character vector setting the names of the studied transcription factors (for which the
location of the binding sites has to be predicted). This names have to match with one of the names of the
|
show_annotations |
A logical. Default = |
score_threshold |
A numeric (comprised between 0 and 1). Sets the minimum prediction score output by the
|
Remark: the features included in the datasets encoded in the files given by TFBSdata
have to correspond
to those integrated by the predictive model set by TFBSmodel
.
A data.table
listing the predicted binding sites. The 'TF
' column annotates the potential binding sites
with their cognate transcription factor. Additionally, the data.table
describes, for the potential
binding sites, the chromosomic coordinates, the closest transcript (relatively to the transcript start site) and the prediction score.
Optionally, the data.table
might also include the genomic features used to make the predictions.
importGenomicData()
for importing genomic data, buildTFBSmodel()
to train a predictive model of
transcription factor binding sites, and plotPredictions()
to vizualize the results for a given potential target gene.
genomic_data.ex <- c(CE = system.file("extdata/conserved_elements_example.bed", package = "Wimtrap"),
DGF = system.file("extdata/DGF_example.bed", package = "Wimtrap"),
DHS = system.file("extdata/DHS_example.bed", package = "Wimtrap"),
X5UTR = system.file("extdata/x5utr_example.bed", package = "Wimtrap"),
CDS = system.file("extdata/cds_example.bed", package = "Wimtrap"),
Intron = system.file("extdata/intron_example.bed", package = "Wimtrap"),
X3UTR = system.file("extdata/x3utr_example.bed", package = "Wimtrap")
)
imported_genomic_data.ex <- importGenomicData(biomart = FALSE,
genomic_data = genomic_data.ex,
tss = system.file("extdata/tss_example.bed", package = "Wimtrap"),
tts = system.file("extdata/tts_example.bed", package = "Wimtrap"))
TFBSdata.ex <- getTFBSdata(pfm = system.file("extdata/pfm_example.pfm", package = "Wimtrap"),
TFnames = c("PIF3", "TOC1"),
organism = NULL,
genome_sequence = system.file("extdata/genome_example.fa", package = "Wimtrap"),
imported_genomic_data = imported_genomic_data.ex)
TFBSmodel.ex <- buildTFBSmodel(TFBSdata = TFBSdata.ex,
ChIPpeaks = c(PIF3 = system.file("extdata/PIF3_example.bed", package = "Wimtrap"),
TOC1 = system.file("extdata/TOC1_example.bed", package = "Wimtrap")),
TFs_validation = "PIF3")
PIF3BS.predictions <- predictTFBS(TFBSmodel.ex,
TFBSdata.ex,
studiedTFs = "PIF3")
##To get the transcripts whose expression is potentially regulated by PIF3 do as follows:
PIF3_regulated.predictions <- as.character(PIF3BS.predictions$transcript[!duplicated(PIF3BS.predictions)])
###If you want to consider only the gene model,
###then do as follows:
PIF3_regulated.predictions <- unlist(strsplit(PIF3_regulated.predictions, "[.]"))[seq(1, 2*length(PIF3_regulated.predictions),2)]
PIF3_regulated.predictions <- PIF3_regulated.predictions[!duplicated(PIF3_regulated.predictions)]
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.