getFeature: Feature construction.

Description Usage Arguments Value Examples

View source: R/AStrap.R

Description

Extract features from the sequences around splice sites, A total of 511 features.

Usage

1
getFeature(ASdata)

Arguments

ASdata

A data frame with nine columns: donorSeq, acceptorSeq, seq, length, Ddown10, Ddown20, Aup10, Aup20, Aup30. donorSeq is the sequence of of the [-2, +3] region of donor sites; acceptorSeq is the sequence of the [-2, +3] region of acceptor site; seq is the sequence of the splice region; length is the length of the splice region; Ddown10 is the sequence of the downstream 10bp region of donor site; Ddown20 is the sequence of the downstream 20bp region of donor site; Aup10 is the sequence of the upstream 10bp region of acceptor site; Aup20 is the sequence of the upstream 20bp region of acceptor site; Aup30 is the sequence of the upstream 30bp region of acceptor site;

Value

A list of features.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
##Example 1: based on pairwise sequence alignment and transcript
##sequences.
##Loading pairwise alignment data
load(system.file("data","sample_Aligndata.Rdata",package = "AStrap"))
head(Aligndata)

##Loading transcript sequence
trSequence.path <- system.file("extdata","example_TRsequence.fasta",
                              package = "AStrap")
trSequence <-  readDNAStringSet(trSequence.path,format = "fasta")

##Extract sequence around splice site based on the transcript sequence
Aligndata <- extract_IsoSeq_tr(Aligndata,trSequence )
colnames(Aligndata)
head(Aligndata$Ddown10)
head(Aligndata$Aup10)
##Loading the consensus matrix of the sequences of the [-2,+3]
##region of acceptor sites
load(system.file("data","example_PWM_acceptor.Rdata",
                 package = "AStrap"))
##Loading the consensus matrix of the sequences of the [-2,+3]
##region of donor sites
load(system.file("data","example_PWM_donor.Rdata",
                 package = "AStrap"))
##Construction the feature space
feature <- getFeature(Aligndata)
ncol(feature)

##Example2: based on the alternative splicing database and genome.
path <- system.file("extdata","sample_riceAS.txt",package = "AStrap")
rice_ASdata <-read.table(path,sep="\t",head = TRUE,
                         stringsAsFactors = FALSE)
head(rice_ASdata)
##Loading geneome using the package of BSgenome
library("BSgenome.Osativa.MSU.MSU7")
rice_ASdata<- extract_IsoSeq_ge(rice_ASdata,Osativa)

BMILAB/AStrap documentation built on Nov. 20, 2020, 4:03 p.m.