| run_phobius | R Documentation |
This function creates a data frame with columns containing transcript IDs and corresponding output from Phobius. The Phobius output includes a location for each amino acid, with O representing extracellular, M representing transmembrane, S representing signal, and i representing intracellular.
run_phobius(AA_seq, fasta_file_name)
AA_seq |
A data frame outputted by the get_pairs function containing the gene names, transcript IDs, APPRIS annotations, and protein sequences for each transcript. |
fasta_file_name |
Path to fasta file containing amino acid sequences |
A data frame containing each transcript ID and the corresponding membrane location for each amino acid in its sequence formatted as a string
In order for this function to work, there needs to be a .fasta file containing the amino acid sequences for each transcript called "AA.fasta" saved to the working directory. Additionally, the file saves a copy of the returned data frame in csv format to the working directory.
tmhmm_folder_name <- "~/TMHMM2.0c"
if (check_tmhmm_install(tmhmm_folder_name)) {
currwd <- getwd()
AA_seq <- get_pairs(system.file("extdata", "crb1_example.csv",
package = "surfaltr"
), TRUE, "mouse", TRUE)
topo <- run_phobius(AA_seq, paste(getwd(), "/AA.fasta", sep = ""))
setwd(currwd)
}
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.