Description Usage Arguments Details Value Note Author(s) Examples
homolog protein Uniprot ID match
1 2 |
data |
dataset of protein information.Column Names should contain "ori.ID" and "ENTRY.NAME". "ori.ID" is Uniprot ID |
db1.path |
fasta file, database of transfered species |
db2.path |
fasta file, database of original species |
out.folder |
blast result output folder, the folder path should be the same with db1.path |
blast.path |
blast+ software install path |
evalue |
blast threshold, the lower means more rigorous |
verbose |
integer level of verbosity. Zero means silent, 1 means have Diagnostic Messages. |
homolog protein Uniprot ID match is based on the ENTRY.NAME, gene name and sequence homophyly in two different species or different version of database.
a data.frame included 4 columns: ori.ID, ENTRY.NAME, new.ID, match.type.
This function should install blast+ software, Version 2.7.1. blast+ download website:ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/ db1.path, db2.path, out.folder are both need the complete path. Out.folder and db1.path should be in the same folder. Path should have no special character. data should have colname: ori.ID, ENTRY.NAME.
Kefu Liu
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 | rm(list = ls())
library(DDPNA)
#database file should be downloaded and extracted the rar file in C Disk.
#blast+ software should be downloaded and installed in C Disk.
#the example file should be downloaded and decompress in C Disk.
data(Sample_ID_data)
# need to install blast+ software
out.folder = "C:/DDPNA_test/";
blast.path = "C:/blast/ncbi-blast-2.7.1+/bin/";
if(file.exists(out.folder) & dir.exists(blast.path))
data <- ID_match(ID_data,
db1.path = "C:/DDPNA_test/new.fasta",
db2.path = "C:/DDPNA_test/ori.fasta",
out.folder = out.folder,
blast.path = blast.path,
evalue = 0.1, verbose = 1)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.