| utils | R Documentation |
utils Convience functions for genespace, not meant to be called
directly by the user. Little documentation support provided, use at your own
risk.
.onAttach startup messages
check_integer Checks and parses integer arguments to GENESPACE
functions. Replaces values of x not in range (min, max) with the minimum or
maximum values. If a single value and a value that is not coercible to an
integer is specified, returns the default value. If na.rm = TRUE and
onlySingleValue = FALSE, drops NAs.
from the vector
check_numeric See check_integer. Same but for numeric values.
check_character See check_integer. Same but for character values.
check_logical See check_integer. Same but for logical values.
check_filePathParam QC of user-specified parameter
check_onlyDNA QC to ensure the peptides are actually peptides
read_aaFasta read fasta-formatted peptide sequences
get_nAA count the number of amino acids by gene
read_bed read and check a raw bed file with four columns.
align_charLeft for a vector of character strings, add " " to the right
side so they all align to the left when printed
align_charRight for a vector of character strings, add " " to the left
side so they all align to the right when printed
read_orthofinderSpeciesIDs Parses the SpeciesIDs.txt file into a
data.table and returns to R.
read_orthofinderSequenceIDs Reads the sequence
IDs:gene name dictionary into memory.
get_nSeqs Counts the number of lines with ">" in a file. If the output
is not convertible to an integer, returns NA.
check_annotFiles ensure the annotations match correctly
check_MCScanXhInstall check that MCScanX_h can be called
parse_ogs read and parse orthofinder orthogroups.tsv files
parse_hogs read and parse orthofinder phylogenetically hierarchical
orthogroup (N0.tsv) files
parse_orthologues read and parse orthofinder orthologs
round_toInteger flexible rounding to any integer.
add_rle run-length equivalent conversion, either as the length of the
runs or the unique run ids.
gs_colors get a set of colors from the genespace palette
clus_igraph cluster connected subgraphs from pairwise observations
are_colors check if a vector is coercible to R colors
scale_between scale a vector between a range
read_combBed ensures consistent combBed IO
write_combBed ensures consistent combBed IO
read_allBlast ensures consistent allBlast IO
write_allBlast ensures consistent allBlast IO
read_synHits ensures consistent synHit IO
write_synHits ensures consistent synHit IO
add_alpha add transparency to a color
read_intSynPos utility to read interpolated syntenic position files
write_intSynPos utility to write interpolated syntenic position files
get_orthofinderVersion Checks that orthofinder is installed and if so,
returns the installed version.
get_diamondVersion Checks that DIAMOND is installed and if so,
returns the installed version.
theme_genespace specifies publication-style themes. Col here is the
color of the panel.background.
download_exampleData downloads chicken and human annotations from NCBI
interp_approx use approx to interpolate missing positions based on
the positions of the bounder x/y coordinates.
read_refGenomeSynHits read in all syntenic hits files involving a
single reference genome and, where necessary, invert the hits so that the
reference genome is always the query (genome1).
read_refGenomeAllBlast read in all hits files involving a
single reference genome and, where necessary, invert the hits so that the
reference genome is always the query (genome1).
get_bedInBlk splits the bed file so that two entries (query and target
) match the physical bounds of blocks in the hits object.
add_array2bed add array ID to the bed file
add_arrayReps2bed add array representative genes to the combined bed
object
write_pangenes utility to correctly write in long-formatted pan=gene
text files.
read_pangenes utility to correctly read in long-formatted pangene text
files.
find_nnHit find anchor hits that are nearest to non-anchor xy position
flag_hitsInRadius given a vector of anchors, pulls xy positions within
radius of anchors using dbscan
flag_hitsInBlk finds hits within the bounding coordinates of blocks
get_orderedTips Respect ordering of tree when ladderized
pull_pairwise Builds new pairwise files in /syntenicHits
.onAttach(...)
check_integer(
x,
min = -Inf,
max = Inf,
default = NA,
na.rm = FALSE,
onlySingleValue = length(x) <= 1
)
check_numeric(
x,
min = -Inf,
max = Inf,
default = NA,
na.rm = FALSE,
onlySingleValue = length(x) <= 1
)
check_character(
x,
default = NULL,
na.rm = FALSE,
onlySingleValue = length(x) <= 1
)
check_logical(x, default = NA, na.rm = FALSE, onlySingleValue = length(x) <= 1)
check_filePathParam(filepath)
check_onlyDNA(path)
read_aaFasta(path)
get_nAA(path)
read_bed(filepath)
align_charLeft(x)
align_charRight(x)
read_orthofinderSpeciesIDs(filepath)
read_orthofinderSequenceIDs(filepath)
get_nSeqs(filepath)
check_annotFiles(filepath, genomeIDs)
check_MCScanXhInstall(filepath)
parse_ogs(filepath, genomeIDs)
parse_hogs(filepath)
parse_orthologues(filepath)
round_toInteger(x, to)
add_rle(x, which = "n")
gs_colors(n = 10)
clus_igraph(id1, id2)
are_colors(col)
scale_between(x, min, max, scale1toMean = TRUE)
read_combBed(filepath)
write_combBed(x, filepath)
read_allBlast(filepath, ...)
write_allBlast(x, filepath)
read_synHits(filepath, ...)
write_synHits(x, filepath)
add_alpha(col, alpha = 1)
read_intSynPos(filepath)
write_intSynPos(x, filepath)
get_orthofinderVersion(filepath)
get_diamondVersion(filepath)
theme_genespace(col = "black")
download_exampleData(filepath)
interp_approx(x, y)
read_refGenomeSynHits(gsParam, refGenome)
read_refGenomeAllBlast(gsParam, refGenome)
get_bedInBlk(hits, bed)
add_array2bed(bed, synBuff, maxIter = 10, reorder = TRUE)
add_arrayReps2bed(bed)
write_pangenes(x, filepath)
read_pangenes(x, filepath, ...)
find_nnHit(x, y, isAnchor, radius)
flag_hitsInRadius(x, y, isAnchor, radius)
flag_hitsInBlk(x, y, blkID)
get_orderedTips(treFile, ladderize = TRUE, genomeIDs)
pull_pairwise(gsParam, verbose)
... |
additional parameters passed on to other functions
|
x |
single-value parameter, string, integer, numeric, list, vector |
min |
if x is an integer or numeric, the minimum value allowed |
max |
if x is an integer or numeric, the minimum value allowed |
default |
if there is a problem with x, replace with this value |
na.rm |
logical, should NA's be dropped |
onlySingleValue |
logical, should long a single valuebe returned? |
filepath |
file.path |
path |
file path character string |
genomeIDs |
character vector of genomeIDs |
to |
integer, top end value |
which |
character specifying what to use |
n |
integer, number of observations |
id1 |
character, first id |
id2 |
character, second id |
col |
color |
scale1toMean |
logical, should values be scaled to 1? |
alpha |
numeric, transparency |
y |
numeric, y values |
gsParam |
genespace parameters, see init_genespace. |
refGenome |
character string specifying the reference genome |
hits |
data.table of syntenic hits |
bed |
data.table containing the combined bed object |
synBuff |
see init_genespace. |
maxIter |
integer, the maximum number of iterations to use |
reorder |
logical, should the gene rank position be re-ordered? |
isAnchor |
logical, is a hit an anchor? |
radius |
numeric, the 2d search radius. |
blkID |
vector of block IDs |
treFile |
file.path to the tree file. |
ladderize |
logical, should the tree be ladderized? |
verbose |
logical, should updates be printed to the console? |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.