| PgR6MS | R Documentation |
PgR6 with Methods and Sequences. Final users should use pagoo
instead of this, since is more easy to understand.
Inherits: PgR6M
pagoo::PgR6 -> pagoo::PgR6M -> PgR6MS
sequencesA DNAStringSetList with the
set of sequences grouped by cluster. Each group is accessible as were a list. All
Biostrings methods are available.
core_sequencesLike $sequences, but only showing core
sequences.
cloud_sequencesLike $sequences, but only showing cloud
sequences as defined above.
shell_sequencesLike $sequences, but only showing shell
sequences, as defined above.
pagoo::PgR6$add_metadata()pagoo::PgR6$drop()pagoo::PgR6$recover()pagoo::PgR6$save_pangenomeRDS()pagoo::PgR6$write_pangenome()pagoo::PgR6M$cg_exp_decay_fit()pagoo::PgR6M$dist()pagoo::PgR6M$gg_barplot()pagoo::PgR6M$gg_binmap()pagoo::PgR6M$gg_curves()pagoo::PgR6M$gg_dist()pagoo::PgR6M$gg_pca()pagoo::PgR6M$gg_pie()pagoo::PgR6M$pan_pca()pagoo::PgR6M$pg_power_law_fit()pagoo::PgR6M$rarefact()pagoo::PgR6M$runShinyApp()new()Create a PgR6MS object.
PgR6MS$new( data, org_meta, cluster_meta, core_level = 95, sep = "__", DF, group_meta, sequences, verbose = TRUE )
dataA data.frame or DataFrame containing at least the
following columns: gene (gene name), org (organism name to which the gene belongs to),
and cluster (group of orthologous to which the gene belongs to). More columns can be added as metadata
for each gene.
org_meta(optional) A data.frame or DataFrame
containing additional metadata for organisms. This data.frame must have a column named "org" with
valid organisms names (that is, they should match with those provided in data, column org), and
additional columns will be used as metadata. Each row should correspond to each organism.
cluster_meta(optional) A data.frame or DataFrame
containing additional metadata for clusters. This data.frame must have a column named "cluster" with
valid organisms names (that is, they should match with those provided in data, column cluster), and
additional columns will be used as metadata. Each row should correspond to each cluster.
core_levelThe initial core_level (that's the percentage of organisms a core cluster must be in to be
considered as part of the core genome). Must be a number between 100 and 85, (default: 95). You can change it
later by using the $core_level field once the object was created.
sepA separator. By default is '__'(two underscores). It will be used to
create a unique gid (gene identifier) for each gene. gids are created by pasting
org to gene, separated by sep.
DFDeprecated. Use data instead.
group_metaDeprecated. Use cluster_meta instead.
sequencesCan accept: 1) a named list of named character vector. Name of list
are names of organisms, names of character vector are gene names; or 2) a named list of DNAStringSetList
objects (same requirements as (1), but with BStringSet names as gene names); or 3) a DNAStringSetList
(same requirements as (2) but DNAStringSetList names are organisms names).
verboselogical. Whether to display progress messages when loading class.
An R6 object of class PgR6MS. It contains basic fields and methods for analyzing a pangenome. It also contains additional statistical methods for analyze it, methods to make basic exploratory plots, and methods for sequence manipulation.
core_seqs_4_phylo()A field for obtaining core gene sequences is available (see below), but for creating a phylogeny with this sets is useful to: 1) have the possibility of extracting just one sequence of each organism on each cluster, in case paralogues are present, and 2) filling gaps with empty sequences in case the core_level was set below 100%, allowing more genes (some not in 100% of organisms) to be incorporated to the phylogeny. That is the purpose of this special function.
PgR6MS$core_seqs_4_phylo(max_per_org = 1, fill = TRUE)
max_per_orgMaximum number of sequences of each organism to be taken from each cluster.
filllogical. If fill DNAStringSet with empty DNAString in cases where
core_level is set below 100%, and some clusters with missing organisms are also considered.
A DNAStringSetList with core genes. Order of organisms on each cluster is conserved, so it is easier
to concatenate them into a super-gene suitable for phylogenetic inference.
clone()The objects of this class are cloneable with this method.
PgR6MS$clone(deep = FALSE)
deepWhether to make a deep clone.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.