View source: R/parsePhyloProfile.R
parseInfoProfile | R Documentation |
Creating main dataframe for the input phylogenetic profiles based on selected input taxonomy level (e.g. strain, species) and reference taxon. The output contains the number of paralogs, the max/min/mean/median of VAR1 and VAR2.
parseInfoProfile(inputDf, sortedInputTaxa, taxaCount, coorthoCOMax)
inputDf |
input profiles in long format |
sortedInputTaxa |
sorted taxonomy data for the input taxa (check sortInputTaxa()) |
taxaCount |
dataframe counting present taxa in each supertaxon |
coorthoCOMax |
maximum number of co-orthologs allowed |
A dataframe contains all info for the input phylogenetic profiles. This full processed profile that is required for several profiling analyses e.g. estimation of gene age (?estimateGeneAge) or identification of core gene (?getCoreGene).
Vinh Tran tran@bio.uni-frankfurt.de
createLongMatrix
, sortInputTaxa
,
calcPresSpec
, mainLongRaw
library(dplyr)
data("mainLongRaw", package="PhyloProfile")
taxonIDs <- getInputTaxaID(mainLongRaw)
sortedInputTaxa <- sortInputTaxa(
taxonIDs, "class", "Mammalia", NULL, NULL
)
taxaCount <- sortedInputTaxa %>% dplyr::group_by(supertaxon) %>%
summarise(n = n(), .groups = "drop")
coorthoCOMax <- 999
parseInfoProfile(
mainLongRaw, sortedInputTaxa, taxaCount, coorthoCOMax
)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.