View source: R/G4iM.Grinder.Funs.R
GiGList.Analysis | R Documentation |
GiGList.Analysis
produces a one-row summary of the results from a G4iMGrinder
output list (referred to as a “GiGList”). It aggregates counts/densities of putative quadruplex or i-Motif structures, applying optional filters on minimum score, frequency, and structure length. The resulting summary facilitates high-level comparisons across different genomes or experimental conditions.
GiGList.Analysis(
GiGList,
iden,
ScoreMin = c(20, 40),
FreqMin = 10,
LengthMin = 50,
Density = 100000,
byDensity = TRUE
)
GiGList |
|
iden |
|
ScoreMin |
|
FreqMin |
|
LengthMin |
|
Density |
|
byDensity |
|
Filters on score, frequency, and length can be applied simultaneously, producing separate result columns for each filter. Known quadruplex-forming or known NOT-to-form-quadruplex sequences (if included in the GiGList
) are also reported. Additionally, columns indicating the percentage of “unique” structures (frequency = 1) are computed as a measure of overall structural redundancy.
A one-row data.frame
summarizing the GiGList
, with potential columns such as:
Name |
Character, derived from the name assigned to |
iden |
Character, the |
Length |
Integer, total genomic length from the |
SeqG , SeqC |
Numeric, percentage of G or C in the analyzed sequence(s). |
nM2a , nM2b , nM3a , nM3b |
Count or density of structures found by Methods 2a, 2b, 3a, or 3b respectively (if those methods were used). |
...S{|x|} , ...F{|y|} , ...L{|z|} |
Suffixes denoting columns filtered by score ( |
...KTFQ , ...KNTFQ |
Suffixes denoting structures that match known-to-form or known-NOT-to-form quadruplex sequences. |
...UniqPercent |
Columns indicating the percentage of unique structures within each method. |
Config |
A character column summarizing the configuration of analysis or filter settings. |
Name of the G4iMGrinder result list, inherited from GiGList
.
Identification tag, matching iden
argument.
Total length of the sequence(s) as computed by G4iMGrinder
.
Percentages of G and C nucleotides in the genome.
Counts or densities of structures detected by each method (2a, 2b, 3a, 3b).
Suffixes appended for each filter: “S” for score, “F” for frequency, “L” for length. X indicates the numeric threshold (e.g., 20, 10, or 50).
Columns indicating, for each method, how many structures are known-to-form quadruplexes or known-NOT-to-form quadruplexes (counts or densities).
Percentage of unique results (frequency=1
) within each method.
A text string describing the filters and configurations used in the summary.
Column names depend on the analysis conditions (methods used, filter thresholds, etc.). Combining or binding multiple GiGList.Analysis
summaries requires consistent parameters. Results may not align if different G4iMGrinder settings are used.
Efres Belmonte-Reche
Belmonte-Reche, E. and Morales, J. C. (2019). G4-iM Grinder: when size and frequency matter. G-Quadruplex, i-Motif and higher order structure search and analysis tool. NAR Genomics and Bioinformatics, 2. DOI: 10.1093/nargab/lqz005
https://academic.oup.com/nargab/article/2/1/lqz005/5576141
G4iMGrinder
for generating the GiGList
, and other related utilities.
# Example usage:
# Suppose 'Rs' is a G4iMGrinder result list (GiGList) for a DNA G-quadruplex search
Rs <- G4iMGrinder(Name = "TestGenome", Sequence = "ACGT...")
# Summarize with default filters
Summary <- GiGList.Analysis(
GiGList = Rs,
iden = "Parasite"
)
print(Summary)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.