merge | R Documentation |
Wrapper to GMQL JOIN operator
It takes in input two datasets, respectively known as anchor (left) and experiment (right) and returns a dataset of samples consisting of regions extracted from the operands according to the specified conditions (a.k.a genometric_predicate and region_attribute predicate). The number of generated output samples is the Cartesian product of the number of samples in the anchor and in the experiment dataset (if joinBy is not specified). The output metadata are the union of the input metadata, with their attribute names prefixed with left or right dataset name, respectively.
## S4 method for signature 'GMQLDataset,GMQLDataset'
merge(
x,
y,
genometric_predicate = NULL,
region_output = "CAT",
joinBy = conds(),
reg_attr = c("")
)
x |
GMQLDataset class object |
y |
GMQLDataset class object |
genometric_predicate |
it is a list of DISTAL objects.
For details of DISTAL objects see:
|
region_output |
single string that declares which region is given in output for each input pair of left dataset and right dataset regions satisfying the genometric predicate and/or the region attribute predicate:
|
joinBy |
|
reg_attr |
vector of strings made up by region field attribute names, whose values in the paired left and right dataset regions must be equal in order to consider the two paired regions. If specified, region_output cannot be INT or CAT. |
GMQLDataset object. It contains the value to use as input for the subsequent GMQLDataset method
## This statement initializes and runs the GMQL server for local execution
## and creation of results on disk. Then, with system.file() it defines
## the path to the folders "DATASET" and "DATASET_GDM" in the subdirectory
## "example" of the package "RGMQL" and opens such folders as a GMQL
## datasets named TSS and HM, respectively, using CustomParser
init_gmql()
test_path <- system.file("example", "DATASET", package = "RGMQL")
test_path2 <- system.file("example", "DATASET_GDM", package = "RGMQL")
TSS = read_gmql(test_path)
HM = read_gmql(test_path2)
## Given a dataset HM and one called TSS with a sample including
## Transcription Start Site annotations, this statement searches for those
## regions of HM that are at a minimal distance from a transcription start
## site (TSS) and takes the first/closest one for each TSS, provided that
## such distance is lesser than 120K bases and joined TSS and HM
## samples are obtained from the same provider (joinby clause).
join_data = merge(TSS, HM, genometric_predicate = list(MD(1), DLE(120000)),
conds("provider"), region_output = "RIGHT")
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.