Description Usage Arguments Value Note See Also Examples
View source: R/hiReadsProcessor.R
Given a group of values or genomic positions per read/clone, the function tries to yield a unique OTU (operation taxinomical unit) ID for the collection based on overlap of locations to other reads/clones by grouping. This is mainly useful when each read has many locations which needs to be considered as one single group of sites.
1 2 |
posID |
a vector of groupings for the value parameter (i.e. Chr,strand). Required if psl.rd parameter is not defined. |
value |
a vector of integer locations/positions that needs to be binned, i.e. genomic location. Required if psl.rd parameter is not defined. |
readID |
a vector of read/clone names which is unique to each row, i.e. deflines. |
grouping |
additional vector of grouping of length posID or psl.rd by which to pool the rows (i.e. samplenames). Default is NULL. |
psl.rd |
a GRanges object returned from |
maxgap |
max distance allowed between two non-overlapping position to trigger the merging. Default is 5. |
parallel |
use parallel backend to perform calculation with
|
a data frame with binned values and otuID shown alongside the original input. If psl.rd parameter is defined, then a GRanges object.
The algorithm for making OTUs of sites is as follows:
for each grouping & posID, fix values off by maxgap parameter
create bins of fixed values per readID
assign arbitrary numeric ID to each distinct bins above & obtain its frequency
perform overlap b/w readIDs with only one value (singletons) to readIDs with >1 value (non-singletons)
- for any overlapping values, tag non-singleton readID with the ID of singleton readID
- if non-singleton readID matched with more than one singleton readID, then pick on at random
for any non-tagged & non-singleton readIDs, perform an overlap of values within themselves using the maxgap parameter
- tag any overlapping positions across any readID with the ID of most frequently occuring bin
positions with no overlap are left as is with the original arbitrary ID
clusterSites
, isuSites
,
crossOverCheck
, findIntegrations
,
getIntegrationSites
, pslToRangedObject
1 2 3 4 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.