calculateJaccard: Calculate Jaccard metric

View source: R/CNVMetricsOverlapInternalMethods.R

calculateJaccardR Documentation

Calculate Jaccard metric

Description

Calculate Jaccard metric using overlapping regions between two samples.

Usage

calculateJaccard(sample01, sample02)

Arguments

sample01

a GRanges which contains a collection of genomic ranges representing copy number events for the first sample.

sample02

a GRanges which contains a collection of genomic ranges representing copy number events for the second sample.

Details

The method calculates the Jaccard metric using overlapping regions between the samples. All regions present in both samples are used for the calculation of the metric.

The Jaccard metric is calculated by dividing the size of the intersection by the size of the union of the two sets. If the the size of the union of the two sets is zero; the value NA is returned instead. The strand of the regions is not taken into account while calculating the intersection.

Value

a numeric, the value of the Jaccard metric. If the metric cannot be calculated, NA is returned.

Author(s)

Astrid DeschĂȘnes

References

Jaccard, P. (1912), The Distribution of the Flora in the Alpine Zone. New Phytologist, 11: 37-50. DOI: https://doi.org/10.1111/j.1469-8137.1912.tb05611.x

Examples


## Load required package to generate the two samples
require(GenomicRanges)

## Generate two samples with identical sequence levels
sample01 <- GRanges(seqnames="chr1", 
    ranges=IRanges(start=c(1905048, 4554832, 31686841), 
    end=c(2004603, 4577608, 31695808)), strand="*")
sample02 <- GRanges(seqnames="chr1", 
    ranges=IRanges(start=c(1995066, 31611222), 
    end=c(2204505, 31689898)), strand="*")

## Calculate Sorensen metric    
CNVMetrics:::calculateJaccard(sample01=sample01, sample02=sample02)
    

adeschen/CNVMetrics documentation built on July 19, 2023, 10:24 p.m.