dissim_vector: Morisita-Horne Similarity Combination Function
In dlemas/microbes: tools for analyzing microbiome data

Description Usage Arguments Details Value Note Author(s) References See Also Examples

This function calculates the Morisita-Horne similarity between samples that meet the given requirements.

1	dissim_vector(otu.normed, meta, same, different)

`otu.normed`	Data frame of normalized OTU counts with study_id as the column names and OTU names as the row.names.
`meta`	Data Frame of metadata.
`same`	A vector containing the names of the metadata columns that are to be the same.
`different`	A vector containing the names of the metadata columns that are to be different.

no additional details

The returned output is a vector containing the Morisita-Horne Simmilarity between all sample pairs that meet the above restrictions.

no further notes

atomczik

http://www.ncbi.nlm.nih.gov/pubmed/19043404

no additional links

##---- Should be DIRECTLY executable !! ----
##-- ==>  Define data, use random,
##--	or do  help(data=index)  for the standard data sets.

## The function is currently defined as
function (otu.normed, meta, same, different) 
{
    w <- length(row.names(meta))
    fake.same <- "fake"
    fake.same[1:w] <- "same"
    fake.different <- "fake"
    fake.different[1:w] <- 1:w
    meta.f <- cbind(meta, fake.same, fake.different)
    same <- c(same, "fake.same")
    different <- c(different, "fake.different")
    otu.normed.t = setNames(data.frame(t(otu.normed[, ])), row.names(otu.normed))
    otu.normed.t$study_id <- row.names(otu.normed.t)
    merged.data <- merge(meta.f, otu.normed.t, by = "study_id")
    row.names(merged.data) <- merged.data$study_id
    n <- length(row.names(merged.data))
    m <- length(meta.f)
    p <- length(merged.data)
    q <- m + 1
    s.n <- length(same)
    d.n <- length(different)
    out <- "blank"
    out[1:n] <- "blank"
    for (i in 1:n) {
        sample.itself <- merged.data[i, q:p]
        df <- merged.data
        for (j in 1:s.n) {
            s = which(names(df) == same[j])
            idx = which(df[, s] == merged.data[i, s])
            df = df[idx, ]
        }
        for (k in 1:d.n) {
            d = which(names(df) == different[k])
            idx = which(!(df[, d] == merged.data[i, d]))
            df = df[idx, ]
        }
        df.subset <- df[, q:p]
        df.subset <- rbind(sample.itself, df.subset)
        dissim <- "blank"
        r <- length(row.names(df.subset))
        for (l in 1:r) {
            dissim[l] <- morisita.horn(sample.itself, df.subset[l, 
                ])
        }
        dissim <- dissim[-1]
        out[i] <- list(dissim)
        dissim.vector <- unlist(out)
        dissim.vector <- suppressWarnings(as.numeric(dissim.vector))
        dissim.vector <- as.numeric(sort(dissim.vector))
        dissim.vector <- as.numeric(dissim.vector[c(TRUE, FALSE)])
    }
    return(dissim.vector)
  }