find_motifs | R Documentation |
Finding motifs in sequences.
This function searches for and simultaneously counts continuous and discontinuous motifs in a vector of sequences. It is used in the turbo_gliph function to identify local similarities.
find_motifs(seqs, q = 2:4, kmer_mindepth = NULL, discontinuous = FALSE)
seqs |
character vector. This vector must contain the sequences whose motifs are to be identified and quantified. |
q |
numeric vector. Motif lengths to search for. By default 2:4, i.e. motifs of length 2, 3 and 4. |
kmer_mindepth |
numeric. By default 3. Minimum number of times a kmer must be observed in the sample set for it to be considered for return. |
discontinuous |
logical. By default FALSE. |
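To make the roles of q and kmer_mindepth concrete, here is a minimal base R sketch of counting continuous motifs. It is not the package's implementation: count_kmers is a hypothetical helper, the column names motif and count are placeholders, and it assumes that every occurrence of a kmer counts as one observation (including repeats within one sequence). Discontinuous motifs are not covered.

```r
# Hypothetical helper, NOT part of the package: counts continuous kmers only.
# Assumption: each occurrence of a kmer counts as one observation.
count_kmers <- function(seqs, q = 2:4, kmer_mindepth = 3) {
  kmers <- as.character(unlist(lapply(q, function(k) {
    unlist(lapply(seqs, function(s) {
      n <- nchar(s)
      if (n < k) return(character(0))      # sequence too short for this length
      starts <- seq_len(n - k + 1)         # every possible start position
      substring(s, starts, starts + k - 1) # all windows of length k
    }))
  })))
  counts <- table(kmers)
  counts <- counts[counts >= kmer_mindepth] # drop rarely observed kmers
  data.frame(motif = as.character(names(counts)),
             count = as.integer(counts),
             stringsAsFactors = FALSE)
}

toy_seqs <- c("CASSL", "CASSF", "CASSY")   # made-up sequences
toy_res <- count_kmers(toy_seqs, q = 3, kmer_mindepth = 3)
print(toy_res)
```

With these toy sequences only the 3-mers shared by all three inputs reach the minimum depth, so the motifs that differ at the last position are filtered out.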
find_motifs returns a data frame with two columns. The first column contains the motifs and the second column contains the frequency of each motif.
utils::data("gliph_input_data")
sample_seqs <- base::as.character(gliph_input_data$CDR3b)
res <- find_motifs(seqs = sample_seqs)
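A result of this shape can be ranked by frequency with base R. The sketch below uses a toy data frame as a stand-in for a real result. Its values and the column names motif and count are placeholders rather than the package's actual output, so the columns are addressed by position.

```r
# Toy stand-in for a two-column result; values and column names are made up.
toy_motifs <- data.frame(motif = c("ASS", "CAS", "SSL"),
                         count = c(5L, 7L, 3L),
                         stringsAsFactors = FALSE)

# Sort by the second column (frequency), highest first.
toy_sorted <- toy_motifs[order(toy_motifs[[2]], decreasing = TRUE), ]

# Keep the two most frequent motifs.
toy_top <- utils::head(toy_sorted, 2)
print(toy_top)
```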