create_ngrams | R Documentation |
Creates the vector of all possible n_grams (for given n
).
create_ngrams(n, u, possible_grams = NULL)
n |
|
u |
|
possible_grams |
number of possible n-grams. If not |
See Details section of count_ngrams
for more
information about n-grams naming convention. The possible information about distance
must be added by hand (see examples).
a character vector. Elements of n-gram are separated by dot.
Input data must be a matrix or data frame of numeric elements.
# bigrams for standard aminoacids
create_ngrams(2, 1L:20)
# bigrams for standard aminoacids with positions, 10 amino acid long sequence, so
# only 9 bigrams can be located in sequence
create_ngrams(2, 1L:20, 9)
# bigrams for DNA with positions, 10 nucleotide long sequence, distance 1, so only
# 8 bigrams in sequence
# paste0 adds information about distance at the end of n-gram
paste0(create_ngrams(2, 1L:4, 8), "_0")
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.