Extract sentences from a batch of text lines.
a character vector.
a regular expression matching an End-Of-Sentence delimiter.
TRUE or FALSE? Should the first character of the matches be appended to the returned sentences (with a space)?
tknz_sent() splits text into sentences using a list of
single character delimiters, specified by the parameter
Specifically, when an EOS token is found, the next sentence begins at the
first position in the input string not containing any of the EOS tokens
or white space (so that entries like
"Hi there!!!" or
"Hello . . ." are both recognized as a single sentence).
FALSE, the delimiters are stripped off from
the returned sequences, which means that all delimiters are treated
In the absence of any
returns the input as is, since parts of text corresponding to different
entries of the input vector
x are understood as parts of separate
a character vector, each entry of which corresponds to a single sentence.
tknz_sent("Hi there! I'm using `sbo`.")
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.