lex: Break a string into labelled tokens based upon a set of patterns
In coolbutuseless/minilexer: A Simple Tool for Lexing Text Data

Break a string into labelled tokens based upon a set of patterns

lex(text, patterns, debug = FALSE)

`text`	a single character string
`patterns`	a named character vector. Each element is a regex that matches one kind of token, and the element's name is the label for that token. If a regex already contains a capture group it is left as is; otherwise the whole regex is wrapped in a capture group. Patterns are tried in order, so an earlier match takes precedence over any later one.
`debug`	if TRUE, print additional debugging information about the matching. Default: FALSE
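
The ordering rule for `patterns` can be illustrated with a small base-R sketch. This is not the minilexer implementation: `sketch_lex` is a hypothetical helper written only for this example, and it scans the text from left to right instead of building capture groups. It assumes that every part of the input is matched by at least one pattern and that a named character vector is a reasonable way to return the tokens.

```r
# Hypothetical helper (not part of minilexer): first-match-wins lexing
# over a named character vector of regex patterns.
sketch_lex <- function(text, patterns) {
  tokens <- character(0)
  labels <- character(0)
  while (nchar(text) > 0) {
    matched <- FALSE
    for (label in names(patterns)) {
      # Anchor the pattern at the start of the remaining text.
      m <- regexpr(paste0("^(?:", patterns[[label]], ")"), text, perl = TRUE)
      len <- attr(m, "match.length")
      if (as.integer(m) == 1L && len > 0L) {
        tokens <- c(tokens, substr(text, 1, len))
        labels <- c(labels, label)
        text <- substr(text, len + 1, nchar(text))
        matched <- TRUE
        break  # earlier patterns win; later ones are not tried
      }
    }
    if (!matched) stop("No pattern matches at: ", text)
  }
  stats::setNames(tokens, labels)
}

# The order of the elements sets the precedence of the patterns.
patterns <- c(
  number     = "\\d+",
  word       = "[a-zA-Z]+",
  whitespace = "\\s+"
)
tokens <- sketch_lex("abc 123", patterns)
print(tokens)
```

Because the loop stops at the first pattern that matches, moving a more general pattern ahead of a more specific one changes which label a token receives.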