stringi: Fast and Portable Character String Processing Facilities

A collection of character string/text/natural language processing tools for pattern searching (e.g., with 'Java'-like regular expressions or the 'Unicode' collation algorithm), random string generation, case mapping, string transliteration, concatenation, sorting, padding, wrapping, Unicode normalisation, date-time formatting and parsing, and many more. They are fast, consistent, convenient, and - thanks to 'ICU' (International Components for Unicode) - portable across all locales and platforms. Documentation about 'stringi' is provided via its website at <> and the paper by Gagolewski (2022, <doi:10.18637/jss.v103.i02>).

Getting started

Package details

AuthorMarek Gagolewski [aut, cre, cph] (<>), Bartek Tartanus [ctb], and others (stringi source code); Unicode, Inc. and others (ICU4C source code, Unicode Character Database)
MaintainerMarek Gagolewski <>
Licensefile LICENSE
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:

Try the stringi package in your browser

Any scripts or data that you put into this service are public.

stringi documentation built on May 29, 2024, 8:16 a.m.