strip_accents: Strip accents from a piece of text.

View source: R/tokenization.R

strip_accents    R Documentation

Strip accents from a piece of text.

Description

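An R implementation of BasicTokenizer._run_strip_accents from BERT's tokenization.py. The BERT function normalizes the text to Unicode NFD form and then drops every character in the Unicode category Mn (nonspacing marks), which leaves the base letters without their accents.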

Usage

strip_accents(text)

Arguments

text

A character scalar, encoded as UTF-8.

Value

The input text, with accents removed.
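
Examples

The following is a minimal sketch of the technique used by the BERT function this one is modeled on (NFD normalization followed by removal of nonspacing marks). It is not RBERT's actual source: the name strip_accents_sketch is hypothetical, and the use of the stringi package is an assumption made for illustration.

```r
# Illustrative sketch only; strip_accents_sketch is a hypothetical name,
# not part of RBERT. Assumes the stringi package is installed.
strip_accents_sketch <- function(text) {
  stopifnot(is.character(text), length(text) == 1L)
  # Decompose accented characters into a base character plus
  # combining marks (Unicode NFD normalization).
  decomposed <- stringi::stri_trans_nfd(text)
  # Remove nonspacing marks (Unicode general category Mn).
  stringi::stri_replace_all_regex(decomposed, "\\p{Mn}", "")
}

# "\u00e7" is c-cedilla, "\u00e8" is e-grave, "\u00fb" is u-circumflex,
# "\u00e9" is e-acute.
strip_accents_sketch("fa\u00e7ile cr\u00e8me br\u00fbl\u00e9e")
```

Under these assumptions the call should return the same words with plain ASCII letters. Characters that have no NFD decomposition into a base letter plus a mark (for example "\u00f8", o with stroke) are left unchanged by this approach.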


jonathanbratt/RBERT documentation built on Jan. 26, 2023, 4:15 p.m.