remove_corrupt_utf8: Remove Corrupt UTF8

Description Usage Arguments Examples

View source: R/prepare.R

Description

Remove corrupt UTF8 characters that might cause issues, recommended.

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
remove_corrupt_utf8(text)

## S3 method for class 'corpus'
remove_corrupt_utf8(text)

## S3 method for class 'documents'
remove_corrupt_utf8(text)

## S3 method for class 'document'
remove_corrupt_utf8(text)

Arguments

text

An object inheriting of class document or corpus.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
## Not run: 
init_textanalysis()

# build document
doc <- string_document("this document is clean")

# replaces in place!
remove_corrupt_utf8(doc)

## End(Not run)

news-r/textanalysis documentation built on Nov. 4, 2019, 9:40 p.m.