cld2: Google's Compact Language Detector 2

Bindings to Google's C++ library Compact Language Detector 2 (see <https://github.com/cld2owners/cld2#readme> for more information). Probabilistically detects over 80 languages in plain text or HTML. For mixed-language input it returns the top three detected languages and their approximate proportion of the total classified text bytes (e.g. 80% English and 20% French out of 1000 bytes). There is also a 'cld3' package on CRAN which uses a neural network model instead.

Getting started

Package details

AuthorJeroen Ooms [aut, cre] (<https://orcid.org/0000-0002-4035-0289>), Dirk Sites [cph] (Author of CLD2 C++ library)
MaintainerJeroen Ooms <jeroen@berkeley.edu>
LicenseApache License 2.0
Version1.2.4
URL https://docs.ropensci.org/cld2/ (docs) https://github.com/ropensci/cld2 (devel) https://github.com/cld2owners/cld2 (upstream)
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("cld2")

Try the cld2 package in your browser

Any scripts or data that you put into this service are public.

cld2 documentation built on Oct. 27, 2022, 1:06 a.m.