text_one_hot: One-hot encode a text into a list of word indexes in a...
In keras: R Interface to 'Keras'

View source: R/preprocessing.R

text_one_hot

R Documentation

One-hot encode a text into a list of word indexes in a vocabulary of size n.

Description

One-hot encode a text into a list of word indexes in a vocabulary of size n.

Usage

text_one_hot(
  input_text,
  n,
  filters = "!\"#$%&()*+,-./:;<=>?@[\\]^_`{|}~\t\n",
  lower = TRUE,
  split = " ",
  text = NULL
)

Arguments

`input_text`	Input text (string).
`n`	Size of vocabulary (integer)
`filters`	Sequence of characters to filter out such as punctuation. Default includes basic punctuation, tabs, and newlines.
`lower`	Whether to convert the input to lowercase.
`split`	Sentence split marker (string).
`text`	for compatibility purpose. use `input_text` instead.

Value

List of integers in ⁠[1, n]⁠. Each integer encodes a word (unicity non-guaranteed).

See Also

Other text preprocessing: make_sampling_table(), pad_sequences(), skipgrams(), text_hashing_trick(), text_to_word_sequence()

keras documentation built on May 29, 2024, 3:20 a.m.

keras index

Package overview Frequently Asked Questions Getting Started with Keras Guide to Keras Basics Guide to the Functional API Guide to the Sequential Model Saving and serializing models Training Callbacks Training Visualization Using Pre-Trained Models Writing Custom Keras Layers Writing Custom Keras Models

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Tweet to @rdrrHQ

GitHub issue tracker

ian@mutexlabs.com