View source: R/preprocessing.R
pad_sequences | R Documentation |
Pads sequences to the same length
pad_sequences(
  sequences,
  maxlen = NULL,
  dtype = "int32",
  padding = "pre",
  truncating = "pre",
  value = 0
)
sequences | List of lists where each element is a sequence |
maxlen | int, maximum length of all sequences |
dtype | type of the output sequences |
padding | 'pre' or 'post', pad either before or after each sequence |
truncating | 'pre' or 'post', remove values from sequences longer than maxlen, either at the beginning or at the end of the sequence |
value | float, padding value |
This function transforms a list of num_samples sequences (lists of integers) into a matrix of shape (num_samples, num_timesteps). num_timesteps is either the maxlen argument if provided, or the length of the longest sequence otherwise.
Sequences that are shorter than num_timesteps are padded with value until they are num_timesteps long.
Sequences longer than num_timesteps are truncated so that they fit the desired length. The position where padding or truncation happens is determined by the arguments padding and truncating, respectively.
Pre-padding and pre-truncation are the defaults.
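The padding and truncation rules described above can be illustrated with a small base-R sketch. This is not the keras implementation: pad_sequences_sketch is a hypothetical helper written only for illustration. It assumes numeric sequences and ignores the dtype argument.

```r
# Hypothetical base-R sketch of the padding/truncation rules.
# NOT the keras implementation; the function name is illustrative only.
pad_sequences_sketch <- function(sequences, maxlen = NULL,
                                 padding = "pre", truncating = "pre",
                                 value = 0) {
  padding <- match.arg(padding, c("pre", "post"))
  truncating <- match.arg(truncating, c("pre", "post"))

  # num_timesteps: maxlen if provided, otherwise the longest sequence
  seq_lengths <- vapply(sequences, length, integer(1))
  if (is.null(maxlen)) maxlen <- max(seq_lengths)

  # Start from a matrix filled with the padding value
  out <- matrix(value, nrow = length(sequences), ncol = maxlen)

  for (i in seq_along(sequences)) {
    s <- unlist(sequences[[i]])
    # Truncate sequences that are longer than maxlen
    if (length(s) > maxlen) {
      s <- if (truncating == "pre") tail(s, maxlen) else head(s, maxlen)
    }
    n <- length(s)
    if (n == 0) next
    # Place the sequence at the end ("pre" padding) or the start ("post")
    if (padding == "pre") {
      out[i, (maxlen - n + 1):maxlen] <- s
    } else {
      out[i, seq_len(n)] <- s
    }
  }
  out
}

seqs <- list(c(1, 2, 3), c(4, 5), c(6))

# Defaults: pad at the beginning, up to the longest sequence (3)
pre <- pad_sequences_sketch(seqs)
# rows: (1, 2, 3), (0, 4, 5), (0, 0, 6)

# maxlen = 2 with post-padding and post-truncation
post <- pad_sequences_sketch(seqs, maxlen = 2,
                             padding = "post", truncating = "post")
# rows: (1, 2), (4, 5), (6, 0)
```

With the defaults, shorter sequences receive the padding value at the front and the number of columns equals the length of the longest sequence. Setting maxlen = 2 forces truncation of the first sequence, and the padding and truncating arguments independently choose which end is affected.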
Matrix with dimensions (number_of_sequences, maxlen), where maxlen is the length of the longest sequence if the maxlen argument is NULL
Other text preprocessing: make_sampling_table(), skipgrams(), text_hashing_trick(), text_one_hot(), text_to_word_sequence()