step_categorical_column_with_vocabulary_file | R Documentation |
Use this function when the vocabulary of a categorical variable is written to a file.
step_categorical_column_with_vocabulary_file( spec, ..., vocabulary_file, vocabulary_size = NULL, dtype = tf$string, default_value = NULL, num_oov_buckets = 0L )
spec |
A feature specification created with |
... |
Comma separated list of variable names to apply the step. selectors can also be used. |
vocabulary_file |
The vocabulary file name. |
vocabulary_size |
Number of the elements in the vocabulary. This
must be no greater than length of |
dtype |
The type of features. Only string and integer types are supported. |
default_value |
The integer ID value to return for out-of-vocabulary
feature values, defaults to |
num_oov_buckets |
Non-negative integer, the number of out-of-vocabulary
buckets. All out-of-vocabulary inputs will be assigned IDs in the range
|
a FeatureSpec
object.
steps for a complete list of allowed steps.
Other Feature Spec Functions:
dataset_use_spec()
,
feature_spec()
,
fit.FeatureSpec()
,
step_bucketized_column()
,
step_categorical_column_with_hash_bucket()
,
step_categorical_column_with_identity()
,
step_categorical_column_with_vocabulary_list()
,
step_crossed_column()
,
step_embedding_column()
,
step_indicator_column()
,
step_numeric_column()
,
step_remove_column()
,
step_shared_embeddings_column()
,
steps
## Not run: library(tfdatasets) data(hearts) file <- tempfile() writeLines(unique(hearts$thal), file) hearts <- tensor_slices_dataset(hearts) %>% dataset_batch(32) # use the formula interface spec <- feature_spec(hearts, target ~ thal) %>% step_categorical_column_with_vocabulary_file(thal, vocabulary_file = file) spec_fit <- fit(spec) final_dataset <- hearts %>% dataset_use_spec(spec_fit) ## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.