View source: R/cas_check_corpus.R
cas_check_corpus | R Documentation |
Checks if given corpus exists, and, optionally updates it
cas_check_corpus(
...,
update = FALSE,
keep_only_latest = FALSE,
path = NULL,
file_format = "parquet",
partition = NULL,
token = "full_text",
corpus_folder = "corpus"
)
... |
Passed to |
update |
Logical, defaults to FALSE. If set to TRUE, it checks if the local database has contents with a higher content id than is currently available in previously exported corpus, if any. If so, it writes a new, updated corpus. |
keep_only_latest |
Logical, defaults to FALSE. If set to TRUE, it deletes previous, older, corpora of the same type. |
path |
Defaults to NULL. If NULL, path is set to the project/website/export/dataset/file_format folder. |
file_format |
Defaults to "parquet". Currently, other options are not implemented. |
partition |
Defaults to NULL. If NULL, the parquet file is not
partitioned. "year" is a common alternative: if set to "year", the parquet
file is partitioned by year. If a |
token |
Defaults to "full_text", which does not tokenise the text
column. If different from |
Path to corpus. NULL, if no corpus is found and update is set to FALSE.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.