View source: R/03_process_document.R
| process_document | R Documentation |
This function extracts the text embedded in a .pdf or .txt file
and processes it so it can be safely passed to LLM APIs.
process_document(path, extra_measures = NULL)
path |
character. Path to the .pdf or .txt file to process. |
extra_measures |
character. To be implemented. Some documents are
especially difficult for LLMs to process due to a variety of
issues, such as size and formatting. |
character. Fully processed text.
path = arete_data("holzapfelae")
process_document(path)
extra_measures = list("mention", "Tricholathys spiralis")
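Since the function is described as accepting only .pdf or .txt files, it may help to validate a path before calling it. The following is a minimal sketch under that assumption: `check_document_path()` is a hypothetical helper, not part of the package, and whether `process_document()` performs similar checks itself is not stated here.

```r
# Hypothetical helper (not part of the package): validate a path before
# handing it to process_document().
check_document_path <- function(path) {
  stopifnot(is.character(path), length(path) == 1L)

  # tools::file_ext() returns the extension without the leading dot.
  ext <- tolower(tools::file_ext(path))
  if (!ext %in% c("pdf", "txt")) {
    stop("Expected a .pdf or .txt file, got: ", basename(path))
  }
  if (!file.exists(path)) {
    stop("File not found: ", path)
  }

  # Return the path unchanged so the call can be nested.
  invisible(path)
}

# Usage (assumes the package providing process_document() is attached):
# text <- process_document(check_document_path(path))
```

Returning the path invisibly keeps the helper composable: it can wrap the `path` argument directly without changing the call shown in the examples.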