scrapenames | R Documentation |
Uses the Global Names Recognition and Discovery service, see http://gnrd.globalnames.org/
NOTE: This function sometimes gives data back and sometimes not. The API that this function is using is extremely buggy.
scrapenames(
url = NULL,
text = NULL,
format = "csv",
bytes_offset = FALSE,
return_content = FALSE,
unique_names = TRUE,
ambiguous_names = FALSE,
no_bayes = FALSE,
odds_details = FALSE,
language = "detect",
words_around = 0,
verification = TRUE,
sources = NULL,
all_matches = FALSE,
...,
file = NULL,
unique = NULL,
engine = NULL,
detect_language = NULL,
data_source_ids = NULL
)
url |
(character) If text parameter is empty, and |
text |
(character) Contains the text which will be checked for
scientific names. If this parameter is not empty, the |
format |
(character) Sets the output format. It can be set to: |
bytes_offset |
(logical) This changes how the position of a detected
name in text is calculated. Normally a name's start and end positions are
given as the number of UTF-8 characters from the beginning of the text. If
this is |
return_content |
(logical) If this is |
unique_names |
(logical) If this is |
ambiguous_names |
(logical) If this is |
no_bayes |
(logical) If this is |
odds_details |
(logical) If |
language |
(character) The language of the text. Language value is used
for calculation of Bayesian odds. If this parameter is not given, |
words_around |
(integer) Allows to see the context surrounding a name-string. This sets the number of words located immediately before or after a detected name. These words are then returned in the output. Default is 0, maximum value is 5. |
verification |
(character) When this |
sources |
Pipe separated list of data source ids to resolve found names against. See list of Data Sources http://resolver.globalnames.org/data_sources |
all_matches |
When this option is true all found results are returned, not only the bestResults. The bestResult field in this case is null, and results field should contain found results of the matches. |
... |
Further args passed to crul::verb-GET |
file |
Defunct. If you feel this is important functionality submit an issue at "https://github.com/ropensci/taxize" |
unique |
Defunct. See the |
engine |
Defunct. The API used no longer supports this option. |
detect_language |
Defunct. See the |
data_source_ids |
Defunct. See the |
A tibble::tibble()
or list representing parsed JSON output
depending on the value of the format
option.
Scott Chamberlain, Zachary Foster
## Not run:
# Get data from a website using its URL
scrapenames('https://en.wikipedia.org/wiki/Spider')
scrapenames('https://en.wikipedia.org/wiki/Animal')
scrapenames('https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0095068')
scrapenames('https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0080498')
scrapenames(url = 'https://en.wikipedia.org/wiki/Spider', source=c(1, 169))
# Get data from text string
scrapenames(text='A spider named Pardosa moesta Banks, 1892')
# return OCR content
scrapenames(text='A spider named Pardosa moesta Banks, 1892',
return_content = TRUE, format = 'json')
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.