isMultipleHits: Get the RNs from a page listing the first 5 matches


View source: R/deprecated/isMultipleHits.R View source: R/cdp-utils.R

Description

ChemiDPlus scraping involves 2 HTML responses. The first is associated with the concept search; the second lists the 1 to 5 Registry Numbers (RNs) and their URLs if records were found. The ChemiDPlus parsing functions sort a response into 3 groups: searches that resulted in no records, a single record, or multiple records (currently limited to parsing a maximum of 5 records).
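
The three-way split described above can be sketched as follows. This is a minimal illustration, not the package's implementation: the helper name classify_hits, the "hit" class, and the inline HTML pages are all hypothetical, since the actual ChemiDPlus markup is not shown here.

```r
library(xml2)

# Hypothetical helper: counts elements marked with a made-up "hit" class
# and maps the count onto the three groups named in the description.
classify_hits <- function(response) {
  n_hits <- length(xml_find_all(response, "//div[@class='hit']"))
  if (n_hits == 0) {
    "no records"
  } else if (n_hits == 1) {
    "single record"
  } else {
    "multiple records"
  }
}

# Invented stand-in pages, not real ChemiDPlus responses.
none   <- read_html("<html><body><p>No results</p></body></html>")
single <- read_html("<html><body><div class='hit'>50-00-0</div></body></html>")
many   <- read_html(
  "<html><body><div class='hit'>50-00-0</div><div class='hit'>64-17-5</div></body></html>"
)

classify_hits(none)
classify_hits(single)
classify_hits(many)
```

In the package itself this decision appears to be spread across separate predicates (is404(), isNoRecord(), isSingleHit(), isMultipleHits()) rather than made by one function.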

Usage

isMultipleHits(response)

Arguments

response

An object of class "xml_document" / "xml_node", as returned by calling xml2::read_html on the rn_url.
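
As a small illustration of the expected input, parsing any HTML with xml2::read_html yields an object carrying both classes. The HTML string below is a placeholder; in practice the argument would come from the rn_url, which is not reproduced here.

```r
library(xml2)

# Placeholder HTML standing in for the page behind rn_url.
response <- read_html("<html><body><p>placeholder</p></body></html>")

# Both classes named in the argument description are present.
inherits(response, "xml_document")
inherits(response, "xml_node")
```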

Multiple Hits

If a ChemiDPlus search has multiple possible matches, the landing page that lists the first 5 matches is scraped for the RN of each match.
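
A rough sketch of that scraping step is given below. It assumes a made-up listing layout (table cells with a hypothetical "rn" class and an "RN:" prefix) and uses xml2 with base R rather than the package's own code; the real page structure and the real parsing logic may differ.

```r
library(xml2)

# Invented listing page; the markup is an assumption for illustration only.
listing <- read_html('
<html><body>
  <table>
    <tr><td>Formaldehyde</td><td class="rn">RN: 50-00-0</td></tr>
    <tr><td>Ethanol</td><td class="rn">RN: 64-17-5</td></tr>
    <tr><td>Acetone</td><td class="rn">RN: 67-64-1</td></tr>
  </table>
</body></html>')

# Pull the text of every cell carrying the hypothetical "rn" class.
rn_cells <- xml_text(xml_find_all(listing, "//td[@class='rn']"))

# Drop the assumed "RN:" prefix and surrounding whitespace.
rns <- trimws(sub("^RN:", "", rn_cells))

# Keep at most 5 RNs, mirroring the documented limit.
rns <- head(rns, 5)
rns
```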

See Also

html_nodes, html_text, as_tibble, filter_at_grepl, extract, mutate, mutate_all, str_remove

Other chemidplus parsing: is404(), isNoRecord(), isSingleHit()


meerapatelmd/skyscraper documentation built on Dec. 27, 2020, 7:46 a.m.