Man pages for meerapatelmd/skyscraper
Scrape Clinical Drug Data

cancergov_internal             CancerGov Internal Functions
cdp_run                        Run ChemiDPlus on all HemOnc Drugs
cdp_search                     Search ChemiDPlus and Store Results
cg_run                         Run CancerGov Scrape and Store
cg_search                      Search Cancer.gov
chemidplus_parsing_functions   ChemiDPlus Parsing Functions
chemidplus_scraping_functions  ChemiDPlus Scraping Functions
drug_count                     Get the Drug Count in the Drug Dictionary
ex_Crawl_delay                 Execute Crawl Delay
get_classification             Scrape the "Classification" Section at a Registry Number URL
get_classification_code        Scrape the Classification Code in the Summary Header of the...
get_dictionary_and_links       Scrape the Drug Definitions and Links from the NCI Drug...
get_drug_link_synonym          Get the Synonyms found at a given Drug Link
get_drug_link_url              Get the URLs found in a Drug Link
get_links_to_resources         Scrape the "Links to Resources" Section at a Registry Number...
get_names_and_synonyms         Scrape the "Names and Synonyms" Section at a Registry Number...
get_ncit                       Scrape the NCI Thesaurus
get_ncit_synonym               Scrape the NCI Thesaurus
get_pm                         Scrape PubMed Publications
get_pm_earliest                Get Earliest PubMed Publications
get_pm_latest                  Get Latest PubMed Publications
get_registry_numbers           Scrape the "Registry Numbers" Section at a Registry Number...
get_rn_url_validity            Check that the Registry Number URL is Valid
is404                          Is the RN URL returning an HTTP error 404?
isMultipleHits                 Get the RNs from a page listing the first 5 matches
isMultipleHits2                FUNCTION_TITLE
isMultipleHits3                FUNCTION_TITLE
isNoRecord                     Does the RN URL indicate that no records were found?
isSingleHit                    Parse the RN from a single Substance Page
isSingleHit2                   FUNCTION_TITLE
list_cg_tables                 List CancerGov Tables
log_drug_count                 Log the Drug Count in the Drug Dictionary
log_errors                     FUNCTION_TITLE
log_registry_number            Log Registry Number Matches for a Search
lookup_ncit_code               Lookup an NCIt Code
nci_internal                   NCI Drug Dictionary Internal Functions
nci_log_count                  Log the Drug Count in the Drug Dictionary
nci_run                        Run NCI Drug Dictionary
pipe                           Pipe operator
pm_run                         Run the complete PubMed Scrape
process_drug_link_ncit         Process the NCIt CUI from the Drug Link URL Table
process_drug_link_synonym      Process the Links found in the Drug Link Table for Synonyms
process_drug_link_url          Process the Links found in the Drug Link Table for NCIt and...
scrape                         Scrape
scrape_cdp                     Scrape ChemiDPlus
skyscraper-package             skyscraper: Scrape Clinical Drug Data
start_cdp                      Create ChemiDPlus Schema If Not Exist
start_cg                       Creates CancerGov Schema
start_nci                      Creates NCI Schema
start_pm                       Create PubMed Tables in Patelm9 Schema
update_cancergov_drugs         Update the Cancergov Drugs Table
write_cg_staging_tbl           Write Staging Tables to Cancergov Schema
meerapatelmd/skyscraper documentation built on Dec. 27, 2020, 7:46 a.m.