processDocument: Extract useful car sales information from a web page


View source: R/web-retrieve.R

Description

Extract useful car sales information from a web page. Prefer using processPage.

Usage

processDocument(xmlDoc,
  prodPath = "//div[@itemtype='http://schema.org/Product']")

Arguments

xmlDoc

an object of class XMLInternalDocument, such as the document returned by XML::htmlParse() or XML::xmlParse().

prodPath

the XPath expression identifying each car sale entry in the document.
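
The following is a minimal sketch of how the two arguments fit together, assuming the XML package is installed. The HTML fragment and the names html and entries are hypothetical and for illustration only; real pages will be structured differently. It parses a small page into an XMLInternalDocument and uses the default prodPath to select the candidate car sale entries, which is presumably the first thing processDocument does with its arguments.

```r
library(XML)

# Hypothetical page fragment, made up for illustration only.
html <- "<html><body>
  <div itemtype='http://schema.org/Product'><span itemprop='name'>Car A</span></div>
  <div itemtype='http://schema.org/Product'><span itemprop='name'>Car B</span></div>
  <div class='advert'>Not a product entry</div>
</body></html>"

# htmlParse() returns an internal document that inherits from XMLInternalDocument.
xmlDoc <- htmlParse(html, asText = TRUE)

# The default value of prodPath, copied from the Usage section.
prodPath <- "//div[@itemtype='http://schema.org/Product']"

# Select the nodes that prodPath identifies as car sale entries.
entries <- getNodeSet(xmlDoc, prodPath)
length(entries)

# With the carsales package loaded, the documented call would then be:
# result <- processDocument(xmlDoc, prodPath = prodPath)
```

Checking the number of nodes matched by prodPath before calling processDocument is a quick way to confirm that the expression suits the page being scraped.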

Value

a list with two elements. The first is a data frame with one row per car sold. The second is a list of data points that need further post-processing before inclusion.


jmp75/carsales documentation built on May 19, 2019, 1:55 p.m.