getTalk: Download a Talk

Description Usage Arguments Details Value Author(s) Examples

View source: R/getTalk.R

Description

Download a talk from lds.org and process it into plain text.

Usage

1

Arguments

url

URL of the talk to be downloaded

Details

The complete HTML code from the page is downloaded. A series of regular expressions is used to parse the HTML into a plain text version of the talk. Using regular expressions is not generally recommended for parsing HTML. Unfortunately, the HTML code doesn't appear to play well with more advanced forms of parsing. The regular expressions seem to work well here mostly because talk paragraphs start with the tag <p uri= ... id="pid[num]">. HTML rendered characters are converted to plain text and tags such as links, italic, and bold are removed.

Value

A character string giving the full text of the talk. Note that references are not included.

Author(s)

Benjamin Nutter

Examples

1
2
# Download the first talk of the 1971 General Conference
getTalk('https://www.lds.org/general-conference/1971/04/out-of-the-darkness?lang=eng')

nutterb/ldsconf documentation built on May 24, 2019, 10:52 a.m.