findSectionHeaders: Find the XML nodes corresponding to the section titles of the...

Description

This uses a heuristic approach to find the

Usage

findSectionHeaders(doc,
    sectionName = c("introduction", "background", "conclusions",
                    "discussion", "materials and methods",
                    "literature cited", "references cited", "the study"),
    otherSectionNames = c("references", "acknowledgements",
                          "acknowledgments", "results", "methods"),
    checkCentered = TRUE, discardAfterReferences = TRUE,
    allowRotated = FALSE, onlyFirst = FALSE, order = TRUE,
    groupByLine = FALSE)

Arguments

doc
sectionName
otherSectionNames
checkCentered

a logical value. If the nodes we identify as section headers using the "expected" names are centered, then by default, when we look for other text with the same font, we include only centered text. If checkCentered = FALSE, we include all text that has the same font as the section headers. Checking for centering is currently expensive.
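
As a rough illustration of the checkCentered trade-off described above, the following sketch calls findSectionHeaders() with the centering check turned off. It rests on several assumptions that this page does not confirm: that the function is exported from the GetDocElements package, that doc may be an XML document parsed with XML::xmlParse(), and that "paper.xml" (a hypothetical file name) holds the XML form of a PDF. The call is guarded so that nothing runs unless those pieces are present.

```r
# Hypothetical input file; replace with the XML version of your own PDF.
xmlFile <- "paper.xml"

# Arguments named in the Usage section; only checkCentered differs
# from its documented default (TRUE).
args <- list(checkCentered = FALSE)

# Run only if the packages and the file are available.
if (requireNamespace("GetDocElements", quietly = TRUE) &&
    requireNamespace("XML", quietly = TRUE) &&
    file.exists(xmlFile)) {
  # Assumption: a parsed XML document is an acceptable value for `doc`.
  doc <- XML::xmlParse(xmlFile)

  # With checkCentered = FALSE, all text sharing the section header
  # font is included, and the expensive centering check is skipped.
  headers <- do.call(GetDocElements::findSectionHeaders,
                     c(list(doc), args))
  print(headers)
}
```

Leaving checkCentered at its default of TRUE should give a stricter result when the expected headers are centered, at the cost of the slower centering check.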

discardAfterReferences
allowRotated
onlyFirst
order
groupByLine

Author(s)

Duncan Temple Lang


dsidavis/GetDocElements documentation built on July 8, 2019, 2:01 p.m.