findSpecies: Find species names

Description Usage Arguments Details Value Author(s) See Also Examples

View source: R/findSpecies.R

Description

Find species in italics within PMC XML documents

Usage

1

Arguments

doc

XML document from PMC

Details

Finds text within italic tags and then checks for possible species by matching two words starting with a capital letter, excluding some common terms like "P value" and "In silico". Also replaces genus abbreviations by matching species names using unabbrev

Value

A vector of species names

Author(s)

Chris Stubben

See Also

unabbrev

Examples

1
2
3
4
5
## Not run: 
doc <- pmcOAI("PMC2231364" )
table2(findSpecies(doc))

## End(Not run)

cstubben/pmcXML documentation built on May 14, 2019, 12:25 p.m.