This function prepares the data by cleaning punctuation, checking spelling against the lexicons, mapping terms according to the lexicons, removing negative expressions, and converting everything to lower case. It wraps several of the package's other functions for ease of use. The user can choose whether to apply POS tagging and negative-phrase removal, and which extractor to use. By default, the extractor called 'Extractor' is used; it assumes all headers are present, in the same order, in each text entry. Also by default, negative phrases are removed and POS tagging is not performed.
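As a rough illustration of the kind of cleaning described above, the following base R sketch lower-cases the text, drops sentences containing a negation cue, and strips punctuation. It is not the package's implementation: the function name `sketch_clean` and the list of negation cues are assumptions made for this example, and spell checking and lexicon mapping are left out.

```r
# Hypothetical helper, NOT part of the package: a minimal sketch of
# lower casing, negative-sentence removal and punctuation cleaning.
sketch_clean <- function(text, neg_cues = c("no evidence of", "negative for")) {
  # Assumed negation cues, joined into one regular expression
  neg_pattern <- paste(neg_cues, collapse = "|")
  # Lower-case, then split each entry into sentences on full stops
  sentences <- strsplit(tolower(text), ".", fixed = TRUE)
  vapply(sentences, function(s) {
    s <- trimws(s)
    s <- s[nzchar(s)]
    # Drop sentences that contain a negation cue
    s <- s[!grepl(neg_pattern, s)]
    # Replace punctuation with spaces, then collapse repeated whitespace
    s <- gsub("[[:punct:]]", " ", s)
    s <- gsub("\\s+", " ", s)
    paste(trimws(s), collapse = ". ")
  }, character(1))
}

report <- "Diagnosis: Chronic gastritis. No evidence of dysplasia."
cleaned <- sketch_clean(report)
print(cleaned)
```

Splitting on full stops is a deliberate simplification; real reports contain abbreviations and decimal numbers that would need more careful sentence detection.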
inputText | The relevant pathology text column.
delim | The delimiters (section headers) that the extractor uses to split the text.
NegEx | Whether the NegativeRemove function is used.
Extractor | Which extractor to use: 1 selects Extractor (for uniformly ordered headers); 2 selects Extractor2 (for text where headers are sometimes missing).
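To make the role of `delim` more concrete, here is a minimal base R sketch of delimiter-based extraction under the same assumption as the default extractor: every header is present and appears in the same order. The function `sketch_extract` is hypothetical and only illustrates the idea; it is not how the package's extractors are implemented.

```r
# Hypothetical helper, NOT part of the package: splits one report into
# sections using an ordered vector of headers.
sketch_extract <- function(text, delim) {
  # Position of the first occurrence of each header (-1 if absent)
  starts <- vapply(delim,
                   function(d) as.integer(regexpr(d, text, fixed = TRUE)),
                   integer(1))
  # Assumption: all headers are present and appear in the given order
  stopifnot(all(starts > 0), !is.unsorted(starts))
  # Each section runs from the end of its header to the next header
  body_from <- starts + nchar(delim)
  body_to <- c(starts[-1] - 1L, nchar(text))
  out <- trimws(substring(text, body_from, body_to))
  names(out) <- delim
  out
}

text <- "Clinical Details: Reflux. Histology: Normal mucosa. Diagnosis: Normal."
delim <- c("Clinical Details:", "Histology:", "Diagnosis:")
sections <- sketch_extract(text, delim)
print(sections[["Histology:"]])
```

A report with a missing header fails the `stopifnot` check here, which is the situation the second extractor option is described as handling.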
This returns a string vector.
mywords <- c("Hospital Number", "Patient Name:", "DOB:", "General Practitioner:",
             "Date received:", "Clinical Details:", "Macroscopic description:",
             "Histology:", "Diagnosis:")
CleanResults <- textPrep(PathDataFrameFinal$PathReportWhole, mywords,
                         NegEx = "TRUE", Extractor = "1", ExtractPOS = "2")