View source: R/process_numeric.r
as_numeric_ecotox | R Documentation |
character
to numeric
Similar to
as.numeric()
, but it also
performs some text sanitising before coercing text to numerics.
as_numeric_ecotox(x, range_fun = NULL, ..., warn = TRUE)
x |
A vector of |
range_fun |
Function to summarise range values. If |
... |
Arguments passed to |
warn |
If set to |
The following steps are performed to sanitise text before coercing it to numerics:
Notes labelled with "x"
or "\*"
are removed.
Operators (">"
, ">="
, "<"
, "<="
, "~"
, "="
, "ca"
, "er"
)
are removed.
Text between brackets ("()"
) is removed (including the brackets)
Comma's are considered to be a thousand separator when they are located at any fourth character (from the right) and removed. Comma's at any other location is assumed to be a decimal separator and is replaced by a period.
If there is a hyphen present (not preceded by an ""e"
or "E"
) it is probably
representing a range of values. When range_fun
is NULL
it will result in a NA
.
Otherwise, the numbers are split at the hyphen and aggregated with range_fun
It is your own responsibility to check if the sanitising steps are appropriate for your analyses.
A vector of numeric
values with the same length as x
.
Pepijn de Vries
Other ecotox-sanitisers:
as_date_ecotox()
,
as_unit_ecotox()
,
mixed_to_single_unit()
,
process_ecotox_dates()
,
process_ecotox_numerics()
,
process_ecotox_units()
## a vector of commonly used notations in the database to represent
## numeric values
char_num <- c("10", " 2", "3 ", "~5", "9.2*", "2,33",
"2,333", "2.1(1.0 - 3.2)", "1-5", "1e-3")
## Text fields reported as ranges are returned as `NA`:
as_numeric_ecotox(char_num, warn = FALSE)
## Text fields reported as ranges are processed with `range_fun`
as_numeric_ecotox(char_num, range_fun = median)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.