README.md

{rastro}

{rastro} is an attempt to explore the new type system provided by the {vctrs} package. In {rastro} I create some simple data containers for simple scenarios that I encounter in my scientific work.

Consider the following example. When an observation is carried out, obtained value is usually accompanied by an error. The error can be symmetric or asymmetric. This can be represented in a tibble using two or three columns. Alternatively, a special type can be created that stores observations and errors and can be easily transformed to and from data.frame.

library(rastro)
library(tibble, warn.conflicts = FALSE)
library(dplyr, warn.conflicts = FALSE)

# Create an observation from a standalone value
new_obs(5)
## <rastro_obs<double>[1]>
## [1] 5 ± 0
# Add an optional error
new_obs(rnorm(5), runif(5, 0, 0.5))
## <rastro_obs<double>[5]>
## [1] -1.72932190 ± 0.03937857  0.75431361 ± 0.05503880 -0.46229277 ± 0.38772896
## [4]  0.03224963 ± 0.04219440 -0.94938165 ± 0.07139272
# And asymmetric errors
new_obs(1:10, p_err = 2, n_err = 1)
## <rastro_obs<double>[10]>
##  [1]  1 (- 1; + 2)  2 (- 1; + 2)  3 (- 1; + 2)  4 (- 1; + 2)  5 (- 1; + 2)
##  [6]  6 (- 1; + 2)  7 (- 1; + 2)  8 (- 1; + 2)  9 (- 1; + 2) 10 (- 1; + 2)

With proper methods defined, rastro_obs can be used within tibble:

tibble(Time = 1:10, Value = new_obs(rnorm(10), runif(10, 0.1, 0.5)))
## # A tibble: 10 x 2
##     Time                   Value
##    <int>              <obs<dbl>>
##  1     1  0.50597762 ± 0.1141827
##  2     2 -1.09852633 ± 0.3058298
##  3     3  0.44978757 ± 0.1963286
##  4     4 -0.79895740 ± 0.1279313
##  5     5  2.19774867 ± 0.3376328
##  6     6  0.23805530 ± 0.2193765
##  7     7  1.01477300 ± 0.1503656
##  8     8  0.22494612 ± 0.3593588
##  9     9  1.13582433 ± 0.4722210
## 10    10 -0.07820105 ± 0.4501376

rastro_obs<T> behaves like a generic of type T. Type conversion rules limit possible values of T, but this limitation can be lifted by simply implementing additional type-conversion rules.

# An integer observation
new_obs(5L)
## <rastro_obs<integer>[1]>
## [1] 5 ± 0
# Strings are not allowed
new_obs("a")
## Error: Can't convert from <double> to <character>.

Arithmetic can be also implemented. A simple scenario would be to multiply/divide an observed quantity by a scalar, which scales both observation and its error:

new_obs(1:10, 4) * 0.5
## <rastro_obs<double>[10]>
##  [1] 0.5 ± 2 1.0 ± 2 1.5 ± 2 2.0 ± 2 2.5 ± 2 3.0 ± 2 3.5 ± 2 4.0 ± 2 4.5 ± 2
## [10] 5.0 ± 2

Another useful case is manipulation of multiple observed quantities. For instance, subtracting some background/baseline estimate (which itself is not known precisely) is a common task. To get the error on the derived quantity, a simple error propagation rule can be applied. The application of such rule can be encoded into the arithmetic functions for such type:

new_obs(30, 3) - new_obs(10, 4)
## <rastro_obs<double>[1]>
## [1] 20 ± 5

The result is 20 ± 5. If the function is Then , which is simply for a subtraction.

A more complex scenario

A similar type can be devised for a physical quantity. I chose flux (energy per unit area and unit time). Flux measurements are associated with a “filter” and a zero-point flux value (necessary for conversion). In general, it is illegal to combine fluxes obtained in different filters.

new_flux(100, "B", "mJy")
## <rastro_flux<B>[1]>
## [1] 1.000e+02
## Unit: mJy

This gives us flux of 100 milliJansky measured in filter B. If the error on this measurement is, say, 5 mJy, it is possible to store it as

new_obs(new_flux(100, "B", "mJy"), new_flux(5, "B", "mJy"))
## <rastro_obs<rastro_flux<B>>[1]>
## [1] 1.000e+02 ± 5.000e+00

In some cases the type inference can kick in and the error can be simply input as a double. This system prevents operation on fluxes obtained from different filters:

new_obs(new_flux(100, "B")) - new_obs(new_flux(50, "R"))
## Error: No common type for `..1` <rastro_flux<B>> and `..2` <rastro_flux<R>>.

But allows operations for the same filter (but not within rastro_obs<T>, because it requires T to be squared and then to extract square root from the sum)

new_flux(100, "B") - new_flux(50, "B")
## <rastro_flux<B>[1]>
## [1] 5.000e+01
## Unit: NA

Some more utilities

Angle measurements can be tricky, especially if angles are measured in degrees. Here is a solution

new_degr(c(370, -370))
## <rastro_degr[2]>
## [1] 10°  350°

Values are mapped to [0;360] interval. Base function support can be also defined:

mean(new_degr(10:30))
## <rastro_degr[1]>
## [1] 20°

And, most importantly,

sin(new_degr(30))
## [1] 0.5
cos(new_degr(60))
## [1] 0.5

Of course, this type can also be used with rastro_obs and tibble:

tibble(x = new_obs(new_degr(0:5 * 60), new_degr(10)))
## # A tibble: 6 x 1
##             x
##   <obs<degr>>
## 1    0° ± 10°
## 2   60° ± 10°
## 3  120° ± 10°
## 4  180° ± 10°
## 5  240° ± 10°
## 6  300° ± 10°

Work is still in progress



Ilia-Kosenkov/rastro documentation built on May 12, 2020, 12:56 a.m.