User Guide"

Introduction

Package 'photobiologyInOut' defines functions for importing spectral data from different instruments, data repositories and simulation models, and for data exchange with R packages 'hyperSpec', 'colorSpec' and 'pavo' by reading or writing objects of classes defined in them (see Tables at the start of each section).

As of version 0.4.21 packages 'hyperSpec' and 'pavo' need to be installed only if the corresponding functions will be used. This avoids bloat when these functions are not needed. If the vignette is built or examples attempted to run in a system where these packages are not installed, some output will be missing but no error will be triggered.

All functions that read data from files attempt to decode and store as metadata the information present in file headers. In addition, in most cases the unchanged header of the file is stored unaltered as a comment in the constructed objects.

It should be remembered, though, that this package has been developed based on the example files I had access to. Files from the same instruments with different hardware configurations, different firmware versions, or even settings may differ substantially. In many cases the output is produced by software in a host computer rather by the instrument itself, adding further uncertainties and possible differences due to for example the operating system of the host computer. A further complication is that in some cases the format of dates, times and numbers depends on the locale settings in use at the time of data acquisition, or analysis. For all those reasons, do expect to have to do some debugging, and most importantly always validate the imported data against the original file (remembering to run a new validation each time there is a software or firmware update) or update of this package as I test each version before release only with the example files I have available, which are not many.

Preliminaries

library(knitr)
opts_chunk$set(fig.align='center', fig.show='hold',
               fig.width=7, fig.height=6, size="footnotesize")
options(replace.assign = TRUE, width = 55,
        warnPartialMatchAttr = FALSE,
        warnPartialMatchDollar = FALSE,
        warnPartialMatchArgs = FALSE)

When loading suggested packages flags are set based on success and later used to selectively evaluate code chunks.

# setting TZ may be needed in some geographic locations as some Windows TZ 
# strings are not recognized by all versions of R
Sys.setenv(TZ = 'UTC')
library(photobiology)
library(photobiologyWavebands)
library(photobiologyInOut)
library(lubridate)
library(ggspectra)
library(readr)
# eval_colorSpec <- requireNamespace("colorSpec", quietly = TRUE)
library(colorSpec)
eval_colorSpec <- TRUE
eval_pavo <- requireNamespace("pavo", quietly = TRUE)
if (eval_pavo) {library(pavo)}
eval_hyperSpec <- requireNamespace("hyperSpec", quietly = TRUE)
if (eval_hyperSpec) {library(hyperSpec)}
theme_set(theme_bw()) # ggplot2
options(tibble.print_max = 5,
        tibble.print_min = 3,
        photobiology.strict.range = NA_integer_)

Spectrometer output files

In the examples we use function system-file to reliably locate the example files included in the package. For reproducing the examples with these files, this call is needed. When using user's files only the path as a character string needs to be passed as argument.

Functions for importing measured spectral data.

| R function | Instrument | Program | class of value | |:-----------------------|:-----------------------|:------------------|:-----------------| | read_oo_ssirrad() | Ocean Optics spectrom. | SpectraSuite | source_spct | | read_oo_ssdata() | Ocean Optics spectrom. | SpectraSuite | raw_spct | | read_oo_jazirrad() | Ocean Optics Jaz | instrument | source_spct | | read_oo_jazpc() | Ocean Optics Jaz | instrument | filter_spct | | read_oo_jazpc() | Ocean Optics Jaz | instrument | reflector_spct | | read_oo_jazdata() | Ocean Optics Jaz | instrument | raw_spct | | read_oo_pidata() | Ocean Optics spectrom. | STS DK (Raspbian) | raw_spct | | read_avaspec_csv() | Avantes spectrom. | instrument | source_spct | | read_macam_file() | Macam | instrument | source_spct | | read_licor_file() | LI-COR LI-1800 | PC1800 (MS-DOS) | source_spct | | read_licor_file() | LI-COR LI-1800 | PC1800 (MS-DOS) | filter_spct | | read_licor_file() | LI-COR LI-1800 | PC1800 (MS-DOS) | reflector_spct | | read_m_licor_file() | LI-COR LI-1800 | PC1800 (MS-DOS) | source_mspct | | read_m_licor_file() | LI-COR LI-1800 | PC1800 (MS-DOS) | filter_mspct | | read_m_licor_file() | LI-COR LI-1800 | PC1800 (MS-DOS) | reflector_mspct |

Ocean Optics Jaz

Raw detector counts

Reading a raw data file generated by Ocean Optics' Jaz spectrometer. The light source was the Jaz PX pulsed Xenon light module.

The first few lines of the file look like this, with W for wavelength, D for dark, R for reference, S for sample and P for processed (all spectral data values are raw detector counts):

Jaz Data File
++++++++++++++++++++++++++++++++++++
Date: Mon Apr 25 12:49:11 2016
User: jaz
Dark Spectrum Present: Yes
Reference Spectrum Present: Yes
Processed Spectrum Present: Yes
Spectrometers: JAZA3098
Integration Time (usec): 748000 (JAZA3098)
Spectra Averaged: 1 (JAZA3098)
Boxcar Smoothing: 0 (JAZA3098)
Correct for Electrical Dark: No (JAZA3098)
Strobe/Lamp Enabled: Yes (JAZA3098)
Correct for Detector Non-linearity: No (JAZA3098)
Correct for Stray Light: No (JAZA3098)
Number of Pixels in Processed Spectrum: 2048
>>>>>Begin Processed Spectral Data<<<<<
W   D   R   S   P
190.313904  0.000000    0.000000    0.000000    0.000000
190.695511  0.000000    0.000000    0.000000    0.000000
191.077087  1138.953125 1123.134277 1102.795898 228.570541
191.458633  1184.149658 1227.086426 1059.859131 -289.473419
191.840149  1175.110352 1193.188965 1132.173584 -237.500336
...
jaz.raw.file <- 
  system.file("extdata", "spectrum.jaz", 
              package = "photobiologyInOut", mustWork = TRUE)
jazraw.spct <- read_oo_jazdata(file = jaz.raw.file)
jazraw.spct <- trim_wl(jazraw.spct, range = c(250, 900))

Plotting the spectrum.

autoplot(jazraw.spct)

The metadata stored in attributes can be accessed with functions. It is clear, that not all settings can be recovered from the file. However, we store the record will all the fields which would have been filled if the data had been acquired directly from R using package 'ooacquire'.

getWhenMeasured(jazraw.spct)
getInstrDesc(jazraw.spct)
getInstrSettings(jazraw.spct)

Spectral energy irradiance

Reading an "Absolute Irradiance File" (sic) generated by Ocean Optics' Jaz spectrometer results in a source_spct object. In this example, the light source measured was a `white' fluorescent tube.

The first few lines of the file look like this:

Jaz Absolute Irradiance File
++++++++++++++++++++++++++++++++++++
Date: Tue Feb 03 09:44:41 2015
User: jaz
Dark Spectrum Present: Yes
Processed Spectrum Present: Yes
Spectrometers: JAZA1065
Integration Time (usec): 193000 (JAZA1065)
Spectra Averaged: 3 (JAZA1065)
Boxcar Smoothing: 5 (JAZA1065)
Correct for Electrical Dark: Yes (JAZA1065)
Strobe/Lamp Enabled: No (JAZA1065)
Correct for Detector Non-linearity: Yes (JAZA1065)
Correct for Stray Light: No (JAZA1065)
Number of Pixels in Processed Spectrum: 2048
Fiber (micron): 3900
Collection Area: 0.119459
Int. Sphere: No
>>>>>Begin Processed Spectral Data<<<<<
W   D   S   P
188.825226  0.000000    0.000000    0.000000
189.284851  0.000000    0.000000    0.000000
189.744415  -89.659378  -90.917900  -0.000000
190.203964  -106.165916 -96.419785  0.000000
...
jaz.s.irrad.file <- 
  system.file("extdata", "spectrum.JazIrrad", 
              package = "photobiologyInOut", mustWork = TRUE)
jaz.spct <- read_oo_jazirrad(file = jaz.s.irrad.file)
jaz0.spct <- jaz.spct
jaz.spct <- trim_wl(jaz.spct, range = c(290, 800))

Plotting the spectrum.

autoplot(jaz.spct)

Cleaning spectral data

We can see that the data have problems. We get a warning because the data include negative values for spectral irradiance. We will use some methods from package 'photobiology' to correct the problem. As the data are noisy we cannot just shift the scale so that the most negative value becomes zero. Neither can we replace all negative values with zeros, as this would create bias.

In the following code chunk we will use a region of the spectrum in which spectral irradiance is known to be equal to zero as reference to shift the scale zero. Afterwards we discard data ``known'' to be zero, and for which the instrument calibration is not valid, and finally we plot the spectrum.

jaz.spct <- fshift(jaz0.spct, range = c(255, 290), f = "mean")
jaz.spct <- trim_wl(jaz.spct, range = c(290, 800))
autoplot(jaz.spct)

We can next try to smooth the spectrum as it is very noisy outside the visible region.

jaz.spct <- smooth_spct(jaz.spct)
autoplot(jaz.spct)

Photon and energy irradiances.

e_irrad(jaz.spct, PAR())       # W m-2

All in one statement.

autoplot(read_oo_jazirrad(file = jaz.s.irrad.file))

As above but limiting the wavelength range plotted.

autoplot(read_oo_jazirrad(file = jaz.s.irrad.file),
     range = c(250,850))

Adding our custom ``adaptive'' smoothing.

autoplot(smooth_spct(read_oo_jazirrad(file = jaz.s.irrad.file)),
     range = c(250,850))

Other modular spectrometers from Ocean Optics

Now a file from an Ocean Optics' Q6500 spectrometer, with data processed with the Spectra Suite software.

Format of the header is similar, but not identical. The first few lines of the file look like this:

SpectraSuite Data File
++++++++++++++++++++++++++++++++++++
Date: Mon May 06 15:13:40 CEST 2013
User: User
Dark Spectrum Present: Yes
Reference Spectrum Present: No
Number of Sampled Component Spectra: 1
Spectrometers: QEB1523
Integration Time (usec): 100000 (QEB1523)
Spectra Averaged: 1 (QEB1523)
Boxcar Smoothing: 0 (QEB1523)
Correct for Electrical Dark: No (QEB1523)
Strobe/Lamp Enabled: No (QEB1523)
Correct for Detector Non-linearity: No (QEB1523)
Correct for Stray Light: Yes (QEB1523)
Number of Pixels in Processed Spectrum: 1044
>>>>>Begin Processed Spectral Data<<<<<
199.08  0.0000E00
199.89  0.0000E00
200.70  0.0000E00
...
q.raw.file <- 
  system.file("extdata", "spectrum.SSIrrad", 
              package = "photobiologyInOut", mustWork = TRUE)
autoplot(read_oo_ssirrad(file = q.raw.file))

Modular spectrometers from Avantes

Avantes' two column .csv files can also be imported.

ava.raw.file <- 
  system.file("extdata", "spectrum-avaspec.csv", 
              package = "photobiologyInOut", mustWork = TRUE)
autoplot(read_avaspec_csv(file = ava.raw.file),
     range = c(280, 900), unit.out = "photon")

Scanning spectrometer from Macam

Macam's single column DTA files can also be imported.

The first few lines of the file look like this with all data in a single column with alternate rows for wavelengths (in nm) and irradiances, and a very terse header:

@19/5/1997
@17:44:58
#No Title
 2.5000000000E+02
 0.0000000000E+00
 2.5100000000E+02
 0.0000000000E+00
 2.5200000000E+02
 0.0000000000E+00
...
macam.raw.file <- 
  system.file("extdata", "spectrum.DTA", 
              package = "photobiologyInOut", mustWork = TRUE)
autoplot(read_macam_dta(file = macam.raw.file))

LI-1800 scanning spectrometer from LI-COR

And a spectral photon irradiance output file generated by LI-COR's PC1800 program for the LI-1800 spectroradiometer.

The output has a relatively detailed header, but it lacks year information. Files can contain either energy or photon based spectral irradiances, and this is signalled in the header. In this example photon (= quantum) spectral irradiance is returned. The first few lines of the file look like this:

"FILE:FL2"
"REM: TLD 36W/865       (QNTM)"
"LIMS: 300- 900NM"
"INT:  1NM"
"DATE:08/23 16:32"
"MIN:  300NM  1.518E-04"
"MAX:  546NM  7.491E-01"
 300  1.518E-04
 301  3.355E-04
 302  2.197E-04
 303  3.240E-04
...

Function read_licor_prn will automatically detect whether the data is energy or photon based.

licor.file <- 
  system.file("extdata", "spectrum.PRN", 
              package = "photobiologyInOut", mustWork = TRUE)
licor.spct <- read_licor_prn(file = licor.file)

In all cases as much information as possible is decoded, and the data file headers are preserved as comments in the source.spct objects.

licor.spct
cat(comment(licor.spct))
autoplot(licor.spct, unit.out = "photon")

It is also possible to use the same function to import reflectance, and transmittance spectra acquired by the LI-1800.

And a spectral reflectance output file generated by LI-COR's PC1800 program for the LI-1800 spectroradiometer is used next.

The first few lines of the file look like this:

"FILE:RGD1"
"REM: REFL GREEN AD 1 "
"LIMS: 350- 800NM"
"INT:  2NM"
"DATE:05/30 13:50"
"MIN:  358NM  4.628E-02"
"MAX:  776NM  4.693E-01"
 350  5.135E-02
 352  4.713E-02
 354  5.324E-02
 356  4.740E-02
...

Function read_licor_prn cannot automatically detect the spectral quantity in the file, and when the irradiance default is not correct, users need to override it with an explicit argument for parameter s.qty.

licor.file <- 
  system.file("extdata", "reflectance.PRN", 
              package = "photobiologyInOut", mustWork = TRUE)
licor.spct <- read_licor_prn(file = licor.file, s.qty = "Rfr")

In all cases as much information as possible is decoded, and the data file headers are preserved as comments in the source.spct objects.

licor.spct
cat(comment(licor.spct))
autoplot(licor.spct)

Data from loggers

Campbell Scientific

Campbell Scientific is a well know supplier of data loggers for commercial and research applications. Function read_csi_dat() defined in this package has been tested with a recent datalogger model, the CR6, and using recent versions of programs PC400 and PC200W to download the data. The currently used format of .DAT files is easy to decode and our function can automatically detect the number and type of columns and the number of rows.

cs.day.file <- 
  system.file("extdata", "cr6-day.dat", 
              package = "photobiologyInOut", mustWork = TRUE)

Executing the statement below displays the 10 top lines of the DAT file as is, one character string per line.

# not run
read_lines(yoctopuce_hour.file, n_max = 10)
day.dat <- read_csi_dat(file = cs.day.file)
day.dat

All information is preserved in the returned tibble::tibble object, which is derived from data.frame.

cs_hour.file <- 
  system.file("extdata", "cr6-hour.dat", 
              package = "photobiologyInOut", mustWork = TRUE)
hour.dat <- read_csi_dat(file = cs_hour.file)
ggplot(hour.dat, aes(TIMESTAMP, PAR_Den_Avg)) + geom_line()

YoctoPuce

Yocto Puce Sarl sells numerous different USB modules. Those capable of data input can log the data to memory and these data can be downloaded as a CSV file. These files can be easily read into R but function read_yoctopuce_csv() makes this even simpler.

yoctopuce_hour.file <- 
  system.file("extdata", "yoctopuce-data.csv", 
              package = "photobiologyInOut", mustWork = TRUE)

Executing the statement below displays the 10 top lines of the CSV file as is, one character string per line.

# not run
read_lines(yoctopuce_hour.file, n_max = 10)

Here we import and plot the data.

hour.dat <- read_yoctopuce_csv(file = yoctopuce_hour.file)
ggplot(hour.dat, aes(ISO.time, temperature.avg)) + geom_line()

Output from simulation models

Functions for importing simulated spectral data.

| R function | Simulation model | Version | class of value | |:-----------------------|:-----------------------|:------------------|:-----------------| | read_tuv_usrout() | TUV (S. Madronich) | version 5.0 | source_spct | | read_tuv_usrout2mspct() | TUV (S. Madronich) | version 5.0 | source_mspct | | read_qtuv_txt() | TUV (S. Madronich) | version 5.2 | source_spct | | read_uvspec_disort() | libRadtran | irradiance | source_spct | | read_uvspec_vesa() | (T. & V. Kotilainen) | irradiance | source_spct | | read_fmi_cum() | (A. Lindfors) | daily cumulated | source_spct | | read_m_fmi_cum() | (A. Lindfors) | daily cumulated | source_mspct |

TUV

The output from the TUV model can be imported either by editing it before import, or by making a simple edit to the output routine of TUV. This function is known to work with TUV version 5.0 output. The output from TUV can contain a variable number of spectra in ''parallel'' columns, which are melted into a single column, with a factor with letters as levels, a numeric variable with the zenith angle and a POSIXct column with times. A date needs to be always supplied as the output file from TUV has only time of day information.

tuv.file <- 
  system.file("extdata", "usrout.txt", 
              package = "photobiologyInOut", mustWork = TRUE)
tuv.spct <- read_tuv_usrout(file = tuv.file,
                            date = ymd("2014-03-21"))
summary(subset(tuv.spct, spct.idx == "A"))
tuv.spct

It is possible to extract individual spectra with subset, or as done here plot them in different panels.

autoplot(tuv.spct, annotations = c("colour.guide")) +
  facet_wrap(~date, ncol = 2)

The output is a single source_spct} object that can be easily converted into asource_mspct} object containing the individual spectra as members of the collection.

tuv.mspct <- subset2mspct(tuv.spct)
tuv.mspct

A file can be directly read into a collection using read_tuv_usrout2mspct() which is a simple wrapper.

With the default of lubridate::today() date times are 'mapped' to the current local date using the time zone of the computer as visible to R.

tuv_nd.spct <- read_tuv_usrout(file = tuv.file)
tuv_nd.spct

Quick TUV calculator

The files output by the online calculator based on the TUV model, contain at most one spectrum, and arguments to only some parameters can be set by users. However, it is convenient to use when we only need a few simple simulations.

Function read_qtuv_txt() can extract spectra and the corresponding metadata from these files. These files do contain date time information and geolocation data when they are supplied as arguments to the calculator interface, otherwise only zenith angle is available.

qtuv.file <- 
  system.file("extdata", "qtuv.txt", 
              package = "photobiologyInOut", mustWork = TRUE)
qtuv.spct <- read_qtuv_txt(file = qtuv.file)
summary(qtuv.spct)
qtuv.spct

libRadtran

By default libRadtran's uvspec writes only spectral irradiances to a text file as output. This is different from 'TUV' which by default includes an extensive header with the parameter settings used for the simulation. It is easy to read this simple output file with R's functions. However, we provide functions, that simplify reading of the files. The output from uvspec varies depending on its input. The main source of differences is the solver routine used. We will provide a separate function for each solver.

For reading this simple output, no special function is needed. We can use read.table from base R. Here we read a file with two columns with wavelengths and global spectral energy irradiance (named "eglo" in libRadtran) in $mW\,m^{-2}\,nm^{-1}$. The file was created with one of the 'uvspec' examples included with libRadtran, but reducing the output to two columns.

The first few lines of the file look like this:

  250.000  0.000000e+00
  251.000  0.000000e+00
  252.000  0.000000e+00
  253.000  0.000000e+00
...
uvspec.2col.file <- 
  system.file("extdata", "uvspec-plain-2col.dat", 
              package = "photobiologyInOut", mustWork = TRUE)
lrt.df <- read.table(file = uvspec.2col.file,
                     col.names = c("w.length", "s.e.irrad"))
uvspec.01.spct <- source_spct(w.length = lrt.df$w.length,
                               s.e.irrad = lrt.df$s.e.irrad * 1e-3)
summary(uvspec.01.spct)
cat(comment(uvspec.01.spct))
autoplot(uvspec.01.spct, range = c(250, 2500), unit.out = "photon")

An example using solver disort and our function read_uvspec_disort() follows.

The first few lines of the file look like this:

  290.000  0.000000e+00  0.000000e+00  0.000000e+00  0.000000e+00  0.000000e+00  0.000000e+00
  291.000  1.046525e-11  4.800521e-06  1.674293e-07  2.374267e-12  5.342789e-07  2.664720e-08
  292.000  5.888299e-10  2.813865e-05  9.814190e-07  1.335888e-10  3.162808e-06  1.561977e-07
  293.000  4.383296e-09  5.764524e-05  2.010660e-06  9.944455e-10  6.522616e-06  3.200065e-07
...
uvspec.disort.file <- 
  system.file("extdata", "uvspec-disort.dat", 
              package = "photobiologyInOut", mustWork = TRUE)
uvspec.02.spct <- read_uvspec_disort(uvspec.disort.file)
summary(uvspec.02.spct)
cat(comment(uvspec.02.spct))
autoplot(uvspec.02.spct, unit.out = "photon")

The data contains also estimates of diffuse and direct spectral irradiance. Here we plot the total energy irradiance with a solid line and the diffuse component with a dashed line.

ggplot(uvspec.02.spct) +
  geom_line() +
  geom_line(aes(y = s.e.irrad.diff), linetype = "dashed")

The uvspec file used to generate the spectrum read above is:

data_files_path uvspec_home/data/
atmosphere_file uvspec_home/data/atmmod/afglms.dat
source solar uvspec_home/data/solar_flux/kurudz_1.0nm.dat
rte_solver disort
mol_abs_param lowtran
deltam on
number_of_streams 6
wavelength 290 900
day_of_year 287
altitude 0.012
albedo_library IGBP
brdf_rpv_type 5
mol_modify O3 288 DU
mol_modify H2O 10 MM
sza 69.4662
sur_temperature 273

If we plan to save and reuse the spectral object, it is recommended to append the input file to the comment.

uvspec.disort.inp.file <- 
  system.file("extdata", "uvspec-disort.inp", 
              package = "photobiologyInOut", mustWork = TRUE)
comment(uvspec.02.spct) <- paste(comment(uvspec.02.spct),
                                 read_file(uvspec.disort.inp.file),
                                 sep = "\n\n")
cat(comment(uvspec.02.spct))

We give two additional examples, which will most likely need some adjustment by users, as these are for output from libRadtran post-processed to add additional information. These are included in the package because myself and collaborators use these formats heavily. In fact users could develop shell scripts or Perl scripts using the same output format.

Output enriched with time and date data

In this case the file to be read is similar as above, but including separate columns for direct and diffuse components of the spectral energy irradiance. In addition two columns, one with date strings in ISO format and one with times have been added. The file instead of containing a single spectrum, contains several spectra in long form.

The first few lines of the file look like this:

290.000 2015-05-19 11_00_00 0.000000e+00 0.000000e+00
291.000 2015-05-19 11_00_00 0.000000e+00 0.000000e+00
292.000 2015-05-19 11_00_00 0.000000e+00 0.000000e+00
293.000 2015-05-19 11_00_00 1.893645e-05 3.439497e-05
294.000 2015-05-19 11_00_00 1.648530e-04 2.764368e-04
...

A function is included for reading data saved in a text file in this format. It also automatically converts $mW\,m^{-2}\,nm^{-1}$ into $W\,m^{-2}\,nm^{-1}$.

uvspec.multi.file <- 
  system.file("extdata", "uvspec-multi.dat", 
              package = "photobiologyInOut", mustWork = TRUE)
lbr.multi.spct <- read_uvspec_disort_vesa(uvspec.multi.file)
print(lbr.multi.spct, n = 5)

Scripts developed by Anders Lindfors

A model for the simulation of the solar spectrum was developed at the Finnish Meteorological Institute (FMI) by Dr.\ Anders Lindfors and collaborators and uses functions from 'libRadtran' as its engine, but saves some additional metadata to the output file. The main addition is related to the estimation of the effect of clouds.

Functions read_fmi_cum() and read_m_fmi_cum() can be used to read text files output as daily integrated spectral irradiance. In other words cummulated daily data. Function read_fmi2mspct() reads spectral irradiance, extracting multiple sequential spectra from a single file.

The first few lines of the files with cummulated data look like this:

# date number_of_scans start_scan stop_scan max_time_gap max_sza_gap warnings
# 20140821 15 3:30:00 17:30:00 60 7.4
# wavelength exposure(J/m2/nm)
2900 0.00000000e+00
2910 2.93132235e-05
2920 7.23526379e-04
...

We can read an individual file into a source_spct object while adding some metadata read from the file header. In this case values are for daily global spectral energy exposures rather than irradiances. Wavelengths are expressed in Angstroms instead of nanometres.

fmi.file <- 
  system.file("extdata", "2014-08-21_cum.hel", 
              package = "photobiologyInOut", mustWork = TRUE)
z.spct <- read_fmi_cum(fmi.file)
class_spct(z.spct)
getWhenMeasured(z.spct)
z.spct

With function read_m_fmi_cum with an ``m'' in the name we can read several files each containing a single spectrum. The returned object is a collection of source spectra.

fmi.files <- 
  system.file("extdata", c("2014-08-21_cum.hel", "2014-08-21_cum.hel"),
              package = "photobiologyInOut", mustWork = TRUE)
z.mspct <- read_m_fmi_cum(fmi.files)
class(z.mspct)
getWhenMeasured(z.mspct)
z.mspct

Above we gave the names of the files explicitly, but as we show here, one can build on-the-fly a list of file names matching some pattern. The example below is not run, as the location of example files may vary. The string "." should be replaced with the path to the folder where the files to be read are located.

fmi.files <- list.files(".", "*cum.hel")
fmi.files <- paste(".", fmi.files, sep = "")
z1.mspct <- read_m_fmi_cum(fmi.files)
class(z1.mspct)
getWhenMeasured(z1.mspct)
z1.mspct

One also add a geocode at the time of import (or later).

# because of Google's query limits call will frequently fail without a key
# my.geocode <- ggmap::geocode("Kumpula, Helsinki, Finland", source = "google")
my.geocode <- data.frame(lon = 24.96474, lat = 60.20911)
z2.mspct <-
  read_m_fmi_cum(fmi.files,
                 geocode = my.geocode)
class(z2.mspct)
getWhenMeasured(z2.mspct)
getWhereMeasured(z2.mspct)
z2.mspct

Files with spectral irradiance contain data for multiple spectra stored as text. Each spectrum is delimited at the top by a header line with metadata and at the end by "end" in a line by itself.

The first few and last lines for each spectrum look like this:

# 20130501 3:30:00 3:30:00 82.656
210.00 2900 0
210.00 2910 0
210.00 2920 0
...
210.00 8480 103.73018
210.00 8490 104.63495
210.00 8500 90.18384
end
...

The number of spectra, range of wavelengths and the length of each spectrum can vary. Function read_fmi2mspct() uses matching to the delimiters to read all the data in all cases.

fmi.file <- 
  system.file("extdata", "2013-05-01.hel", 
              package = "photobiologyInOut", mustWork = TRUE)
z3.mspct <- read_fmi2mspct(fmi.file)
class(z3.mspct)[1:2]
getWhenMeasured(z3.mspct[[1]])
length(z3.mspct)
names(z3.mspct)
getWhenMeasured(z3.mspct[[1]])
getWhatMeasured(z3.mspct[[1]])

Online public repositories of spectral data

Functions for importing spectral data downloaded from repositories.

| R function | Data repository | Version | class of value | |:-----------------------|:-----------------------|:------------------|:-----------------| | read_FReD_csv() | Floral Reflectance db. | 2017-03-19 | reflector_spct | | read_ASTER_txt() | ASTER spectral lib. | version 2.0 ASCII | reflector_spct |

FReD Floral Reflectance Database

The files downloaded from FReD do not contain a header, but the first column indicates the flower ID.

157, 300, 0.0627119 
157, 301, 0.0654036 
157, 302, 0.0677941 
157, 303, 0.0670396 
...
fred.file <- 
  system.file("extdata", "FReDflowerID_157.csv", 
              package = "photobiologyInOut", mustWork = TRUE)
fred.spct <- read_FReD_csv(file = fred.file, 
                           label = "Gazania heterochaeta",
                           geocode = data.frame(lat = -28.8751, lon = 17.2293))

In this case as there is no metadata present in the file, it needs to be supplied by the user.

fred.spct
cat(comment(fred.spct))
autoplot(fred.spct)

ASTER spectral database

The files downloaded from ASTER contain a 25-lines-long header, but at the moment only the first field is decoded, as the whole header copied as a comment.

Name: Dry grass
Type:  Vegetation
Class:  Grasses
Subclass:  Dry grass
Particle Size:  Solid
Sample No.:  drygrass.doc
Owner:  JHU
Wavelength Range:  All
Origin:  The entire spectral range was measured at Johns Hopkins University.

Description:  Dry grass.  Spectra were assembled from two segments; the 
bidirectional VNIR and SWIR comprising segment one, and the hemispherical 
MWIR and TIR comprising segment two. The VNIR/SWIR spectrum was 
measured in the laboratory at JHU with a GER IRIS Mark IV, using a large piece 
of sod.  The grass was illuminated from directly above and measured at a 
reflectance angle of 60 degrees to avoid viewing the thatch. 
Measurement:  Bidirectional and directional hemispherical reflectance. 
First Column:  X
Second Column:  Y  
X Units:  Wavelength (micrometers)
Y Units:  Reflectance (percent)
First X Value: 0.38049
Last X Value: 14.011 
Number of X Values: 2559
Additional Information:  None.

0.38049 14.249   
0.38299 14.251      
0.38544 14.546      
0.38791 14.694      
...
aster.file <- 
  system.file("extdata", "drygrass-spectrum.txt", 
              package = "photobiologyInOut", mustWork = TRUE)
aster.spct <- read_ASTER_txt(file = aster.file)

The label and comment are set from the file header.

aster.spct
cat(comment(aster.spct))
autoplot(aster.spct)

Other R packages

A general way of exchanging data with other R packages or for use with base R functions is to create a matrix from a collection of spectra with as.matrix(), or a collection of spectra from a matrix with one of the as.xxxx_mspct() methods such as as.source_spct. Such methods are defined in package 'photobiology' as well as method join_mspct() for conversion of collections of spectra into wide data frames. However, a matrix is only guaranteed to contain numeric data and a "dim" attribute, while conversion to a data frame preserves only part of the metadata. These generic conversions cannot be reversed without loss of information. When possible use the package specific functions as they automate much of the recovery and preservation of metadata.

Functions for exchanging data with foreign R packages.

| R function | Foreign R package | Function | class of value | |:-----------------------|:-----------------------|:------------------|:-----------------| | hyperSpec2spct() | ’hyperSpec’ | import | source_spct | | spct2hyperSpec() | ’hyperSpec’ | export | hyperSpec | | hyperSpec2mspct() | ’hyperSpec’ | import | source_mspct | | mspct2hyperSpec() | ’hyperSpec’ | export | hyperSpec | | colorSpec2spct() | ’colorSpec’ | import | source_spct | | spct2colorSpec() | ’colorSpec’ | export | colorSpec | | colorSpec2mspct() | ’colorSpec’ | import | source_mspct | | mspct2colorSpec() | ’colorSpec’ | export | colorSpec | | chroma_spct2colorSpec() | ’colorSpec’ | export | colorSpec | | rspec2mspct() | ’pavo’ | import | source_mspct |

To 'hyperSpec'

Can export to ''hyperSpec'' objects only collections of spectra where all members have identical w.length vectors, as objects of class hyperSpec store a single vector of wavelengths for the whole collection of spectra.

z2.hspct <- mspct2hyperSpec(z2.mspct, "s.e.irrad")
class(z2.hspct)
# plot(z2.hspct)

From 'hyperSpec'

Can import only data with wavelength in nanometres. Other quantities and units are not supported by the 'photobiology' classes for spectral data. See package 'hyperSpec' vignette "laser" for details on the data and the conversion of the original wavelength units into nanometres.

data(laser)
class(laser)
laser
plot(laser)

We assume here, that the quantity for the spectral emission of the laser is spectral energy irradiance, expressed in $mW\,m^{-2}\,nm^{-1}$. This is likely to be wrong but for the sake of showing how the conversion takes place is irrelevant. The parameter multiplier can be passed a numeric argument to rescale the original data. The default multiplier is 1.

wl(laser) <- list (
  wl = 1e7 / (1/405e-7 - wl (laser)),
  label = expression (lambda / nm)
)
laser
plot(laser)
laser.mspct <-
  hyperSpec2mspct(laser, "source_spct", "s.e.irrad", multiplier = 1e-3)
ggplot(laser.mspct[[1]]) +
  geom_line() +
  stat_peaks(geom = "text", vjust = -1, label.fmt = "%.6g nm", color = "red")

From 'colorSpec'

# bug that needs to be fixed
fluorescent.mspct <- colorSpec2mspct(colorSpec::Fs.5nm)
print(fluorescent.mspct, n = 3, n.members = 3)
colorSpec2mspct(colorSpec::Hoya)
fluorescent.spct <- colorSpec2spct(colorSpec::Fs.5nm)
autoplot(fluorescent.spct) + aes(linetype = spct.idx)
colorSpec2chroma_spct(colorSpec::xyz1931.5nm)

To 'colorSpec'

sun.cspec <- spct2colorSpec(sun.spct)
plot(sun.cspec, col = "blue")
spct2colorSpec(yellow_gel.spct)
chroma_spct2colorSpec(beesxyzCMF.spct)

From 'pavo'

In this example we convert an rspec object from package 'pavo' into a collection of spectra and then we plot it with ggplot methods from package ggspectra' (an extension toggplot2'). The data are the spectral reflectance of the plumage from seven different individual birds of the same species, measured in three different body parts.

data(sicalis)
class(sicalis)
names(sicalis)

We convert the data into a collection of spectra, and calculate summaries for three spectra.

sicalis.mspct <- rspec2mspct(sicalis, "reflector_spct", "Rpc")
summary(sicalis.mspct[[1]])
summary(sicalis.mspct[[2]])
summary(sicalis.mspct[[3]])

We convert the subset of the collection corresponding to the first individual into a single spectra object for plotting with ggplot.

ggplot(rbindspct(sicalis.mspct[1:3])) +
  aes(linetype = spct.idx) +
  ylim(0,0.3) +
  geom_line()

Here we extract the ``crown'' data from all individuals and plot these spectra in a single plot.

print(sicalis.mspct[c(TRUE, FALSE, FALSE)])
ggplot(rbindspct(sicalis.mspct[c(TRUE, FALSE, FALSE)])) +
  aes(linetype = spct.idx) +
  ylim(0,0.15) +
  geom_line() +
  ggtitle("'crown' reflectance spectra")

We calculate the mean reflectance in wavebands corresponding to ISO colors obtaining a data frame. We then add to this returned data frame a factor indicating the body parts.

refl.by.band <- reflectance(sicalis.mspct, w.band = list(Red(), Green(), Blue(), UVA()))
refl.by.band$body.part <- c("crown", "throat", "breast")
refl.red <- reflectance(sicalis.mspct, w.band = Red())
names(refl.red)[2] <- "red.reflectance"
refl.red$body.part <- c("crown", "throat", "breast")
ggplot(refl.red, aes(x = body.part, y = red.reflectance)) +
  stat_summary(fun.data = "mean_se", color = "red") +
  geom_point(alpha = 0.5)

Dealing with odd and bad data

Using locales

Most functions in this package have a parameter locale, that accepts readr::locale objects as arguments. At the moment only the time zone and decimal mark are respected. This allows files using comma for decimal marker be easily imported, or the dates and times in the input file be interpreted in a given time zone. Setting the correct time zone is very important to avoid errors. Time coordinates are always stored in the created objects using universal time coordinates ("UTC").

jaz.irrad.comma.file <- 
  system.file("extdata", "spectrum-comma.JazIrrad", 
              package = "photobiologyInOut", mustWork = TRUE)
my.locale <- locale(decimal_mark = ",", tz = "EET")
jaz00.spct <- read_oo_jazirrad(file = jaz.irrad.comma.file,
                               locale = my.locale)
jaz00.spct

Overriding default metadata

We revisit now the Jaz irradiance data to show how the metadata can be changed by the user if needed (e.g. clock settings at the time of data acquisition were wrong).

A variable with the user supplied date and time data, or the date read from the header (the text itself) not the file date as the file date may not reflect the creation date and time.

jaz.s.irrad.file <- 
  system.file("extdata", "spectrum.JazIrrad", 
              package = "photobiologyInOut", mustWork = TRUE)
jaz01.spct <- read_oo_jazirrad(file = jaz.s.irrad.file,
                               date = NULL)
getWhenMeasured(jaz01.spct)
jaz02.spct <- read_oo_jazirrad(file = jaz.s.irrad.file,
                               date = ymd_hms("2015-11-15 12:00:00"))
getWhenMeasured(jaz02.spct)
jaz03.spct <- read_oo_jazirrad(file = jaz.s.irrad.file,
                               date = now())
getWhenMeasured(jaz03.spct)

Adding additional metadata

When can add a geocode, either directly by giving latitude and longitude coordinates or by generating it from a Google maps search using function ggmap::geocode() as shown here.

my.geocode <- data.frame(lon = 25.02006, lat = 60.22525)
jaz04.spct <- read_oo_jazirrad(file = jaz.s.irrad.file,
                               geocode = my.geocode)
jaz04.spct
getWhereMeasured(jaz04.spct)


Try the photobiologyInOut package in your browser

Any scripts or data that you put into this service are public.

photobiologyInOut documentation built on Jan. 12, 2020, 4:09 p.m.