Introductory analysis of daily precipitation with hydroTSM

knitr::opts_chunk$set(echo = TRUE)

Installation

Installing the latest stable version (from CRAN):

install.packages("hydroTSM")

\noindent Alternatively, you can also try the under-development version (from Github):

if (!require(devtools)) install.packages("devtools")
library(devtools)
install_github("hzambran/hydroTSM")

Setting up the environment

Loading the hydroTSM package, which contains data and functions used in this analysis:

library(hydroTSM)

Loading daily precipitation data at the station San Martino di Castrozza, Trento Province, Italy, from 01/Jan/1921 to 31/Dec/1990.

data(SanMartinoPPts)

Selecting only a 6-years time slice for the analysis

x <- window(SanMartinoPPts, start="1985-01-01")

Dates of the daily values of 'x'

dates <- time(x)

Amount of years in 'x' (needed for computations)

( nyears <- yip(from=start(x), to=end(x), out.type="nmbr" ) )

Basic exploratory data analysis (EDA)

1) Summary statistics

smry(x)

2) Amount of days with information (not NA) per year

dwi(x)

3) Amount of days with information (not NA) per month per year

dwi(x, out.unit="mpy")

4) Computation of monthly values only when the percentage of NAs in each month is lower than a user-defined percentage (10% in this example).

# Loading the DAILY precipitation data at SanMartino
data(SanMartinoPPts)
y <- SanMartinoPPts

# Subsetting 'y' to its first three months (Jan/1921 - Mar/1921)
y <- window(y, end="1921-03-31")

## Transforming into NA the 10% of values in 'y'
set.seed(10) # for reproducible results
n           <- length(y)
n.nas       <- round(0.1*n, 0)
na.index    <- sample(1:n, n.nas)
y[na.index] <- NA

## Daily to monthly, only for months with less than 10% of missing values
(m2 <- daily2monthly(y, FUN=sum, na.rm=TRUE, na.rm.max=0.1))

# Verifying that the second and third month of 'x' had 10% or more of missing values
cmv(y, tscale="month")

4) Basic exploratory figures:

Using the hydroplot function, which (by default) plots 9 different graphs: 3 ts plots, 3 boxplots and 3 histograms summarizing 'x'. For this example, only daily and monthly plots are produced, and only data starting on 01-Jan-1987 are plotted.

hydroplot(x, var.type="Precipitation", main="at San Martino", 
          pfreq = "dm", from="1987-01-01")

Global view of daily precipitation values a calendar heatmap (six years maximum), useful for visually identifying dry, normal and wet days:

calendarHeatmap(x)

For each month, the previous figure is read from top to bottom. For example, January 1st 1987 was Thursday, January 31th 1987 was Saturday and November 1st 1990 was Thursday.

Selecting only a three-month time slice for the analysis:

yy <- window(SanMartinoPPts, start="1990-10-01")

Plotting the selected time series:

hydroplot(yy,  ptype="ts", pfreq="o", var.unit="mm")

Annual analysis

Annual values of precipitation

daily2annual(x, FUN=sum, na.rm=TRUE)

Average annual precipitation

Obvious way:

mean( daily2annual(x, FUN=sum, na.rm=TRUE) )

Another way (more useful for streamflows, where FUN=mean):

The function annualfunction applies FUN twice over x:

( i) firstly, over all the elements of x belonging to the same year, in order to obtain the corresponding annual values, and (ii) secondly, over all the annual values of x previously obtained, in order to obtain a single annual value.

annualfunction(x, FUN=sum, na.rm=TRUE) / nyears

Monthly analysis

1) Plotting the monthly precipitation values for each year, useful for identifying dry/wet months.

# Daily zoo to monthly zoo
m <- daily2monthly(x, FUN=sum, na.rm=TRUE)

# Creating a matrix with monthly values per year in each column
M <- matrix(m, ncol=12, byrow=TRUE)
colnames(M) <- month.abb
rownames(M) <- unique(format(time(m), "%Y"))

# Plotting the monthly precipitation values
require(lattice)
print(matrixplot(M, ColorRamp="Precipitation", 
           main="Monthly precipitation at San Martino st., [mm/month]"))

2) Median of the monthly values at station 'x'. Not needed, just for looking at these values in the boxplot.

monthlyfunction(m, FUN=median, na.rm=TRUE)

3) Vector with the three-letter abbreviations for the month names

cmonth <- format(time(m), "%b")

4) Creating ordered monthly factors

months <- factor(cmonth, levels=unique(cmonth), ordered=TRUE)

5) Boxplot of the monthly values

boxplot( coredata(m) ~ months, col="lightblue", main="Monthly Precipitation", 
         ylab="Precipitation, [mm]", xlab="Month")

Seasonal analysis

Average seasonal values of precipitation

seasonalfunction(x, FUN=sum, na.rm=TRUE) / nyears

Extracting the seasonal values for each year

( DJF <- dm2seasonal(x, season="DJF", FUN=sum) )
( MAM <- dm2seasonal(m, season="MAM", FUN=sum) )
( JJA <- dm2seasonal(m, season="JJA", FUN=sum) )
( SON <- dm2seasonal(m, season="SON", FUN=sum) )

Plotting the time evolution of the seasonal precipitation values

hydroplot(x, pfreq="seasonal", FUN=sum, stype="default")

Some extreme indices

Common steps for the analysis of this section:

Loading daily precipitation data at the station San Martino di Castrozza, Trento Province, Italy, with data from 01/Jan/1921 to 31/Dec/1990.

data(SanMartinoPPts)

Selecting only a 6-year time slice for the analysis

x <- window(SanMartinoPPts, start="1985-10-01")

Plotting the selected time series

hydroplot(x,  ptype="ts", pfreq="o", var.unit="mm")

Seasonality index

Computing the seasonality index defined by Walsh and Lawler (1981) to classify the precipitation regime of x:

si(x)

According to the seasonality index defined by Walsh and Lawler (1981), a value of 0.35 corresponds to a precipitation regime that can be classified as "Equable but with a definite wetter season" (see more details with ?si).

Heavy precipitation days (R10mm)

Counting and plotting the number of days in the period where precipitation is > 10 [mm]:

( R10mm <- length( x[x>10] ) )

Very wet days (R95p)

Identifying the wet days (daily precipitation >= 1 mm):

wet.index <- which(x >= 1)

Computing the 95th percentile of precipitation on wet days (PRwn95):

( PRwn95 <- quantile(x[wet.index], probs=0.95, na.rm=TRUE) )

Note 1: this computation was carried out for the three-year time period 1988-1990, not the 30-year period 1961-1990 commonly used.

Note 2: missing values are removed from the computation.

Identifying the very wet days (daily precipitation >= PRwn95):

(very.wet.index <- which(x >= PRwn95))

Computing the total precipitation on the very wet days:

( R95p <- sum(x[very.wet.index]) )

Note 3: this computation was carried out for the three-year time period 1988-1990, not the 30-year period 1961-1990 commonly used

5-day total precipitation

Computing the 5-day total (accumulated) precipitation:

x.5max <- rollapply(data=x, width=5, FUN=sum, fill=NA, partial= TRUE, 
                    align="center")

hydroplot(x.5max,  ptype="ts+boxplot", pfreq="o", var.unit="mm")

Maximum annual value of 5-day total precipitation:

(x.5max.annual <- daily2annual(x.5max, FUN=max, na.rm=TRUE))

Note 1: for this computation, a moving window centred in the current day is used. If the user wants the 5-day total precipitation accumulated in the 4 days before the current day + the precipitation in the current day, the user have to modify the moving window.\newline

Note 2: For the first two and last two values, the width of the window is adapted to ignore values not within the time series

Climograph

Since v0.5-0, hydroTSM includes a function to plot a climograph, considering not only precipitation but air temperature data as well.

# Loading daily ts of precipitation, maximum and minimum temperature
data(MaquehueTemuco)

# extracting individual ts of precipitation, maximum and minimum temperature
pcp <- MaquehueTemuco[, 1]
tmx <- MaquehueTemuco[, 2]
tmn <- MaquehueTemuco[, 3]

Plotting a full climograph:

m <- climograph(pcp=pcp, tmx=tmx, tmn=tmn, na.rm=TRUE, 
                main="Maquehue Temuco Ad (Chile)", lat=-38.770, lon=-72.637)

\newpage Plotting a climograph with uncertainty bands around mean values, but with no labels for tmx and tmn:

m <- climograph(pcp=pcp, tmx=tmx, tmn=tmn, na.rm=TRUE, tmx.labels=FALSE, tmn.labels=FALSE, 
                main="Maquehue Temuco Ad (Chile)", lat=-38.770, lon=-72.637)

\newpage Plotting a climograph with uncertainty bands around mean values, but with no labels for tmx, tmn and pcp:

m <- climograph(pcp=pcp, tmx=tmx, tmn=tmn, na.rm=TRUE, 
                pcp.labels=FALSE, tmean.labels=FALSE, tmx.labels=FALSE, tmn.labels=FALSE, 
                main="Maquehue Temuco Ad (Chile)", lat=-38.770, lon=-72.637)

\newpage To better represent the hydrological year in Chile (South America), the following figure will plot a full climograph starting in April (start.month=4) instead of January (start.month=1):

m <- climograph(pcp=pcp, tmx=tmx, tmn=tmn, na.rm=TRUE, 
                start.month=4, temp.labels.dx=c(rep(-0.2,4), rep(0.2,6),rep(-0.2,2)),
                main="Maquehue Temuco Ad (Chile)", lat=-38.770, lon=-72.637)

Software details

This tutorial was built under:

sessionInfo()$platform
sessionInfo()$R.version$version.string 
paste("hydroTSM", sessionInfo()$otherPkgs$hydroTSM$Version)

Version history

Appendix

In order to make easier the use of \texttt{hydroTSM} for users not familiar with R, in this section a minimal set of information is provided to guide the user in the R world.

Editors, GUI

Importing data

Useful Websites

F.A.Q.

How to print more than one matrixplot in a single Figure?

Because matrixplot is based on lattice graphs, normal plotting commands included in base R does not work. Therefore, for plotting ore than 1 matrixplot in a single figure, you need to save the individual plots in an R object and then print them as you want.

In the following sequential lines of code, you can see two examples that show you how to plot two matrixplots in a single Figure:

library(hydroTSM)
data(SanMartinoPPts)
x <- window(SanMartinoPPts, end=as.Date("1960-12-31"))
m <- daily2monthly(x, FUN=sum, na.rm=TRUE)
M <- matrix(m, ncol=12, byrow=TRUE)
colnames(M) <- month.abb
rownames(M) <- unique(format(time(m), "%Y"))
p <- matrixplot(M, ColorRamp="Precipitation", main="Monthly precipitation,")

print(p, position=c(0, .6, 1, 1), more=TRUE)
print(p, position=c(0, 0, 1, .4))

The second and easier way allows you to obtain the same previous figure (not shown here), but you are required to install the gridExtra package:

if (!require(gridExtra)) install.packages("gridExtra")
require(gridExtra) # also loads grid
require(lattice)

grid.arrange(p, p, nrow=2)


Try the hydroTSM package in your browser

Any scripts or data that you put into this service are public.

hydroTSM documentation built on May 29, 2024, 8:36 a.m.