knitr::opts_chunk$set( collapse = TRUE, fig.align="center", fig.width = 5, fig.height = 5, comment = "#>" )
library(doremi)
The differential equation considered in this case is the following:
$$\frac{d^2y}{dt} + 2\xi\omega_{n}\frac{dy}{dt} + \omega_{n}^2 y = k\omega_{n}^2u(t)$$ (1)
Where:
$y(t)$ is the signal to be analyzed
$\frac{dy}{dt}$ is its first derivative
$\frac{d^2y}{dt}$ is its second derivative
$u(t)$ is the excitation term perturbing the dynamics of $y(t)$
And regarding the coefficients:
$\omega_{n} = \frac{2\pi}{T}$ is the system's natural frequency, the frequency with which the system would oscillate if there were no damping. $T$ is the corresponding period of oscillation. The term $\omega_{n}^2$ represents thus the ratio between the attraction to the equilibrium and the inertia. If we considered the example of a mass attached to a spring, this term would represent the ratio of the spring constant and the object's mass.
$\xi$ is the damping ratio. It represents the friction that damps the oscillation of the system (slows the rate of change of the variable). The value of $\xi$ determines the shape of the system time response, which can be: $\xi<0$ Unstable, oscillations of increasing magnitude $\xi=0$ Undamped, oscillating $0<\xi<1$ Underdamped or simply "damped". Most of the studies use this model, also referring to it as "Damped Linear Oscillator" (DLO). $\xi=1$ Critically damped $\xi>1$ Over-damped, no oscillations in the return to equilibrium
$k$ is the gain. It represents the proportionality between the stationary increase of $y$ and the constant increase of $u$ that caused it.
$yeq$ is the signal equilibrium value, i.e. the stationary value reached when the excitation term is 0.
If the excitation term $u(t)$ is null, then the equation reduce to
$$\frac{d^2y}{dt} + 2\xi\omega_{n}\frac{dy}{dt} + \omega_{n}^2 y = 0$$ (2)
Equation (2) can also be found in the social/behavioral sciences literature as: $$y'' + \zeta y' + \eta y = 0$$ (3) That assumes the $y_{eq}$ is 0.
In which: $$\zeta= 2\xi\omega_{n}$$ and $$\eta=\omega_{n}^2$$ (4)
The dynamics in this case are then provoked either by a previous excitation that is no longer present or by the displacement of the system from its equilibrium position (either due to an initial condition different from 0 or an initial "speed" or derivative at time 0 different from 0): $$y(t=0)=y_{0}$$ $$\frac{dy}{dt}(t=0)=v_{0}$$ The shape of the solution to equation (2) -also called trajectory, or system response in engineering- will change according to the values of the parameters $\xi$ and $\omega_{n}^2$ presented, specially the values of $\xi$, as this parameter will define if the behavior is divergent, oscillating or undamped, underdamped (oscillations decreasing exponentially) or overdamped (system going back to equilibrium without oscillations) as it can be seen in the figure below:
data11a <- data.table::rbindlist(lapply(seq(0,2,0.2), function(eps){ generate.2order(time = 0:49, y0 = 1, xi = eps, period = 20)[,xi := eps][] })) # plot ggplot2::ggplot(data11a,ggplot2::aes(t,y,color = as.factor(xi)))+ ggplot2::geom_line() + ggplot2::labs(x = "time (arb. unit)", y = "signal (arb. unit)", colour = "xi")
Two functions are available to simulate data in the package: generate.2order
simulate the solution of the differential equation for a given vector of time, and the parameters period = $T$, xi = $\xi$, yeq = $y_{eq}$, y0 = $y(t = 0)$, v0 = $\frac{dy}{dt}(t=0)$, k = $k$ and the vector excitation $u(t)$. The function create a data.table with a column t for the time, y for the signal, and exc for the excitation.
test <- generate.2order(time = 0:100,y0 = 10,v0 = 0,period = 10,xi = 0.2) plot(test$t,test$y)
The function generate.panel.2order
uses generate.2order
to generate a panel of nind
individuals, with measurement noise and inter-individual noise.
The parameter internoise
allow the parameters of the differential equation to vary between the individuals. They are in this case distributed along a normal distribution centered on the parameter value given to the generate.panel.2order
function with a standard deviation of internoise
parameter.
The parameter intranoise
allows to ass measurement noise. intranoise
is the ratio between the measurement noise' standard deviation and the signal' standard deviation.
time <- 0:100 set.seed(123) data1 <- generate.panel.2order(time = time, y0 = 10, xi = 0.1, period = 30, yeq = 2, nind = 6, internoise = 0.1, intranoise = 0.3) data1
When there is no excitation, the input can be set to NULL (default value), like in the example above. As presented on the first order differential equation vignette, the result can be easily plotted through the plot
command:
plot(data1) + ggplot2::geom_hline(yintercept=0)
We see here that the period, the equilibrium value and the damping parameter vary between each individual.
Let's note that the damping ratio parameter allows to generate not only oscillating signals, that is when $0<\xi<1$, but also signals where the system can reach its equilibrium value without oscillations: these are the critically damped ($\xi=1$) and overdamped ($\xi>1$). The simulation functions of the package also allow the generation of these behaviors, as shown below:
set.seed(123) data2a <- generate.panel.2order(time = 0:99, y0 = 1, period = 30, nind = 1, intranoise = 0.2) set.seed(123) data2b <- generate.panel.2order(time = 0:99, y0 = 1, xi = 1, period = 30, intranoise = 0.2) set.seed(123) data2c <- generate.panel.2order(time = 0:99, y0 = 1, xi = 2, period = 30, intranoise = 0.2)
gridExtra::grid.arrange(plot(data2a)+ ggplot2::ggtitle("undamped, xi=0"), plot(data2b)+ ggplot2::ggtitle("critically damped, xi=1"), plot(data2c)+ ggplot2::ggtitle("overdamped, xi=2"), ncol= 3)
Analyzing the previous dataset and as for the first order model, the user must specify the name of the columns containing the id of the participants, the excitation, and the signal. Several methods are available for the estimation of the derivatives and the user needs to specify which method to use (gold
is the default) and the embedding dimension/smoothing parameter (see the package pdf manual for more details).
res1 <- analyze.2order(data = data1, id = "id", time ="time", signal = "signal", dermethod = "glla", derparam = 13, order = 2)
Now let’s take a look at the result. It is possible to plot the estimated curve from the estimated coefficients, to visually inspect the analysis:
plot(res1)
The different parts of the resulting doremi object are the same as those for the first order:
data: contains the original dataset and extra columns for intermediate calculations. The column "signalname_derivate2" is the only additional one with respect to the first order result.
resultmean: contains the fixed coefficients resulting from the regression for all the individuals. It is also the part of the result displayed when one calls the variable name (that calls the print method for doremi objects):
res1
Beware that, as known in the two-steps procedures, and as in the first order case, the estimation of derivatives is a source of bias and thus the error terms provided by the regression are not final. Nevertheless, it is possible to obtain from the summary of the regression the standard errors calculated for the coefficients estimated that will be $\zeta$, $\eta$ (see equations 3 and 4) and $y_{eq}\zeta$ if the equilibrium value is different from 0.
In the resultid object, the first columns are the terms resulting from the regression:
omega2 is the term $\omega_n^2$ or $\eta$, with its standard error omega2_stde
xi2omega is the term $2\xi \omega_n^2$ or $\zeta$, with its standard error xi2omega_stde
yeqomega2 is $y_{eq}\omega_n^2$ or $y_{eq}\zeta$, with its standard error yeqomega2_stde
komega2 is $k\omega^2$ the coefficient associated to the external excitation term in the regression, with its standard error komega2_stde. As there is no excitation on this example, it is NA.
Whereas the following are the values calculated from these first columns:
period is the oscillation period, that can be extracted from the term $\omega_n^2$, were $T=2\pi/\omega_n$
wn is the natural frequency, calculated also from the term omega2 = $\omega_n^2$
xi is the damping factor, calculated from the term xi2omega = $2\xi \omega_n^2$
yeq is the equilibrium value, calculated from yeqomega2 = $y_{eq}\omega_n^2$
k is the gain, calculated from komega2
resultid: contains the same coefficients but reconstructed for each individual. In the figure above it can be seen that for some of them, the fit wasn't very good. This will be enhanced using the function optimum_param
that will identify which embedding number produces the best estimate, as for the first order case. For each individual we then have:
res1$resultid
The doremi object will also contain the derivative method used and the embedding number/smoothing parameter used for further reference.
In this example we will generate data for 5 individuals, that respond to a "step" excitation (an excitation that changes value abruptly, from 0 to 1 in this case). We will consider no dynamic noise, no variation of the damping factor, period, or equilibrium value across individuals (no interindividual noise).
time <- 0:100 data1 <- generate.panel.2order(time = time, excitation = as.numeric(time>20), xi = 0.1, period = 30, k = 1, nind = 5)
Plotting once more the data with the plot method available in the package for doremidata objects:
plot(data1)
The call to the function remains almost the same, this time with a noise to signal ratio of 0.3 and a 20% inter-individual noise:
# Generation of signals with intra and inter-noise time <- 0:100 data2 <- generate.panel.2order(time = time, excitation = as.numeric(time>20), xi = 0.1, period = 30, k = 1, nind = 5, internoise = 0.2, intranoise = 0.3)
plot(data2)
And, as it can be seen in the figures, the coefficients change according to the person (damping factor, period, gain). Initial value, speed and equilibrium value could also change if their initial value was different from 0. These have been set to 0 for readability of the results but they could also be included.
The functions to generate the solution of the second order differential equation allow to specify the time for which the initial condition (y0 and v0) are given. This time must be between the minimum and the maximum value of the time vector given to the function. Below an example specifying the value, the derivative of the signal at a given time:
time <- 0:99 data3 <- generate.panel.2order(time = time, excitation = as.numeric(time>20), xi = 0.3, period = 30, k = 1, y0 = 2, v0 = 1, t0 = 15,nind = 1) plot(data3)
The function generate.2order
generated for all time given the unique solution of the differential equation that has value 3 at $t = 25$ and a derivative of 1 at this point.
The excitation can be of any form. This package can also be used to simulate driven damped oscillators:
t <- 0:99 excitation <- 5*sin(2*pi*t/10) driven_dlo <- generate.panel.2order(time = t, excitation = excitation, y0 = 10, xi = 0.2, period = 20, nind = 1) plot(driven_dlo)
Here we can see that the system, which has a natural period of 20, has a steady state that oscillate with the same period as the excitation, that is a period of 10.
Analyzing the signals generated in the previous examples, we will verify that the parameters were the one introduced in the simulation function and that the estimated signals generated match the simulated ones.
The simplest case is the one in which the data measured corresponds to a single individual. The call to the function is almost the same but omitting the id
parameter in the call. The input parameter "verbose" as in other R functions, allows to print (using the package futile.logger
) the actions carried out by the function until the calculation of the result:
res1 <- analyze.2order(data = data1[id==1], input = "excitation", time ="time", signal = "signal", dermethod = "gold", derparam = 3, verbose=T)
plot(res1)
Analyzing the data generated in the example 2 of the simulation section, the user must specify the name of the columns containing the id of the participants, the excitation, and the signal.
As several methods are available for the estimation of the derivatives, the user needs to specify which method to use (gold
is the default) and the embedding dimension (see the package pdf manual for more details).
res2 <- analyze.2order(data = data2, id = "id", input ="excitation", time ="time", signal = "signal", dermethod = "gold", derparam = 5, order = 4)
plot(res2)
As it was mentioned before, the estimation of derivatives is a source of bias. The previous fit can be enhanced in three ways:
By changing the embedding number/smoothing parameter
By changing the order of the derivative
By changing the derivation method
In the following example, we will use the function optimum_param
of the doremi
package to find the embedding number that provides an R2 the closest to 1. The other ways can be tested "manually" by calling the other functions, for instance, in a simulation study.
res3 <- optimum_param (data=data2, id="id", input="excitation", time="time", signal="signal", model = "2order", dermethod = "glla", order = 2, pmin = 5, pmax = 17, pstep = 2) res3$analysis res3$summary_opt res3$d
And, from the range provided, an embedding number of 13 produces the best fit and that the coefficients are closer to their true values than the ones estimated in the previous example.
If we want to graphically see the evolution of the coefficients according to the embedding number in this case, we can easily plot the results with the plot
function. This will call the method for the plotting of "doremiparam" objects:
plot(res3)
And doing again the analysis with the optimum embedding number:
res3b <- analyze.2order(data = data2, id = "id", input ="excitation", time ="time", signal = "signal", dermethod = "glla", derparam = res3$d, order = 2) res3b
plot(res3b)
In this example, we will generate the response to 3 excitations, with a gain different for each excitation.
#Simulating data with these hypothesis #Generating the three excitation signals: time <- 1:100 u1 <- as.numeric(time < 20 & time > 10) u2 <- as.numeric(time < 40 & time > 30) u3 <- as.numeric(time < 80 & time > 70) # Arbitrarily choosing a = 1, b = 2 and c = 5 for the first individual et1 <- u1 + 3*u2 + 5*u3 y1 <- generate.2order(time = time, excitation = et1)$y #as we are using the $y argument of the object generated #Signals for the second individual; # Arbitrarily choosing a = 1, b = 2.5 and c = 4 for the second individual et2 <- u1 + 2.5*u2 + 4*u3 y2 <- generate.2order(time = time, excitation = et2)$y #Generating table with signals dataa4 <- data.table::data.table(id = rep(c(1, 2), c(length(et1), length(et2))), time = c(time, time), excitation1 = rep(u1,2), excitation2 = rep(u2,2), excitation3 = rep(u3,2), signal_no_noise = c(y1, y2)) dataa4[,signal := signal_no_noise + rnorm(.N,0,0.5)] dataa4[,excitation := excitation1 + excitation2 + excitation3]
#Plotting signals ggplot2::ggplot( data = dataa4) + ggplot2::geom_line(ggplot2::aes(time,signal_no_noise, colour = "Signal_no_noise"))+ ggplot2::geom_point(ggplot2::aes(time,signal, colour = "Signal"))+ ggplot2::geom_line(ggplot2::aes(time,excitation,colour = "Total excitation"))+ ggplot2::facet_wrap(~id)+ ggplot2::labs(x = "Time (s)", y = "Signal (arb. unit)", colour = "")
We see that we generate three different amplitudes of response for these three excitations. It is possible to estimate the gain for each excitation by giving a vector of the different excitation column to the analyze.2order
function, as shown below:
#Analyzing signals res4 <- analyze.2order(data = dataa4, id = "id", input = c("excitation1", "excitation2", "excitation3"), time = "time", signal = "signal", dermethod = "glla", derparam = 7) #Looking for the calculation of the coefficients of the excitation res4 res4$resultid
And one can find the gains estimated for each excitation by extracting them from the $resultid
. They are a good approximation of the coefficients introduced.
#Plotting signals plot(res4)
Any scripts or data that you put into this service are public.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.