# 2 and 3 dimensional gof test based on the in-and-out-of-sample approach

### Description

`gofPIOSRn`

tests a 2 or 3 dimensional dataset with the approximate PIOS test for a copula. The possible copulae are "normal", "t", "gumbel", "clayton" and "frank". The parameter estimation is performed with pseudo maximum likelihood method. In case the estimation fails, inversion of Kendall's tau is used. The approximate p-values are computed with a semiparametric bootstrap, which computation can be accelerated by enabling in-build parallel computation.

### Usage

1 2 3 |

### Arguments

`copula` |
The copula to test for. Possible are the copulae |

`x` |
A 2 or 3 dimensional matrix containing the residuals of the data. |

`M` |
Number of bootstrapping loops. |

`param` |
The parameter to be used. |

`param.est` |
Shall be either |

`df` |
Degrees of freedom, if not meant to be estimated. Only necessary if tested for |

`df.est` |
Indicates if |

`margins` |
Specifies which estimation method shall be used in case that the input data are not in the range [0,1]. The default is |

`dispstr` |
A character string specifying the type of the symmetric positive definite matrix characterizing the elliptical copula. Implemented structures are "ex" for exchangeable and "un" for unstructured, see package |

`execute.times.comp` |
Logical. Defines if the time which the estimation most likely takes shall be computed. It'll be just given if |

`processes` |
The number of parallel processes which are performed to speed up the bootstrapping. Shouldn't be higher than the number of logical processors. Please see the details. |

### Details

The "Rn" test is introduced in Zhang et al. (2015). It is a information ratio statistic which is approximately equivalent to the "Tn" test, which is the PIOS test. Both test the *H0* hypothesis

*H0 : C0 in Ccal.*

"Rn" is introduced because the "Tn" test has to estimate *n/m* parameters which can be computationally demanding. The test statistic of the "Tn" test is defined as

*T = sum(sum(l(U_k^b;theta_n ) - l(U_k^b;theta_n^(-b) ), k=1, ...,m ), b=1, ...,M)*

with *l* the log likelihood function, the pseudo observations *U[ij]* for *i = 1, ...,n*; *j = 1, ...,d* and

*theta_n = arg max_theta sum(l(U_i; theta), i=1, ..., n)*

and

*theta_n^(-b) = arg max_theta sum(sum(l(U_k^(b^'); theta), k=1, ..., m), b^'=1, ..., M, b^' != b), b = 1, ..., M.*

By defining two information matrices

*S(theta) = E0 [d^2/d theta d theta^T l(U_1; theta)],*

*V(theta) = E0 [d/d theta l(U_1; theta) d/d theta l^T(U_1; theta)]*

where *S(.)* represents the negative sensitivity matrix, *V(.)* the variability matrix and *E0* is the expectation under the true copula *C0*. Under suitable regularity conditions, given in Zhang et al. (2015), holds then in probability, that

*T = tr{S(theta^*)^(-1) - V(theta^*)}*

as *n -> infinity.*

The approximate p-value is computed by the formula

*sum(|T[b]| >= |T|, b=1, .., M) / M,*

For more details, see Zhang et al. (2015). The applied estimation method is the two-step pseudo maximum likelihood approach, see Genest and Rivest (1995).

For small values of `M`

, initializing the parallization via `processes`

does not make sense. The registration of the parallel processes increases the computation time. Please consider to enable parallelization just for high values of `M`

.

### Value

A object of the `class`

gofCOP with the components

`method` |
a character which informs about the performed analysis |

`statistic` |
value of the test statistic |

`p.value` |
the approximate p-value |

### References

Zhang, S., Okhrin, O., Zhou, Q., and Song, P.. Goodness-of-fit Test For Specification of Semiparametric Copula Dependence Models. *under revision in Journal of Econometrics from 15.01.2014* http://sfb649.wiwi.hu-berlin.de/papers/pdf/SFB649DP2013-041.pdf

Genest, C., K. G. and Rivest, L.-P. (1995). A semiparametric estimation procedure of dependence
parameters in multivariate families of distributions. *Biometrika, 82:534-552*

### Examples

1 2 3 | ```
data(IndexReturns)
gofPIOSRn("normal", IndexReturns[c(1:100),c(1:2)], M = 20)
``` |