Bagidis-package: BAGIDIS (BAses Giving DIStances). A New Set of Tools for...

Description Details Author(s) References

Description

This is the companion package of a PhD thesis entitled "Bases Giving Distances. A new paradigm for investigating functional data with applications for spectroscopy" by Timmermans (2012). See references below for details and related publications.

The core of the BAGIDIS methodology is a functional wavelet based semi-distance that has been introduced by Timmermans and von Sachs (2010, 2015) and Timmermans, Delsol and von Sachs (2013). This semi-distance allows for comparing curves with sharp local patterns that might not be well aligned from one curve to another. It is data-driven and highly adaptive to the curves being studied. Its main originality is its ability to consider simultaneously horizontal and vertical variations of patterns, which proofs highly useful when used together with clustering algorithms or visualization method. BAGIDIS is an acronym for BAsis GIving DIStances. The extension of BAGIDIS to image data relies on the same principles and has been described in Timmermans and Fryzlewicz (2012), Fryzlewicz and Timmermans (2015).

Details

Package: Bagidis
Type: Package
Version: 2.0
Date: 2012-05-03
License: GPL-3
LazyLoad: yes

The BAGIDIS methodology aims at answering the need for a method able to detect the closeness of curves or images whose significant sharp features might not be well aligned. It has been developped and studied by Timmermans and von Sachs (2010, 2015), Timmermans, Delsol and von Sachs (2013) and Timmermans and Fryzlewicz (2012), Fryzlewicz and Timmermans (2015) and you should refer to these papers for detailed information about its use, and hence about the purpose of this package (see http://hdl.handle.net/2078.1/154928 and http://hdl.handle.net/2078.1/118369, for curves and http://hdl.handle.net/2078.1/110529 and http://stats.lse.ac.uk/fryzlewicz/shah/shah.pdf for images). Its approach is based upon the definition of a semi-distance that is functional, in the sense that the ordering of the data is explicitly taken into account, and wavelet-based, in the sense that it relies on a basis function expansion in which positions and amplitudes of a pattern are encoded. However, this method overcome the dyadic restriction that is attached with classical wavelet expansions, and do not require any preliminary smoothing of the data. Furthermore, a major originality of the method is that it relies on projections on basis functions that are different from one series to another.

Timmermans (2012) (http://hdl.handle.net/2078.1/112451) provides a comprehensive survey of the method. Regarding curves, the main ideas are as follows:

Function BUUHWE computes the Unbalanced Haar Wavelet Expansion (BUUHWE) of a series in the Unbalanced Haar basis that is best suited to it according to the bottom-up algorithm developped by Fryzlewicz (2007). Function Breakpoints and Function Details are used to encode the BUUHWE in a efficient compact way. Functions BUUHWE.plot and BD.plot give two different graphical representations of the BUUHWE expansion of a series. Function BAGIDIS.dist computes the BAGIDIS semidistance between two series encoded by their BUUHWE expansions. Function BAGIDIS.dist.BD computes the BAGIDIS semidistance between two series encoded by their breakpoints and details. semimetric.BAGIDIS computes the BAGIDIS semidistance between two series encoded as a numeric series. If semimetric.BAGIDIS is applied to matrices of series to be compared, it returns the dissimilarity matrix between them, as computed using the BAGIDIS semi-distance.

The BAGIDIS semi-distance between images relies on very similar principles. An optimal unbalanced Haar wavelet basis is associated to each image through the 2-dimensional Bottom-Up unbalanced Haar wavelet expansion (2D-BUUHWE). This expansion is rather termed SHape Adaptive Haar in Timmermans and Fryzlewicz (2012), Fryzlewicz and Timmermans (2015) and both terminology are used here in the functions name. The 2D-BUUHWE or SHAH is obtained thanks to function BUUHWE_2D and its alias SHAH. An efficient way to encode this expansion is through the signature of the image, obtained by applying Signature_2D to the output of BUUHWE_2D (or SHAH). A representation of the transform process is obtained using BUUHWE_2D.plot and its alias SHAH.plot. The transform process is linked with the definition of a well-suited unbalanced Haar basis for the image. This basis and its representation can be obtained using BUUHWE_2D_Stepwise and its alias SHAH_Stepwise. The BAGIDIS semi-distance between images is then computed using semimetric.BAGIDIS_2D.

Author(s)

Catherine Timmermans, Institute of Statistics, Biostatistics and Actuarial Sciences, UCLouvain.
Maintainer: <catherine.timmermans@uclouvain.be>

References

The main references are

Other references include

The function BUUHWE_2D in this package is similar to the function uh.bu.2d (copyrighted Fryzlewicz 2014) in the package "shah_code", available on the webpage of Piotr Fryzlewicz: http://stats.lse.ac.uk/fryzlewicz/shah/shah_code.R , which accompanies the paper Fryzlewicz and Timmermans (2015).


Bagidis documentation built on May 29, 2017, noon