# Generate the DPI database to be used by the TDCOR main function

### Description

`CalculateDPI`

builds a DPI database for the TDCOR main function to prune diamond motifs

### Usage

1 2 | ```
CalculateDPI(dataset,l_genes, l_prior, times, time_step, N, ks_int, kd_int,
delta_int, noise, delay)
``` |

### Arguments

`dataset` |
Numerical matrix storing the transcriptomic data. The rows of this matrix must be named by gene codes (like AGI gene codes for Arabidopsis data). |

`l_genes` |
A character vector containing the gene codes of the genes included in the analysis (i.e. to be used to build the network) |

`l_prior` |
A numerical vector containing the prior information on the genes included in the network recontruction. By defining the |

`times` |
A numerical vector containing the successive times at which the samples were collected to generate the time-series transcriptomic dataset. |

`time_step` |
A positive number corresponding to the time step (in hours) i.e. the temporal resolution at which the gene profiles are analysed. |

`N` |
An integer corresponding to the number of iterations that are carried out in order to estimate the DPI distributions. |

`ks_int` |
A numerical vector containing two positive elements in increasing order. The first (second) element is the lower (upper) boundary of the interval into which the equation parameters corresponding to the regulation strength of the targets by their regulators are randomly sampled. |

`kd_int` |
A numerical vector containing two positive elements in increasing order. The first (second) element is the lower (upper) boundary of the interval into which the equation parameters corresponding to the transcripts degradation rates are randomly sampled. |

`delta_int` |
A numerical vector containing two positive elements in increasing order and expressed in hours. The first (second) element is the lower (upper) boundary of the sampling interval for the equation parameters corresponding to the time needed for the transcripts of the regulator to mature, to get exported out of the nucleus, to get translated and for the regulator protein to get imported into the nucleus and to bind its target promoter. |

`noise` |
A positive number between 0 and 1 corresponding to the noisiness of the system. (0 = no noise, 1 = very strong noise). |

`delay` |
A positive number corresponding to the time shift (in hours) that is expected between the profile of a regulator and its direct target. This parameter is used to generate a reference target profile from the profile of the regulator and calculate the DPI index. |

### Details

`CalculateDPI`

models two 4-genes networks showing slightly different topologies. Each network topology is modelled using a specific system of delay differential equations. For all genes listed in `l_genes`

whose corresponding prior in `l_prior`

is not null (i.e. the genes that are regarded as transcriptional regulators), the two systems of differential equations are solved `N`

times with `N`

different sets of random parameters. The Diamond Pruning Index (DPI) is calculated for all of these `2N`

networks. From these in silico data the conditional probability distribution of the DPI index given the regulator and the topology can be estimated. The probability distribution of the topology given DPI and the regulator is next calculated using Bayes' theorem and returned by the function. These shall be used when reconstructing the network to prune the "diamond" motifs.

`CalculateDPI`

returns a list object which works as a database. It not only stores the conditional probability distributions but also all the necessary information for `TDCOR`

to access the data, and the input parameters. The latter are read by the `UpdateDPI`

function to update the database.

### Value

`CalculateDPI`

returns a list object.

`prob_DPI_ind` |
A numerical vector whose elements are named by the vector |

`prob_DPI` |
A list storing lists of 3 spline functions of probability distributions. Each of the spline functions corresponds to the probability distribution of one topology given a regulator and a DPI value. The information about which regulator was used to generate the distributions stored in the i-th element of |

`prob_DPI_domain` |
A list storing vectors of two elements. The first (second) element of element i is the lowest (greatest) DPI value obtained during the simulation with the regulator i. |

`input` |
A list that stores the input parameters used to generate the database. |

### Note

The computation of the TPI and DPI databases is time-consuming as it requires many systems of differential equations to be solved. It may take several hours to build a database for a hundred genes.

### Author(s)

Julien Lavenus jl.tdcor@gmail.com

### See Also

See also `UpdateDPI`

, `TDCor-package`

.

### Examples

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 | ```
## Not run:
# Load the LR transcriptomic dataset
data(LR_dataset)
# Load the vector of gene codes, gene names and prior
data(l_genes)
data(l_names)
data(l_prior)
# Load the vector of time points for the LR_dataset
data(times)
# Generate a small DPI database (3 genes)
DPI_example=CalculateDPI(dataset=LR_dataset,l_genes=l_genes[4:6],l_prior=l_prior[4:6],
times=times,time_step=1,N=5000,ks_int=c(0.5,3),kd_int=c(0.5,3),delta_int=c(0.5,3),
noise=0.15,delay=3)
## End(Not run)
``` |

Want to suggest features or report bugs for rdrr.io? Use the GitHub issue tracker. Vote for new features on Trello.