Description Format Details See Also Examples
Examples of the types of data to which mixmod
and mdmixmod
may be used to fit mixture models.
For CiData
and DlData
, a list with three elements, binding
, expression
, and conservation
, which are numeric vectors, matrices, or data frames. For CiGene
and DlGene
, a data frame with elements containing the symbol, name, CG ID, FlyBase ID, chromosome, strand, start position, stop position, and target status for each gene in the corresponding *Data
data set.
Both Ci*
and Dl*
contain data relating to identification of transcription factor (TF) target genes in Drosophila involved in embryonic development. Ci
relates to cubitus interruptus, a TF involved in regulation of almost all Hedgehog-responsive (Hh-responsive) genes (Von Ohlen et al., 1997). Binding data represents log-ratios of Ci binding in the regulatory regions of genes vs. background binding. Expression values are a matrix of log-ratios of expression in mutant vs. wild-type embryos, mutants being homozygous null for one of four proteins known to affect Ci's regulatory function. The proteins are Smoothened (Smo), Patched (Ptc), and Ci and Hh themselves. Data are preprocessed and scaled from the raw data available at the Gene Expression Omnibus (GEO) accession number GSE24055.
Dl
relates to the Dorsal TF, which controls dorsal-ventral patterning in early embryogenesis. Binding data represent the log-ratios of binding in regulatory regions vs. background for dorsal and Snail (Sna), a TF which is an early target of Dl and plays an important role in the dorsal-ventral patterning process. Raw data are available at GEO GSE26285. Expression data represent log-ratios of gene expression for different mutant strains with varying levels of Dl throughout the embryo (pipe-/pipe- vs. toll10B and pipe-/pipe- vs. tollrm9/tollrm10). Raw data are available at GEO GSE5434.
For both Ci*
and Dl*
, cross-species gene sequence conservation is calculated from PhastCons using 12 fly species with one species each of mosquito, honeybee, and beetle as outgroups. The conservation values used in the analysis are a univariate vector calculated from the sums of PhastCons highly conserved element (HCE) scores for HCEs which overlap genes. These scores are available from the University of California, Santa Cruz (UCSC) Genome Browser. “Known target” status, represented by the target
element of *Gene
, is calculated from previous studies and from Gene Ontology (GO) and Berkeley Drosophila Genome Project (BDGP) annotation.
mixmod
, mdmixmod
, rocauc
for the functions used in Examples; simulation
for functions to simulate data with similar characteristics to the real data.
1 2 3 4 5 6 7 8 |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.