library(dplyr)
knitr::opts_chunk$set(echo = FALSE)

Motivation

Understanding how DNA methylation affects/predicts gene expression in heterogeneous patient populations.

Suppose for some patient population we have:

We are interested in:

  1. Clustering patients into the correct sub-group
  2. Within sub-group, finding relationship between CpG site & gene expression

Idea

Goal: Correctly cluster observations & regress in high dimensional $X$ & $Y$.

$$ f\left(\boldsymbol{y}{i} \mid \boldsymbol{x}{\boldsymbol{i}} ; \boldsymbol{\theta}\right)=\Sigma_{k=1}^{K} \pi_{k} \mathcal{N}{q}\left(\boldsymbol{y}{\boldsymbol{i}} ; \boldsymbol{x}{\boldsymbol{i}} A{k}, \Sigma_{k}\right) $$

Multivariate Regression

!["Multivariate Regression"](G:/My Drive/Dissertation/HTH Mixture/hthmixture/slides/mvr.png)

Mixed Regression

!["Mixed Regression"](G:/My Drive/Dissertation/HTH Mixture/hthmixture/slides/mvr_mix.png)

Reduced Rank Regression

!["Reduced Rank Regression"](G:/My Drive/Dissertation/HTH Mixture/hthmixture/slides/mvr_rr.png)

HTH Mixture Algorithm

While not converged ($m=1,\ldots, M$) do:

HTH Mixture Algorithm

Likelihood Space Exploration Idea 1

Likelihood Space Exploration Idea 2

Data Simulation

Finally, combine $X$ & $Y$

Simulation

Results

Results

Weighted Likelihood

Results

Results

Results

data <- readr::read_rds("G:/My Drive/Dissertation/HTH Mixture/hthmixture/results/20210415_slide_results.rds")


data %>% 
  slice(1:12) %>% 
  knitr::kable(format = "latex", digits = 2)

Results

data %>% 
  slice(13:24) %>% 
  knitr::kable(format = "latex", digits = 2)

Performance



alexanderjwhite/hthmixture documentation built on Sept. 4, 2022, 4:48 a.m.