rewind: rewind: *R*constructing *E*tiology *w*ith B*in*ary...

Description Details Main rewind functions References See Also

Description

rewind is designed for analyzing multivariate binary data via binary factor analyses, with or without pre-specified number of factors, and without specifying the number of clusters in the data. It is motivated by analyzing the multivariate presence or absence of a list of antigens in serum samples collected from autoimmune disease patients. This package provides a tool for statistical inference of a model that assumes human body generates antibodies to a small number of protein complexes (or, machines), each comprised of a few important antigens.

Details

rewind This package implements a Bayesian hierarchical model that represents observations as aggregation of a few unobserved binary machines where the aggregation varies by subjects. Our approach is to specify the model likelihood via factorization into two latent binary matrices: machine profiles and individual factors. Given latent factorization, we account for inherent errors in measurement using sensitivities and specificities of protein detection. We use a prior for the individual factor matrix (Indian Buffet Process for binary matrices) to encourage a small number of subject clusters each with distinct patterns of active machines. The posterior distribution for the numbers of patient clusters and machines are estimated from data and by design tend to concentrate on smaller values. The posterior distributions of model parameters are estimated via Markov chain Monte Carlo which makes a list of molecular machine profiles with uncertainty quantification as well as patient-specific posterior probability of having each machine.

Main rewind functions

sampler

References

See Also


oslerinhealth/rewind documentation built on May 26, 2021, 6:56 a.m.