README.md

Documentation etc.

Interested in getting involved? Here are some ways to help:

Future Work

(Unsure about the priority of the rest, but the top one is definitely most important.)

Explicit model for p(u|v)

Currently the way we assign weights to sample for p(u|v) (roughly as described in the paper) requires a bit of hand-tweaking to work well in individual examples. It's also hard to generalize to categorical inputs. This may be the biggest barrier to widespread adoption.

As an alternative, perhaps we can explicitly build a model for the desired conditional distribution, e.g. maybe by using something like BART.

Todo:

Categorical inputs

Once we've done "Explicit model for p(u|v)", allowing categorical inputs should be much easier, but there's still some thought required.

Sensivity Analysis

Some of the examples show how to do sensitivity analysis in the spirit of this package, but it'd be great to have that do it for you.

"Variable Importance"

Implement something like conditional variable importance in the spirit of this package. (Like "permutation importance", but instead of taking a permutation, you'd sample from the conditional distribution p(u|v).)

Other tools/methods for understanding complicated models

I'd like to compile a list of other work in this direction, maybe comparing them with this.

I should add a page discussing other methods people have used to get at somewhat the same idea.



dchudz/predcomps documentation built on May 15, 2019, 1:48 a.m.