Description Usage Arguments Details Value References
This function selects the symptoms that are potentially reported with bias by comparing the fitted marginal distribution of the symptoms with the observed symptom distribution in the community sample. To control for the false discovery rate, Bonferroni adjustment to the significance values can be sequentially applied to the models.
1 2 3 4 5 6 |
formula |
A formula object. The left side of the formula is
the collection of symptoms. The right side is the cause of
death. For example, if there are 5 symptoms, named
or for short as:
Note that the short way of writing formula requires the symptoms variables
are located in a consecutive block in the data starting from
|
data |
a list of two datasets. The first is the hospital data, which contains the known cause of death for each individual, and a collection of symptoms from verbal autopsy studies. The second is the community data where typically only the symptoms are available. The known cause of death can be available outside hospital if it is a validation study, but it will not be used during estimation. Variable names must be exactly the same in two data sets. |
nsymp |
a positive integer, specifing the size of subsets of
symptoms drawn from the total set for estimating cause specific
mortality fractions at each iteration. The optimal number of
|
n.subset |
A positive integer specifing the total number of
subsets and thus estimations of all symptoms.
The default is |
nboot |
a positive integer. If |
boot.se |
a Logical value. If |
method |
A string specifying the computational procedure
used to estimate the cause specific mortality fractions. When
|
fix |
A vector of strings that specifies whether a subset of
the cause specific mortality fractions are set to predetermined
values (based on, e.g.,the information obtained from other
sources). Suppose we would like to prefix ”d1” to be 5%, ”d2”
to be 15%, then |
bound |
A vector of strings that specifies lower and upper
bounds of a subset of the cause specific mortality fractions
(based on, e.g.,the information obtained from other
sources). Suppose we would like ”d3” to be estimated between 5% and
10%, "d4" to be between 1% and 2%, then
|
prob.wt |
A positive integer or a vector of weights that determines how
likely a symptom is of being selected for a subset. When
|
printit |
Logical value. If |
print.reg.size |
Logical value. If |
clean.method |
A string specifying which test to use to detect
poorly fit symptoms. The default is |
min.Symp |
An integer value. When the number of availability
symptoms is less than |
confidence |
A number between 0 and 1. It specifies the
confidence level (or the significance level |
FDR |
Logical value. If |
For details, please refer to ”Designing Verbal Autopsy Analyses: A Report to the World Health Organization” (King and Lu, 2008b) and http:\gking.harvard.edu\va
va.validate
outputs the following objects. cod.list
returns a list of cause of death estimations based on a set of nested
models. Ps.list
returns a list of fitted marginal symptom
distributions based on the same set of nested
models. delete.list
returns a list of collections of removed
symptoms based on the nested models. FDR.delete.list
returns a
list of symptoms that should be removed based on the Bonferroni
adjustment and attained a global confidence level of
confidence
. When boot.se
is TRUE
,
va.validate
also returns a set of objects that summarizes the
variance of the predicted marginal symptom distribution
(Ps.var
), the variance of the residuals (e.var
), the
results based on all the bootstrapped samples (cod.list.boot
and Ps.list.boot
).
King, Gary and Ying Lu. (2008) “Verbal Autopsy Methods with Multiple Causes of Death”, 14(1), Statistical Science. Also available at http:gking.harvard.edu/va King, Gary and Ying Lu. (2008b) “Designing Verbal Autopsy Analyses: A Report to WHO”.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.