Description Usage Arguments Details Value References
This function determines the optimal size of the subsets of symptoms.
In general, the training set is randomy splited, half of the training
set is treated as the hospital sample and the other half is treated as
the community sample. va.gcv
seraches for the optimal size of subsets
nsymp
that minimize the prediction errors in the community
sample.The estimation is done using constrained quadratic optimization.
1 2 |
formula |
a formula object. The left side of the formula is
the collection of symptoms. The right side is the cause of
death. For example, if there are 5 symptoms, named
or for short as:
Note that the short way of writing formula requires the symptoms variables
are located in a consecutive block in the data starting from
|
data |
a list of two datasets. The first is the hospital data, which contains the known cause of death for each individual, and a collection of symptoms from verbal autopsy studies. The second is the community data where typically only the symptoms are available. The known cause of death can be available outside hospital if it is a validation study, but it will not be used during estimation. Variable names must be exactly the same in two data sets. |
nsymp.vec |
a vector of positive integer, it contains a collection
of different |
n.subset |
A positive integer specifing the total number of
subsets and thus estimations of all symptoms.
The default is |
prob.wt |
A positive integer or a vector of weights that determines how
likely a symptom is of being selected for a subset. When
|
boot.se |
a Logical value. If |
nboot |
a positive integer. If |
printit |
Logical value. If |
print.reg.size |
Logical value. If |
For details, please refer to "Verbal Autposy Methods with Multiple Causes of Death"(King and Lu, 2008), and http:\gking.harvard.edu\va
va.gcv
outputs two objects. best.symp
returns the best
nsymp
that minimizes mean square error between estimated
cause-specific mortality fraction and the observed cause-specific
mortality fraction. mse
returns a vector of mean square errors
associated with each size of the subsets (as specified in symp.vec
).
King, Gary and Ying Lu. (2008) “Verbal Autopsy Methods with Multiple Causes of Death”, 14(1), Statistical Science, Also available at http:gking.harvard.edu/va
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.