This package is based on Professor Kuan's paper^[A Simple and Robust Method for Partially Matched Samples Using the P-Values Pooling Approach, 2013].
It provided a statistical analyses in scenarios where some samples from the matched pairs design are missing, resulting in partially matched samples.
This package could process 5 procedures:
$x,y$ are vector values. In our package, we assume $x$ is tumor data and $y$ is normal data. NA values for missing data.
alternative can be "two.sided", "less" or "greater".
modttest(x,y)
;cor_Z(x,y)
;homoMLE(x,y)
;heteMLE(x,y)
;weighted.z.test(x,y)
.The modified t-statistic $t_3$ of Kim et al.^[Kim B, Kim I, Lee S, Kim S, Rha S, Chung H. Statistical methods of translating microarray data into clinically relevant diagnostic information in colorectal cancer. Bioinformatics. 2004; 21(4): 517–528. [PubMed: 15374865]] takes the form $$ \begin{aligned} t_3=\frac{{n_1}\bar{D}+{n_H}(\bar{T}-\bar{N})}{\sqrt{{n_1}{S_D}^2+{n_H}^2(S_N^2/n_3+S_T^2/n_2)}} \end{aligned} $$ And the null distribution of $t_3$ is approcimated with a standard Gaussian distribution.
The corrected Z-test of Looney and Jones^[ Looney S, Jones P. A method for comparing two normal means using combined samples of correlated and uncorrelated data. Statistics in Medicine. 2003; 22:1601–1610.] is based on a modified variance estimation of the standard Z-test by accounting for the correlation among the $n_1$ matched pairs, $$ \begin{aligned} Z_{corr}=\frac{\bar{T^}-\bar{N^}}{\sqrt{S_T^{2}/(n_1+n_2)+S_N^{2}/(n_1+n_3)-2n_1S_{TN_1}/(n_1+n_2)(n_1+n_3)}} \end{aligned} $$
There are two kinds of MLE based test. The first is proposed by Ekbohm^[ Ekbohm G. On comparing means in the paired case with incomplete data on both responses. Biometrika. 1976;63(2):299–304. ] and the second is proposed by Lin and Stivers^[ Lin P, Stivers L. On differences of means with incomplete data. Biometrika. 1974; 61(2):325–334. ].
Both MLE based test under homo- or heteroscedasticity have similar form like this: $$ \begin{aligned} Z_{LS}=\frac{f(\bar{T_1}-\bar{T})-g(\bar{N_1}-\bar{N})+\bar{T}-\bar{N}}{\sqrt{V_1}} \end{aligned} $$
The combined p-value by the weighted Z-test introduced by Liptak^[Liptak T. On the combination of independent tests. Magyar Tudom Aanyos Akad Aemia Matematikai Kutat Ao Intezetenek Kozlemenyei. 1958; 3:171–197. ] is: $$ \begin{aligned} p_{ci}=1-\Phi(\frac{w_1Z_{1i}+w_2Z_{2i}}{\sqrt{w_1^2+w_2^2}}) \end{aligned} $$
We assume the data set is big and missing value is relatively small.
Instead of call particular method, we call t.test() in extreme case.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.