checkdir: Extract test statistics from all HTML and PDF files in a...


Description

Extracts statistical references from a directory containing HTML and PDF files. The "pdftotext" program is used to convert PDF files to plain text files. It must be installed, and the PATH variable must be set so that the program can be run from the command line.
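Before running the function on PDF files, it can help to confirm that "pdftotext" is reachable from R. The following is a minimal sketch using base R only; the variable name pdftotext_path is illustrative and not part of the package.

```r
# Look up the pdftotext executable on the PATH.
# Sys.which() returns an empty string when the program cannot be found.
pdftotext_path <- Sys.which("pdftotext")

if (nzchar(pdftotext_path)) {
  message("pdftotext found at: ", pdftotext_path)
} else {
  message("pdftotext not found; install it and add it to the PATH")
}
```

If the empty-string branch is reached, PDF conversion is unlikely to work until the PATH is fixed; HTML files do not depend on this program.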

By default, a GUI window (built with tcltk) opens so that you can choose the directory.

Usage

checkdir(dir, subdir = TRUE, ...)

Arguments

dir

String indicating the directory to be used.

subdir

Logical indicating whether subfolders should also be checked. Defaults to TRUE.

...

Arguments sent to statcheck.

Details

See statcheck for more details. This function is a wrapper around both checkPDFdir for PDF files and checkHTMLdir for HTML files.
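To make the effect of subdir concrete, here is a rough sketch of how PDF and HTML files in a directory might be collected separately. The helper list_article_files is hypothetical and not part of statcheck, and the file-extension patterns are assumptions; the package's own file matching may differ.

```r
# Hypothetical helper, not part of statcheck: list the PDF and HTML files
# that a directory scan would consider. `subdir` controls recursion.
list_article_files <- function(dir, subdir = TRUE) {
  pdf  <- list.files(dir, pattern = "\\.pdf$", recursive = subdir,
                     full.names = TRUE, ignore.case = TRUE)
  html <- list.files(dir, pattern = "\\.html?$", recursive = subdir,
                     full.names = TRUE, ignore.case = TRUE)
  list(pdf = pdf, html = html)
}

# Demonstration on a temporary directory with empty placeholder files.
demo_dir <- file.path(tempdir(), "articles_demo")
dir.create(file.path(demo_dir, "nested"), recursive = TRUE, showWarnings = FALSE)
file.create(file.path(demo_dir, "a.pdf"),
            file.path(demo_dir, "b.html"),
            file.path(demo_dir, "nested", "c.pdf"))

found_all <- list_article_files(demo_dir, subdir = TRUE)   # includes nested/c.pdf
found_top <- list_article_files(demo_dir, subdir = FALSE)  # top level only
```

With subdir = FALSE only the files directly inside the directory are listed, which mirrors the documented meaning of the argument.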

Depending on the PDF file, comparison operators sometimes cannot be converted correctly, in which case the affected statistics are not reported in the output. Using HTML versions of articles is recommended for more stable results.

Note that the conversion to plain text and the extraction of statistics can result in errors. Some statistical values can be missed, especially if the notation is unconventional. It is recommended to manually check some of the results.

Value

A data frame containing for each extracted statistic:

Source

Name of the file from which the statistic is extracted

Statistic

Character indicating the statistic that is extracted

df1

First degree of freedom

df2

Second degree of freedom (if applicable)

Value

Reported value of the statistic

Reported.Comparison

Reported comparison; when importing from PDF this will often not be converted properly

Reported.P.Value

The reported p-value, or NA if the reported value was NS

Computed

The recomputed p-value

Raw

Raw string of the statistical reference that is extracted

InExactError

Error in inexactly reported p-values as compared to the recalculated p-values

ExactError

Error in exactly reported p-values as compared to the recalculated p-values

DecisionError

The reported result is significant whereas the recomputed result is not, or vice versa.
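As an illustration of how the Computed and DecisionError columns relate, the sketch below recomputes a p-value for a t statistic and compares the significance decisions. The numbers are made up for illustration, and the two-tailed test and the .05 threshold are assumptions; the package's own computation may differ in detail.

```r
# Illustrative values only, not taken from any real article:
# a result reported as t(28) = 2.20, p < .05
statistic <- 2.20
df        <- 28
alpha     <- 0.05   # assumed significance threshold

# Recompute the two-tailed p-value from the statistic and its degrees of freedom.
computed_p <- 2 * pt(-abs(statistic), df)

# The report claims significance ("p < .05"); check whether the
# recomputed p-value leads to the same decision.
reported_significant <- TRUE
computed_significant <- computed_p < alpha
decision_error <- reported_significant != computed_significant
```

Here the recomputed p-value falls below the threshold as well, so decision_error is FALSE; it would be TRUE if only one of the two decisions were significant.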

Author(s)

Sacha Epskamp <mail@sachaepskamp.com> & Michele B. Nuijten <m.b.nuijten@uvt.nl>

See Also

statcheck, checkPDF, checkPDFdir, checkHTML, checkHTMLdir

Examples

# with this command a menu will pop up from which you can select the directory with articles
# checkdir()

# you could also specify the directory beforehand
# for instance:
# DIR <- "C:/mydocuments/articles"
# checkdir(DIR)

chartgerink/statcheckTEMP documentation built on May 13, 2019, 3:37 p.m.