The HypothesisReader R package supports the analytic methods as described in . The main action of this package is to extract any and all hypothesis and/or proposition statements found in the provided documents. Once these statements are extracted, the following key features are generated:
For example, take the following sample hypothesis:
Hypothesis 1: Commitment configuration is positively associated with firm performance.
After this hypothesis is extracted from the source academic paper, it is reduced to it's key features, shown below:
| Variable 1 | Variable 2 | Direction | Causality | | :--- | :--- | :--- | :--- | | commitment configuration | firm performance | positive | 0 |
The HypothesisReader package utilizes Python in addition to R. Therefore, a Python interpreter must be installed on any machine running said package. Through the R package Reticulate, the HypothesisReader package downloads and configures the Python infrastructure for the user. This is the default method for using this package. If the user wishes to manually set-up the Python connection, information about the required Python version and Python package versions are described in Configure Python Environment.
The following is for the quick set-up and installation of the R package. Further detail for each of these steps is provided in the sections below.
devtools::install_github("canfielder/HypothesisReader")
devtools::install_github("ropensci/rtika"")
HypothesisReader::LaunchApp()
HypothesisReader::InstallPythonPackages()
The HypothesisReader package was developed R Version 4.0.2.
The package requires Java 8
or OpenJDK 1.8
. Higher versions will also work. To verify the Java version on your machine, enter java -version
in a terminal. Installation information on Java can be found at https://www.java.com/en/download/ or http://openjdk.java.net/install/.
The HypothesisReader package is currently not on the Comprehensive R Archive Network (CRAN). To use, it must be installed from the package’s GitHub repository. It is strongly recommended to use the devtools package to assist in installation.
The devtools package can be installed with the following:
# Install devtools from CRAN
install.packages(“devtools”)
With devtools installed, the HypothesisReader package can be installed by executing the following:
devtools::install_github("canfielder/HypothesisReader")
Install rtika package from GitHub repository. The package rtika is a dependent package for HypothesisReader and must be installed before the package can be used.
devtools::install_github("ropensci/rtika"")
The HypothesisReader package will automatically configure the Python environment, with minimal input by the user. Once the package is installed, all the user has to do is attempt to use the function HypothesisReader or attempt to process a PDF document through the provided Shiny app, accessed from the function LaunchApp (see Usage below).
Once the package begins processing a PDF, it will search for the required Python configuration. If it has not yet been set up, the Reticulate package will prompt the user to install the Miniconda installer. The prompt will generate on the RStudio console. To install, the user must enter y.
After installation is complete, it is recommended the user restart the R session (Session > Restart R).
Note: From the Miniconda documentation: “Miniconda is a free minimal installer for conda. It is a small, bootstrap version of Anaconda that includes only conda, Python, the packages they depend on, and a small number of other useful packages, including pip, zlib and a few others.”
The Reticulate package downloads a version of Miniconda containing Python 3.6.
The HypothesisReader package is constructed to work with the following Python packages:
The main action of the HypothesisReader package (as described above) is accomplished through two functions: HypothesisReader and LauchApp.
HypothesisReader
This function is the code-based method for performing the above action. This function accepts PDF file(s), or a folder containing PDF file(s), and then returns a table containing the information described above.
LaunchApp
This function provides a Graphical User Interface through a Shiny app to perform the above action. The table with the processed information can then be downloaded as a CSV file. Executing LauchApp() from the RStudio console will launch a Shiny app. The app allows the user to select PDF files for upload and processing.
Note: The app launches using the default web browser for the local machine. If the Shiny app does not launch after running this function, please check your browser's pop-up settings.
The function InstallPythonPackages is also provided in the package. This function is to manually install the required Python packages, and should only be used if the default installation process failed.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.