This vignette describes how you need to install the Observational Health Data Sciencs and Informatics (OHDSI) PatientLevelPrediction
package under Windows, Mac, and Linux.
Under Windows the OHDSI Patient Level Prediction (PLP) package requires installing:
Under Mac and Linux the OHDSI Patient Level Prediction (PLP) package requires installing:
The preferred way to install the package is by using drat, which will automatically install the latest release and all the latest dependencies. If the drat code fails or you do not want the official release you could use devtools to install the bleading edge version of the package (latest master). Note that the latest master could contain bugs, please report them to us if you experience problems.
To install using drat run:
install.packages("drat") drat::addRepo("OHDSI") install.packages("PatientLevelPrediction")
To install using devtools run:
install.packages('devtools') devtools::install_github("OHDSI/FeatureExtraction") devtools::install_github('ohdsi/PatientLevelPrediction')
When installing using devtools make sure to close any other Rstudio sessions that are using PatientLevelPrediction or any dependency. Keeping Rstudio sessions open can cause locks that prevent the package installing.
Many of the classifiers in the PatientLevelPrediction use a Python back end. To set up a python environment run:
library(PatientLevelPrediction) reticulate::install_miniconda() configurePython(envname='r-reticulate', envtype='conda')
To add the R keras interface, in Rstudio run:
devtools::install_github("rstudio/keras") library(keras) install_keras()
Some of the less frequently used classifiers are not installed during this set-up to add them run:
For GBM survival:
reticulate::conda_install(envname='r-reticulate', packages = c('scikit-survival'), forge = TRUE, pip = FALSE, pip_ignore_installed = TRUE, conda = "auto", channel = 'sebp')
For any of the torch models:
reticulate::conda_install(envname='r-reticulate', packages = c('pytorch', 'torchvision', 'cpuonly'), forge = TRUE, pip = FALSE, channel = 'pytorch', pip_ignore_installed = TRUE, conda = 'auto')
To test whether the package is installed correctly run:
library(DatabaseConnector) connectionDetails <- createConnectionDetails(dbms = 'sql_server', user = 'username', password = 'hidden', server = 'your server', port = 'your port') PatientLevelPrediction::checkPlpInstallation(connectionDetails = connectionDetails, python = T)
To test the installation (excluding python) run:
library(DatabaseConnector) connectionDetails <- createConnectionDetails(dbms = 'sql_server', user = 'username', password = 'hidden', server = 'your server', port = 'your port') PatientLevelPrediction::checkPlpInstallation(connectionDetails = connectionDetails, python = F)
The check can take a while to run since it will build the following models in sequence on simulated data:Logistic Regression, RandomForest, MLP, AdaBoost, Decision Tree, Naive Bayes, KNN, Gradient Boosting. Moreover, it will test the database connection.
Installation issues need to be posted in our issue tracker: http://github.com/OHDSI/PatientLevelPrediction/issues
The list below provides solutions for some common issues:
If you have an error when trying to install a package in R saying 'Dependancy X not available ...' then this can sometimes be fixed by running install.packages('X')
and then once that completes trying to reinstall the package that had the error.
I have found that using the github devtools to install packages can be impacted if you have multiple R sessions open as one session with a library open can causethe library to be locked and this can prevent an install of a package that depends on that library.
Considerable work has been dedicated to provide the PatientLevelPrediction
package.
citation("PatientLevelPrediction")
Please reference this paper if you use the PLP Package in your work:
This work is supported in part through the National Science Foundation grant IIS 1251151.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.