View source: R/preprocLinkage.R
preprocLinkage | R Documentation |
This function performs record linkage together with data preprocessing. It is designed to work across a wide range of datasets by standardizing variable names using synonyms, which makes it easier to integrate and analyze data from different sources.
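The synonym table used by the package is not shown on this page, so the following is only a minimal sketch of what synonym-based name standardization could look like. The `synonyms` list and the `standardize_names()` helper are hypothetical and are not part of the package.

```r
# Hypothetical synonym table: canonical name -> accepted variants.
synonyms <- list(
  sex = c("sex", "gender"),
  age = c("age", "years"),
  by  = c("by", "birth_year", "yob")
)

# Hypothetical helper: rename the columns of `d` to their canonical names.
standardize_names <- function(d, synonyms) {
  # Build a lookup vector: variant -> canonical name.
  lookup <- unlist(lapply(names(synonyms), function(canon) {
    stats::setNames(rep(canon, length(synonyms[[canon]])), synonyms[[canon]])
  }))
  key <- tolower(names(d))        # match case-insensitively
  hit <- key %in% names(lookup)
  # Columns without a known synonym keep their original name.
  names(d)[hit] <- unname(lookup[key[hit]])
  d
}

d <- data.frame(Gender = c("f", "m"), YOB = c(1980, 1975), id = 1:2)
d_std <- standardize_names(d, synonyms)
names(d_std)
```

Matching on lower-cased names is a design choice of this sketch; the package may compare names differently.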
preprocLinkage(d1, d2, chz = "NULL", var = c("age", "sex"), threshold = 0.9)
d1 |
A data frame. |
d2 |
A data frame. |
chz |
The number of the name of the variable that the user does not want to change based on the output of the |
var |
The vector of the names of the blocked variables that the user chooses based on the output of the |
threshold |
A numeric value between 0 and 1. |
The results are written to .csv files. If the number of records exceeds one million, they are written to rdata files instead.
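The size-based choice of output format described above can be sketched as follows. This is an illustration under stated assumptions, not the package's code: `save_result()`, its `max_csv_rows` argument, and the file names are hypothetical.

```r
# Hypothetical helper illustrating the size-based choice of output format.
# File naming and the exact handling of the cutoff are assumptions.
save_result <- function(res, path_stem, max_csv_rows = 1e6) {
  if (nrow(res) > max_csv_rows) {
    out <- paste0(path_stem, ".rdata")
    save(res, file = out)                               # binary R format
  } else {
    out <- paste0(path_stem, ".csv")
    utils::write.csv(res, file = out, row.names = FALSE)
  }
  out  # return the path that was written
}

small <- data.frame(id = 1:3, by = c(1980, 1975, 1990))
out_small <- save_result(small, file.path(tempdir(), "linkage_small"))

# Lower the cutoff to exercise the rdata branch without building 1e6 rows.
out_large <- save_result(small, file.path(tempdir(), "linkage_large"),
                         max_csv_rows = 2)
```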
Two csv files or two rdata files.
To view the results in the created files, first load the data.table package.
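As a sketch of the note above, a csv result can be read back with `fread()` from data.table. The output file names are not documented on this page, so the path below is a hypothetical stand-in that the example creates itself.

```r
library(data.table)

# Stand-in for a file written by preprocLinkage(); the real file names are
# not documented here, so this path and its contents are hypothetical.
csv_path <- file.path(tempdir(), "linkage_result.csv")
write.csv(data.frame(id = 1:3, by = c(1980, 1975, 1990)),
          csv_path, row.names = FALSE)

res <- fread(csv_path)  # fread() returns a data.table
print(res)
```

For an rdata file, base R's `load()` restores the saved objects into the current environment.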
Hossein Hassani and Leila Marvian Mashhad.
selVar, chzInput
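# RLdata500 and RLdata10000 are example datasets from the RecordLinkage package
d1 = RLdata500
d2 = RLdata10000
# "by" is the birth-year column in these datasets
preprocLinkage(d1, d2, var = "by")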