process_delim: A function to read in large data files as an FBM
In plmmr: Penalized Linear Mixed Models for Correlated Data

process_delim

R Documentation

A function to read in large data files as an FBM

Description

A function to read in large data files as an FBM

Usage

process_delim(
  data_dir,
  data_file,
  feature_id,
  rds_dir = data_dir,
  rds_prefix,
  logfile = NULL,
  overwrite = FALSE,
  quiet = FALSE,
  ...
)

Arguments

`data_dir`	The directory to the file.
`data_file`	The file to be read in, without the filepath. This should be a file of numeric values. Example: use `data_file = "myfile.txt"`, not `data_file = "~/mydirectory/myfile.txt"` Note: if your file has headers/column names, set 'header = TRUE' – this will be passed into `bigmemory::read.big.matrix()`.
`feature_id`	A string specifying the column in the data X (the feature data) with the row IDs (e.g., identifiers for each row/sample/participant/, etc.). No duplicates allowed.
`rds_dir`	The directory where the user wants to create the '.rds' and '.bk' files Defaults to `data_dir`
`rds_prefix`	String specifying the user's preferred filename for the to-be-created .rds file (will be create insie `rds_dir` folder) Note: 'rds_prefix' cannot be the same as 'data_prefix'
`logfile`	Optional: the name (character string) of the prefix of the logfile to be written. Defaults to 'process_delim', i.e. you will get 'process_delim.log' as the outfile.
`overwrite`	Optional: the name (character string) of the prefix of the logfile to be written. Defaults to 'process_plink', i.e. you will get 'process_plink.log' as the outfile. Note: If there are multiple `.rds` files with names that start with "std_prefix_...", this will error out. To protect users from accidentally deleting files with saved results, only one `.rds` file can be removed with this option.
`quiet`	Logical: should the messages printed to the console be silenced? Defaults to FALSE.
`...`	Optional: other arguments to be passed to `bigmemory::read.big.matrix()`. Note: 'sep' is an option to pass here, as is 'header'.

Value

The file path to the newly created '.rds' file

Examples

temp_dir <- tempdir()
colon_dat <- process_delim(data_file = "colon2.txt",
 data_dir = find_example_data(parent = TRUE), overwrite = TRUE,
 rds_dir = temp_dir, rds_prefix = "processed_colon2", sep = "\t", header = TRUE)

colon2 <- readRDS(colon_dat)
str(colon2)

plmmr documentation built on April 4, 2025, 12:19 a.m.