Getting Started
In datasetjson: Read and Write CDISC Dataset JSON Files

knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>"
)

library(datasetjson)

Using datasetjson

datasetjson works by allowing you to take a data frame and apply the necessary attributes required for the CDISC Dataset JSON. The goal is to make this experience simple. Before you can write a Dataset JSON file to disk, you first need to build the Dataset JSON object. An example call looks like this:

ds_json <- dataset_json(head(iris, 5), 
                        item_oid = "IG.IRIS", 
                        name = "IRIS", 
                        dataset_label = "Iris", 
                        columns = iris_items)

This is the minimum information required to provide to create a datasetjson object.

The parameters here can be described as follows:

The input data frame iris
The item_oid, which can be described as the "Object of Dataset", which is a key value is a unique identifier for the dataset, corresponding to ItemGroupDef/@OID in Define-XML.
name, which is the dataset name
dataset_label, which is the dataset label, and finally
columns, which is the variable level metadata for your dataset.

The columns parameter is special here, in that you provide a data frame with the necessary variable metadata. Take a look at the iris_items data frame.

iris_items

This data frame has 7 columns, 4 of which are strictly required. This is defined by the CDISC Dataset JSON Specification.

| Attribute | Requirement | Description | |----------------|-----------------|--------------------------------------------------------------------------------------------------------------------------------| | itemOID | Required | OID of a variable (must correspond to the variable OID in the Define-XML file) | | name | Required | Variable name | | label | Required | Variable description | | dataType | Required | Type of the variable. Allowed values: "string", "integer", "decimal", "float", "double", "boolean", "datetime", "date", "time", "URI". See ODM types for details. | | targetDataType | Required | Type of the variable. Allowed values: "integer", "decimal". Indicates the data type into which the receiving system must transform the associated Dataset-JSON variable. | | length | Optional | Variable length | | displayFormat | Optional | Display format supports data visualization of numeric float and date values. | | keySequence | Optional | Indicates that this item is a key variable in the dataset structure. It also provides an ordering for the keys. |

The data within this dataframe ultimately populates the columns element of the Dataset JSON file. The itemOID, name, label, and dataType columns are all required and must be populated for each variable. Note that the dataType column has a list of allowable values:

string
integer
float
double
decimal
boolean
datetime
date
time
URI

This information must be provided directly by the user. Note that no type conversions of your data are performed by the datasetjson package. The displayFormat column inherently refers to display formats used within SAS.

Writing and Reading

The datasetjson object allows you to collect the information needed to generate a Dataset JSON file, but to write the dataset out need to use the write_dataset_json() file. Once the Dataset JSON object is available, all you need is that object name and a file path.

write_dataset_json(ds_json, file="iris.json")

The write_dataset_json() also has the option to return the JSON output as a character string.

js <- write_dataset_json(ds_json, pretty=TRUE)
cat(js)

Similarly, to read a Dataset JSON object, you can use the function read_dataset_json(). This function will return a dataframe to you, ready to use. To read, provide a file path.

read_dataset_json("path/to/file")

You can also provide single element character vector of the JSON text already read in.

dat <- read_dataset_json(js)

The data frame that's read in is itself a datasetjson object and carries a number of attributes. For example, opening the dataframe within the RStudio IDE will present the variable labels. Additionally, the extra metadata provided in a Dataset JSON file is available. The attributes provided follow the naming convention of the Dataset JSON standard.

We've provided some helper functions to leverage this data further. If you'd like to grab the column metadata from the columns element, you can use the function get_column_metadata()

get_column_metadata(dat)

With this column metadata available, you can additionally use the function set_variable_attributes() to apply the columns metadata to the individual variables within the data frame.

dat <- set_variable_attributes(dat)
attributes(dat$Species)

Any scripts or data that you put into this service are public.

datasetjson documentation built on April 3, 2025, 10:54 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

datasetjson
Read and Write CDISC Dataset JSON Files

Getting Started
In datasetjson: Read and Write CDISC Dataset JSON Files

Using datasetjson

Writing and Reading

Try the datasetjson package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

datasetjson Read and Write CDISC Dataset JSON Files

Getting Started In datasetjson: Read and Write CDISC Dataset JSON Files

Using datasetjson

Writing and Reading

Try the datasetjson package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

datasetjson
Read and Write CDISC Dataset JSON Files

Getting Started
In datasetjson: Read and Write CDISC Dataset JSON Files