Data versioning

We suggest to version the data that you upload. This is needed because of the reproducibility of the research that is going to be performed on the datasets. We are suggesting to do this based upon (semantic versioning). A better explanation on using semantic versioning in data can be found here: semantic versioning for data products.

Semantic versioning

The implementation of semantic versioning is as follows.

Table scheme

The table scheme we use is composed of 2 components, the data version and the table name.

Examples for the 'core' tables
Examples for the 'outcome' tables

Other approaches

Semantic versioning is just one of the strategies you can use. Date-based versioning is also a good way to deal with this examples can be:

Examples for the 'core' tables

Full datetime

Date only

Date month - if you have less iterations

Changelogs

To keep track of all the changed within the different versions of the data you uploaded you can keep track of a changelog.

To view an example, please check out the changelogs of the dictionaries. You can use the same format, but for the data. When you archived the project it becomes less relevant.



lifecycle-project/ds-upload documentation built on June 14, 2024, 6:54 a.m.