feather_files/README.md

iAtlas Data Structures

When importing data into iAtlas, it is very important that the following conventions are followed. Doing so will get the new data into the iAtlas database and make it available for the app.

Data Model

Information on the data model can be found in the data_model folder which contains this README.md file.

File Format

All data should come into the iAtlas application in the form of feather files. Feather files allow for fast reading and help ensure structural integrity.

Data Locations

All data (feather files) should be located in the iAtlas Synapse directory

Within the feather_file directory, data files should be segregated into folders as follows:

Feather File Structure

Data files in each folder MUST follow a specific convention for that data type. The files can be named as is deemed most descriptive and MUST end in .feather.

Column names MUST be spelled exactly as shown in this document.

The conventions for the feather files in each folder are as follows:

#### Copy Number Results Column Names

### datasets

#### Datasets Column Names

### driver_results

#### Driver Results Column Names

#### Edges Column Names

#### Features Column Names

#### Gene Type Column Names

#### Gene Column Names

#### Mutation Code Column Names

#### Mutation Type Column Names

#### Mutation Column Names

#### Node Column Names

A node may use a gene OR a feature. One of these is REQUIRED.

#### Patients Column Names

#### Sample Column Names

#### Slide Column Names

#### Tag Column Names

Tags may be used to group various pieces of data. At a base level, a tag is simply a string (with some descriptive meta data). Multpile pieces of data may be related by tagging them. Tags may even be tagged to create the semblance of hierarchy.

#### Publications Column Names

Often data is about relationships. The following folders are for data relationships. Each relationship depends on the original dat pieces being represented in their respective folders.



CRI-iAtlas/iatlas-data documentation built on July 7, 2020, 2:18 a.m.