README.md

Focus: This repository explores strength-of-association statistics for classification questions in data science, machine learning, and related fields. More notes upcoming.

Required Items

Most probably an input YAML that includes:

* The URL of a CSV data file whose fields will be assessed.
* The R data type of each field in the CSV file.
* The URL of a JSON file that outlines the characteristics of the CSV file data fields; more details within the section The Structure of the JSON File.

The Structure of the JSON File

This section describes the structure of the JSON file that records the characteristics of the data file whose fields will be assessed. Starting off with an empty list

dictionary = []

The overarching JSON object literal structure is

dictionary.append({'source': 'file name including extension',
    'properties': {'target': <the list of the target field/s>,
     'numeric': <the list of numeric fields>,      
     'categoricalFields': <the list of categorical fields>,
     'binaryCF': <the list of binary categorical fields>, 
     'polytomousCF':  <the dictionary of polytomous categorical fields>
    }})

Wherein a list has the form:

['item', 'item', ...]

Whereas, the structure of the dictionary of polytomous categorical fields is:

{
  'fieldName': <list of unique elements>, 
  'fieldName': <list of unique elements>, 
  'fieldName': <list of unique elements>, ... 
}
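
As a rough illustration of the structure described above, the sketch below builds one entry, checks its shape, and serialises the list to JSON text. The file name, the field names, and the helper check_entry are hypothetical and exist only for this example; the repository does not prescribe them. It also assumes, without confirmation from the source, that a binary field may appear in both 'categoricalFields' and 'binaryCF'.

```python
import json

# Hypothetical file and field names, for illustration only.
entry = {
    "source": "example.csv",
    "properties": {
        "target": ["outcome"],
        "numeric": ["age", "income"],
        # Assumption: binary and polytomous fields are also listed here.
        "categoricalFields": ["sex", "region"],
        "binaryCF": ["sex"],
        "polytomousCF": {"region": ["north", "south", "east", "west"]},
    },
}

dictionary = []
dictionary.append(entry)


def check_entry(item):
    """Return a list of problems found in one entry (empty if none)."""
    problems = []
    properties = item.get("properties", {})
    # These four keys are expected to hold plain lists.
    for key in ("target", "numeric", "categoricalFields", "binaryCF"):
        if not isinstance(properties.get(key), list):
            problems.append(f"'{key}' should be a list")
    # The polytomous fields map each field name to its unique elements.
    polytomous = properties.get("polytomousCF")
    if not isinstance(polytomous, dict):
        problems.append("'polytomousCF' should be a dictionary")
    else:
        for field, levels in polytomous.items():
            if len(levels) != len(set(levels)):
                problems.append(f"'{field}' has repeated elements")
    return problems


problems = check_entry(entry)

# Serialise the whole list; json.dumps emits double-quoted, valid JSON.
text = json.dumps(dictionary, indent=2)
print(text)
```

Writing the structure as Python objects and passing it through json.dumps avoids hand-editing quotes, since JSON itself requires double-quoted strings.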

Reference

Notes

Read "Writing R Extensions" for more information.



exhypotheses/associations documentation built on Oct. 26, 2023, 9:57 p.m.