Discussion of Maritimes environmental data access, July 7, 2020
B. Casault, E. Chisholm, C. Layton, C. Johnson, M. McMahon
Meeting Minutes
Variable Names
Emily presented azmpmetrics.xlsx (available via Data Access Teams chat)
Options for variable naming schemes includes either 'made - up' custom names or standardized names following CF conventions
Group preference is for CF names
provides most description, good for someone unfamiliar with data
downside is that they are quite long names
BC suggests to remove annual and monthly prefixes as this will be redundant with file names and other metadata
discussion of whether to append regional names eg. 'ss' or 'gsl', this will make names more messy and long, will be difficult to compare or combine the same variable between regions
remove emerald/misaine on variables as well
Spreadsheet should also have column with name of azmpdata csv which data will be a part of for clarity and organization (EC/CL/BC)
what columns need to be included for each csv?
using csv names provided by CJ after last meeting
zooplankton names
occupations have abundance, annual has log abundance
discussion about how to shorten names
decided that 'log' information is important to include, 'abundance' is redundant
EC will update names
some confusion with variables that are not actually in azmpdata, EC will remove
Calanus stages should be denoted C1...C6, EC will update
instead of 'biomass' should use wet/dry weight
dry weight has only 'meso' size fraction, wet weight hasd 'macro', 'meso' and 'total' size fraction (EC will update)
Progress on azmpdata csvs
BC has produced some csvs
needs to be updated to include 'day' column (BC)
Sample ID lookup table discussed to connect sample ID to missions
need to decide on nominal vs actual position, especially for fixed station
*CL created ASCII files
includes some header metadata
read into R as list, so that metadata is preserved along with dataframe
similar to an oce structure but without becoming a complex S4 class
Next time
EC will keep notes about discussion between CL, BC, EC on organizing data into csvs notes will be provided to CJ