NEWS.md

orgdata 1.5.4

orgdata 1.5.3

orgdata 1.5.2

orgdata 1.5.1

orgdata 1.5.0

orgdata 1.4.9

orgdata 1.4.8

orgdata 1.4.7

orgdata 1.4.6

orgdata 1.4.5

orgdata 1.4.2

orgdata 1.4.1

orgdata 1.3.0

orgdata 1.2.0

orgdata 1.1.0

orgdata 1.0

- base or global option orgdata.recode.base with logical input. TRUE selects the base year for recoding geographical codes from the year of the original file to the current year. Default is FALSE, i.e. include all geographical codes available in the codebook. (#157)
- Fixed #139 so that orgdata.debug.geo keeps the original geo codes for enumeration areas before 9999 is added. (#140)
- Fixed #142 show the codes that fail to recode directly instead of row numbers (#144)
- Save all problematic codes in the log environment for easy access. List the codes with either log$code00 or log$codeShort.
- Recode geo even when argument aggregate = FALSE in make_file() function.
- Rename make_filegroup and lag_filgruppe to plural, i.e. make_filegroups and lag_filgrupper.
- Use option orgdata.debug.rows to select only specific row(s) for further processing. It can be activated via global options with options(orgdata.debug.rows = 20:50) or via argument row = 20:50 in make_file() to select rows 20 to 50.
- Fixed #135 with incorrect geo recode. (#131)
- Make multiple filegroups via make_filegroups. (#137)
- Fixed #132 LANDSSB must be string
- Convert whitespace to NA to be able to delete all rows with NA
- Fixed #119 able to mutate for TABS and VALS as well (#126)
- Fixed #122 delete rows with NA via EXTRA column (#127)
- Fixed #118 warning text when column(s) aren't defined in FILGRUPPE and will be deleted (#128)
- Edit error message for columns with existing NA values before aggregating. The total value will be NA, and this conflicts with the already existing NA category in the aggregated column(s). Existing NA values in the selected column(s) must therefore be recoded to a valid value.
- Use options(orgdata.debug.geo = TRUE) to keep old geo codes for debugging (#120)
- Use reset_options() to reset to default options.
- Warn when the process is discontinued due to debugging.
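The NA conflict mentioned in the aggregation error message above can be illustrated with a minimal base R sketch. The helper `aggregate_with_total()` and the toy data are hypothetical and are not part of orgdata; they only assume, as described above, that the total row is coded as NA in the aggregated column.

```r
# Hypothetical helper (not an orgdata function): sum VAL1 per KJONN category
# and append a total row whose category is coded NA.
aggregate_with_total <- function(d) {
  cats <- unique(d$KJONN)
  sums <- vapply(cats, function(k) {
    rows <- if (is.na(k)) is.na(d$KJONN) else d$KJONN %in% k
    sum(d$VAL1[rows])
  }, numeric(1), USE.NAMES = FALSE)
  data.frame(
    KJONN = c(cats, NA),               # NA marks the total row
    VAL1 = c(sums, sum(d$VAL1)),
    stringsAsFactors = FALSE
  )
}

# Toy data where KJONN already contains a missing category
raw <- data.frame(
  KJONN = c("1", "2", NA, "1"),
  VAL1 = c(10, 20, 5, 7),
  stringsAsFactors = FALSE
)

# Two rows now have KJONN == NA: the missing category and the total
ambiguous <- aggregate_with_total(raw)

# Recoding the missing category first removes the ambiguity
recoded <- raw
recoded$KJONN[is.na(recoded$KJONN)] <- "uoppgitt"
clean <- aggregate_with_total(recoded)
```

In `ambiguous`, the row with value 5 (missing category) and the row with value 42 (total) are indistinguishable by category, which is why the column must be recoded before aggregating.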
- Add vignettes for Standardize git and Debugging
- Fixed #121 recode geographical code for split codes (#120)
- Change database filename to raw-database_BE.accdb
- Fixed #106 split long messages (#107)
- Fixed #108 #112 grunnkrets codes that changed before 2002 are not available via the API from SSB, while code changes for municipalities include changes from 1977. Check the SSB website. We use the municipality codes to create unspecified grunnkrets codes for data before 2002 (#109 #113)
- Fixed #110 updating SQL code for new table name for codebook (#111)
- Check columns to aggregate for any possible NA (#98). Columns that have NA should be recoded to uoppgitt or something equivalent, since leaving the category as NA will conflict with NA representing the total value when aggregating.
- Fixed #100 for grunnkrets codes that end with 00 and have no corresponding codes from the SSB API. They need to be added manually (#101)
- Fixed #99 when geo codes fail to be recoded, the row index is shown (#103)
- Geo codes that end with 4 zeros (xxxx0000) neither have equivalent codes from SSB nor represent a correct coding structure such as the so-called Delområde, which ends with 2 zeros (xxxxxx00). To avoid losing the information, these geo codes are recoded to xxxx9999 with function is_grunnkrets_0000() as in PR (#103).
- see_file() accepts just a single numeric as well.
- Fixed #85 see_file() lists all the columns when column names or column indexes are not specified. The variables are sorted whenever possible. (#87)
- Add more function tests (#88)
- Exclude LANDSSB in aggregate when split to LANDBAK and INNVKAT. This is because code 0 will be recoded to 20 when split, which causes unnecessary extra rows (#84)
- Delete deprecated functions.
- Fixed #93 when source level can't be identified due to NA.
- Fixed #95 for grunnkrets codes that aren't missing but have fewer than 7 digits. Assuming these are municipality codes, 9999 is added at the end of these codes (#96)
- Give the row number for GEO codes that get coerced to NA when converted to integer. This makes it easy to check the original raw data (#96)
- Aggregate now gives totals for all dimensions including those specified in AGGKOL (#82)
- Function see_file() accepts column index as well (#83)
- Recode variables using regular expressions when defined in the codebook with type RE. The search pattern can be written either as an ordinary regular expression, e.g. \\d{4}.*, or with the rex package. (#78)
- New feature for checking categories for variables with see_file() (#75)
- Fixed #65 make TABS and VALS dynamic for easy extension of these columns (#66)
- Fixed #64 recode of variable that has different class (#68)
- Fixed #63 implicit null includes all possible VAL columns when they exist (#69)
- Fixed #70 recode GEO of different object class (#71)
- Fixed #67 aggregate with total values for standard variables, i.e. UTDANN, LANDSSB, LANDBAK and INNVKAT (#72)
- Fixed #61 use AGGKOL in the Access registration database to specify columns to aggregate other than the standard ones, e.g. KJONN, TAB1, TAB2 etc. (#73)
- Fixed #55 to recode standard variables via codebook instead of hard coded (#58)
- Fixed #52 skip split if not specified (#59)
- Fixed #57 split column with duplicated values will keep the original column (#60)
- Fixed #56 aggregate all VAL columns whenever specified and not only VAL1 (#62)
- Edit verbose messages
- Reshape dataset from wide to long. Reshape can have more than one measure variable. Please read how this is specified in the Access registration database.
- Split columns must have a number of values equal to the defined SPLITTIL. Duplicate the value if it is less than the maximum SPLITTIL. For example, value 0 in column LANDSSB, which will be split into LANDBAK and INNVKAT, is duplicated into 00 to avoid a split with value NA.
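The two geo-code rules described above (codes with fewer than 7 digits get 9999 appended, and codes ending in four zeros are recoded to end in 9999) can be sketched in base R. The helper `fix_geo()` is hypothetical and only illustrates the rules as worded in these notes; it is not the package's is_grunnkrets_0000() and makes no claim about how orgdata implements them. Codes are assumed to be character strings.

```r
# Hypothetical helper (not an orgdata function) illustrating the two rules.
fix_geo <- function(code) {
  code <- as.character(code)
  # Rule 1: fewer than 7 digits is assumed to be a municipality code,
  # so 9999 is appended to mark an unknown grunnkrets
  short <- nchar(code) < 7
  code[short] <- paste0(code[short], "9999")
  # Rule 2: codes ending in four zeros (xxxx0000) become xxxx9999;
  # Delområde-style codes ending in only two zeros are left untouched
  sub("0000$", "9999", code)
}

fixed <- fix_geo(c("3010000", "301", "03010101", "03010000", "03010100"))
# fixed is c("3019999", "3019999", "03010101", "03019999", "03010100")
```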
- Recoding of LANDBAK and INNVKAT after aggregating is done internally, i.e. hard coded, in do_aggregate_recode_standard(). Total is coded with 20. Any future change should also consider related functions such as is_aggregate_standard_cols() and is_col_num().
- Change argument parameter for find_spec() function.
- Update text document in several places.
- Add colour type warn2 for warning messages without the Warning: prefix.
- Request (#43) messages with specific colour
- Fixed (#46) recode to string even though the column is of type integer or numeric.
- Unknown bydel, i.e. (uoppgitt), is added when the enumeration area code, i.e. (grunnkrets), for bydel is XXXX9999 in function geo_level().
- Add unknown grunnkrets for kommune when not available, since some of the datasets have unknown grunnkrets that aren't listed in the API downloaded data (#39).
- Exclude TAB1, TAB2 and TAB3 from being aggregated. (#44)
- Recode for aggregated variables uses AG in the TYPE column in the codebook instead of FILGRUPPE with AGGREGATE as it was implemented in ver 0.2.0. This makes it possible to specify FILGRUPPE and LESID to implement the principle for GENERAL, COMMON and SPECIFIC variables.
- Change function name do_aggregate_recode to do_aggregate_recode_standard for standard variables.
- Recode for aggregated categories can be defined in the Recode form, i.e. codebook, using AGGREGATE in the specification under FILGRUPPE
- Delete rows when defined in the codebook using the minus symbol under the TIL column. The same principle is implemented for the GENERAL, COMMON and SPECIFIC feature as in recode. Read details in ver 0.0.5 - alpha.
- Display both the column names to be recoded that are found in the dataset and those that aren't found when defined as ALLE in the codebook, so the user is aware of them.
- Standardize some of the most used arguments to read_file() such as nrows, header, skip, trimws and na. Read details in the read_file() function description.
- Output of read_file() is of class data.table.
- Use standard column names V1, V2 etc. when argument header = FALSE is specified.
- Error message with list of unmatched columns in do_column_standard().
- Give clearer messages and debug messages, e.g. Execute: read_file().
- Change MAPPE to UTMAPPE to make it more explicit for path specification to save file.
- Defunct orgdata.active global option to use column names from the original dataset.
- Use global option options(orgdata.debug.nrow = TRUE) to read only the first 20 rows. Suitable for debugging purposes.
- Fix (#28) GEO derived from two columns with empty INNLESARG.
- Add column LEVEL for granularity level, i.e. grunnkrets, fylke, kommune, bydel etc.

orgdata 0.0.1 - alpha version

orgdata 0.0.4 - alpha

Changes are in PR #2

orgdata 0.0.3 - alpha

Changes are in PR #1

orgdata 0.0.2 - alpha

Things that are implemented



helseprofil/orgdata documentation built on Feb. 3, 2025, 4:55 p.m.