estc: English Short Title Catalogue (ESTC) Metadata Toolkit

# Define the input file and output folder
# The rest should then execute out-of-the box
source.data.file <- "data/estc.csv.gz"

# Install and load the required custom libraries
library(devtools)
install_github("ropengov/sorvi")
install.packages(c("gender", "genderdata"),
                 repos = "http://packages.ropensci.org",
                 type = "source")
install_github("ropengov/bibliographica")
install_github("ropengov/estc")

library(estc)
library(bibliographica)

# Read the raw data
df.orig <- read_bibliographic_metadata(source.data.file)

# Load the polishing function
source("forby.R") # Modify freely

# Polish the publisher field
# TODO Printed by to be added
pub <- polish_publisher_forby(df.orig$publisher)

# Write summaries:
## Publishers ordered from most to least common
tmp <- write_xtable(pub$printedfor, file = "publisher_for_accepted.csv", count = TRUE)

## Discarded fields: those where no output is generated
disc <- df.orig$publisher[rowSums(is.na(pub) | is.null(pub)) == ncol(pub)]
tmp <- write_xtable(disc, file = "publisher_discarded.csv")

## Conversions from raw to final version
tab <- cbind(original = df.orig$publisher, pub)
tmp <- write_xtable(tab, file = "publisher_conversions.csv")

COMHIS/estc documentation built on April 7, 2022, 4:53 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

COMHIS/estc
English Short Title Catalogue (ESTC) Metadata Toolkit

inst/examples/new/old/publisher_forby.R
In COMHIS/estc: English Short Title Catalogue (ESTC) Metadata Toolkit

R Package Documentation

Browse R Packages

We want your feedback!

COMHIS/estc English Short Title Catalogue (ESTC) Metadata Toolkit

inst/examples/new/old/publisher_forby.R In COMHIS/estc: English Short Title Catalogue (ESTC) Metadata Toolkit

R Package Documentation

Browse R Packages

We want your feedback!

COMHIS/estc
English Short Title Catalogue (ESTC) Metadata Toolkit

inst/examples/new/old/publisher_forby.R
In COMHIS/estc: English Short Title Catalogue (ESTC) Metadata Toolkit