title: "Document dimension preprocessing summary" author: "Leo Lahti / Computational History Group" date: "2018-06-21" output: markdown_document
Some dimension info is provided in the original raw data for altogether 475544 documents (99%) but could not be interpreted for 8867 documents (ie. dimension info was successfully estimated for 98.1 % of the documents where this field was not empty).
Document size (area) info was obtained in the final preprocessed data for altogether 469277 documents (98%). For the remaining documents, critical dimension information was not available or could not be interpreted: List of entries where document surface could not be estimated
Document gatherings info is originally available for 463253 documents (96%), and further estimated up to 466677 documents (97%) in the final preprocessed data.
Document height info is originally available for 8149 documents (2%), and further estimated up to 469277 documents (98%) in the final preprocessed data.
Document width info is originally available for 3729 documents (1%), and further estimated up to 469277 documents (98%) in the final preprocessed data.
These tables can be used to verify the accuracy of the conversions from the raw data to final estimates:
The estimated dimensions are based on the following auxiliary information sheets:
Document dimension estimates (used when information is partially missing)
Left: final gatherings vs. final document dimension (width x height). Right: original gatherings versus original heights where both are available. The point size indicates the number of documents for each case. The red dots indicate the estimated height that is used when only gathering information is available.
Left: Document dimension histogram (surface area); Middle: Paper consumption histogram; Right: title count per gatherings.
Popularity of different document sizes over time. Left: absolute title counts. Right: relative title counts. Gatherings with less than 15 documents at every decade are excluded:
## NULL
## NULL
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.