inst/examples/author.md

title: "Author preprocessing summary" author: "Leo Lahti / Computational History Group" date: "2018-06-21" output: markdown_document

Authors

Auxiliary files

Top-20 uniquely identified authors and their productivity (title count).

plot of chunk summaryauthorsplot of chunk summaryauthors

Ambiguous authors

Authors with ambiguous living year information - can we spot here cases where these are clearly known identical or distinct authors? Should also add living year information from supporting sources later.

33540 authors with missing life years (Life year info can be augmented here)

8333 authors with ambiguous life years Some of these might be synonymous and could be added to author synonyme list (the first term will be selected for the final data)

Life span of uniquely identified top authors

Ordered by productivity (number of documents))

plot of chunk summaryauthorslife

Author age

135846 documents (28%) have author age at the publication year. These have been calculated for documents where the publication year and author life years (birth and death) are available, and the document has been printed during the author's life time.

plot of chunk author_age

Author productivity

Title count versus paper consumption (all authors):

plot of chunk authortitlespapers

plot of chunk summaryTop10authorstimeline

plot of chunk topauthplot of chunk topauth



COMHIS/estc documentation built on April 7, 2022, 4:53 p.m.