useR_2008_abstracts | R Documentation |
Abstracts and some metadata for presentations at the useR! 2008 conference held August 12-14, 2008 at Technische Universität Dortmund, Germany.
data("useR_2008_abstracts")
A data frame with 177 observations on the following 5 character variables.
Abstract
:the text of the abstract.
Title
:the title of the presentation.
Author
:the authors of the presentation, collapsed with ‘ and ’.
Session
:an identifier indicating the session the presentation was slotted into (leading ‘foc’ and ‘kal’ indicate useR! Focus and useR! Kaleidoscope sections, respectively).
Keywords
:keywords for the presentation, collapsed with ‘, ’.
Abstracts were obtained in PDF format from
https://www.r-project.org/conferences/useR-2008/abstracts/,
converted to text using pdftotext
, and hand-edited.
Metadata were provided by the conference organizers.
data("useR_2008_abstracts")
words <-
with(useR_2008_abstracts,
strsplit(Abstract, "[[:space:]]+"))
## (A poor word tokenizer ...)
n_of_words <- sapply(words, length)
authors <-
with(useR_2008_abstracts,
strsplit(Author, " and ", fixed = TRUE))
n_of_authors <- sapply(authors, length)
## Do more authors write longer abstracts?
boxplot(n_of_words ~ n_of_authors)
## Session structure:
sessions <-
with(useR_2008_abstracts,
sub("-[[:digit:]].*", "", Session))
sort(unique(sessions))
## Numbers of focus, invited and kaleidoscope presentations:
table(sub("-.*", "", sessions))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.