CorporaCorpus-package: A Collection of Small Corpora Prepared by the Centre for...

Description Details Author(s) Source

Description

The package contains a collection of small corpora prepared by the Centre for Corpus Research.

Details

This package contains the following corpora

Name Description
DNov Dickens' novels
19C 19th Century Novels

The content of each corpus is documented in detail in the Corpora vignette. Basic metadata for the corpus texts is provided in the form of a data.frame returned by the corpus_metadata function.

The texts have been processed in such a way as to facilitate tokenization and to simplify analysis. See the FAQ vignette for full details of how the texts were prepared.

The function corpus_filepaths return file paths to the novels texts; the locations of which are not always transparent to the user.

For a list of all documentation use library(help="CorporaCorpus").

Author(s)

Maintainer: Anthony Hennessey <anthony.hennessey@nottingham.ac.uk>.

Source

For details of the individual texts see the Corpora vignette.

DNov

‘Novels by Charles Dickens’ at ‘Project Gutenberg’. https://www.gutenberg.org/ “Plain Text UTF-8” files. Retrieved from https://www.gutenberg.org/ebooks/author/37 on 2016-11-27.

All texts are covered by the Project Gutenberg License. A copy of the full license can be found at http://gutenberg.org/license.

19C

All novel texts where sourced from the ‘Project Gutenberg’, “Plain Text UTF-8” files and were retrieved on 2017-02-02.

All texts are covered by the Project Gutenberg License. A copy of the full license can be found at https://gutenberg.org/license.


ravingmantis/CorporaCorpus documentation built on May 27, 2019, 2:04 a.m.