brown: Brown corpus

brown R Documentation

Brown corpus

Description

The Brown corpus in tabular format, tokenized and POS-tagged, as distributed at https://www.nltk.org/nltk_data/. Headings and sentence boundaries are currently not preserved.

Usage

brown

Format

A data frame with five variables (genre_id, doc_id, sentence_id, word, pos) and two string attributes: contents and readme.
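
The one-row-per-token layout described above lends itself to simple tabulation. The following is a minimal sketch in base R, assuming only the five column names listed here. The data frame `toy` and every value in it are invented stand-ins for illustration, not rows from the corpus, and the identifier formats shown are hypothetical.

```r
# Toy stand-in with the same five column names as `brown`.
# All values are invented for illustration; the real identifier
# formats in `brown` may differ.
toy <- data.frame(
  genre_id    = c("a", "a", "a", "b", "b"),
  doc_id      = c("a01", "a01", "a01", "b01", "b01"),
  sentence_id = c(1L, 1L, 1L, 1L, 1L),
  word        = c("The", "jury", "said", "The", "election"),
  pos         = c("AT", "NN", "VBD", "AT", "NN"),
  stringsAsFactors = FALSE
)

# Number of tokens per genre (one row is one token)
tokens_per_genre <- table(toy$genre_id)

# POS tag frequencies, most frequent first
pos_freq <- sort(table(toy$pos), decreasing = TRUE)

# Number of distinct documents per genre
docs_per_genre <- tapply(toy$doc_id, toy$genre_id,
                         function(x) length(unique(x)))
```

The same calls should work on `brown` itself once the data is loaded, provided the columns are named as listed above.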

Details

For documentation, see http://korpus.uib.no/icame/brown/bcm.html. The raw README and CONTENTS files are also included as attributes.

Examples


data(brown)
head(brown)

class(attr(brown, "README"))

alex-raw/occurR documentation built on March 10, 2023, 5:08 p.m.