README.md

NGramPackage

Features

Searches Google Book ngram dataset for words that begin with the letter "a"

Outputs a dataframe of the frequency of searched word

Outputs what year the word is used most frequently

Outputs the number of books that use the ngram

Data

http://storage.googleapis.com/books/ngrams/books/datasetsv2.html

References

Jean-Baptiste Michel, Yuan Kui Shen, Aviva Presser Aiden, Adrian Veres, Matthew K. Gray, William Brockman, The Google Books Team, Joseph P. Pickett, Dale Hoiberg, Dan Clancy, Peter Norvig, Jon Orwant, Steven Pinker, Martin A. Nowak, and Erez Lieberman Aiden. Quantitative Analysis of Culture Using Millions of Digitized Books. Science (Published online ahead of print: 12/16/2010)

Yuri Lin, Jean-Baptiste Michel, Erez Lieberman Aiden, Jon Orwant, William Brockman, Slav Petrov. Syntactic Annotations for the Google Books Ngram Corpus. Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics Volume 2: Demo Papers (ACL '12) (2012)



cnmwebb/NGramPackage documentation built on May 22, 2019, 11:51 p.m.