README.md

clevercloudr

codecov R-CMD-check

The goal of clevercloudr is to to serve as a solution to create meaningful word clouds. To create meaningful word clouds, data scientists typically takes multiple steps to clean the data, such as removing stopwords, removing punctuation and digits, making the letters lower cases, conducting stemming. This package will help data scientists clean the data easily following the common practices and also allow to create more meaningful word clouds.

Functions

There are 4 functions in this package:

Fitting into the R ecosystem

Packages that have similar functions:

What we do differently:

Installation

You can install the development version of clevercloudr from GitHub with:

# install.packages("devtools")
devtools::install_github("UBC-MDS/clevercloudr")

Examples

Some quick examples of CleverClean(), CleverStemmer(), CleverStopwords() and CleverWordCloud().

library(clevercloudr)
text <- list("grounds!!!", "feet6", "running123", "feeding", "feed", "feed$", "grounding", "feet", "happiness")

# Clean raw text data 
clean_text <- CleverClean(text)
clean_text
#> [1] "grounds"   "feet"      "running"   "feeding"   "feed"      "feed"     
#> [7] "grounding" "feet"      "happiness"
# Perform stemming on the text 
stem_text <- CleverStemmer(clean_text)
stem_text
#> [1] "ground" "feet"   "run"    "feed"   "feed"   "feed"   "ground" "feet"  
#> [9] "happi"
# Customize the stopwords by adding stopwords specific to each project / task 
new_words <- list("happi")
new_stopwords <- CleverStopwords(new_words)
# Generate the word cloud
CleverWordCloud(stem_text, new_stopwords)

Contributing

Contributors of the project: Amelia Tang, Arushi Ahuja, Victor Francis, Adrianne Leung

Interested in contributing? Check out the contributing guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.

License

clevercloudr was created by Amelia Tang, Arushi Ahuja, Victor Francis, Adrianne Leung. It is licensed under the terms of the MIT license.



UBC-MDS/clevercloudr documentation built on Feb. 7, 2022, 9:11 a.m.