View source: R/parseTweetFiles.R
clean_tweets | R Documentation |
Performs all necessary cleaning on a data frame of tweets. This includes removing all symbols from tweets, converting them to lower case, removing all stop words, and converting timestamps to an R usable format. Can also filter by time zone if desired (default does not filter)
clean_tweets(tweets.df, tz = NULL, stoplist = NULL)
tweets.df |
An array of tweets with desired variables attached. (Use dplyr to filter variables) |
tz |
A list of time zones to filter by, currently case sensitive |
stoplist |
The stoplist used to filter words |
The tweet data frame with all editing / filtering done. Empty dataset
## Not run: df = select(rawdata, text, time_zone)
## Not run: tweets = clean_tweets(dataframe)
## Not run: tweets = clean_tweets(dataframe, tz = c("Pacific Time (US & Canada)", "Eastern Time (US & Canada)),
stoplist = stoplist))
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.