In UBC-MDS/rtweetlytics: Analyzing the tweets for specific keyword

knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>",
  fig.path = "man/figures/README-",
  out.width = "100%"
)

rtweetlytics

Authors: Mahsa Sarafrazi, Shiva Shankar Jena, Amir Shojakhani, Mahmoodur Rahman

An R package to extract Twitter data, analyze it, and plot the top occurring hashtags.

Overview

The goal of rtweetlytics is to extract, analyze and plot Twitter data. It provides functions to extract Twitter data, clean tweets, analyze tweets and plot hashtag frequencies.

Functions

| Function Name | Input | Output | Description |
|----------------|---------------------------------------------|-----------|-------------|
| get_store() | bearer_token, keyword, start_date, end_date | Dataframe | Extracts Twitter data and saves it as a .csv file. |
| clean_tweets() | PATH, | String | Cleans the text in the tweets and returns the results as new columns in the dataframe. The cleaning process includes conversion to lower case, removal of punctuation, hashtags and hashtag counts. |
| analytics() | .csv | Dataframe | Analyzes the clean data frame extracted from Twitter and returns a tibble of analytics metrics. |
| plotting() | .csv, col_text | Image | Creates a bar chart of the most frequently occurring hashtags. |

Installation

The development version of rtweetlytics can be installed from GitHub with:

# install.packages("devtools")
devtools::install_github("UBC-MDS/rtweetlytics")

Features

The package combines four independent functions:

get_store(): Extracts data from Twitter by calling the API, writes a csv file as output, and creates a dataframe.
clean_tweets(): Cleans the text in the tweets and returns the results as new columns in the dataframe. The cleaning process includes conversion to lower case, removal of punctuation, hashtags and hashtag counts.
analytics(): Analyzes the clean data frame extracted from Twitter and returns a tibble of analytics metrics.
plotting(): Creates a bar chart of the most frequently occurring hashtags.

Example

Load the library

library(rtweetlytics)

1. Downloading data and creating dataframe

The first function in our library is rtweetlytics::get_store(). This function requires the developer to obtain a bearer token from the Twitter API developer website.
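The call below expects a `bearer_token` object to exist already. One common way to supply it without hard-coding the secret in a script is to read it from an environment variable. The following is a minimal sketch in base R; the helper `read_bearer_token()` and the variable name `TWITTER_BEARER_TOKEN` are assumptions made for illustration and are not part of rtweetlytics.

```r
# Hypothetical helper (not part of rtweetlytics): read the bearer token
# from an environment variable instead of writing it into the script.
# The variable name TWITTER_BEARER_TOKEN is an assumed convention.
read_bearer_token <- function(var = "TWITTER_BEARER_TOKEN") {
  token <- Sys.getenv(var, unset = "")
  if (!nzchar(token)) {
    # Fail early with a clear message when the variable is missing or empty
    stop("Environment variable ", var, " is not set.", call. = FALSE)
  }
  token
}

# Usage, once the variable has been set (for example in ~/.Renviron):
# bearer_token <- read_bearer_token()
```

With the token stored this way, the value passed to get_store() never appears in version control.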

tweets <- rtweetlytics::get_store(
            bearer_token,
            keyword="vancouver",
            start_date="2022-01-12",
            end_date="2022-01-17")
head(tweets)

tweets <- read.csv("output/tweets_response.csv")
head(tweets,2)

2. Cleaning data

The second function in our library is rtweetlytics::clean_tweets(). This function cleans the data to get the tweet texts and word counts.

PATH <- "../output/tweets_response.csv"
tweets_df <- rtweetlytics::clean_tweets(PATH, tokenization=TRUE, word_count=TRUE)
head(tweets_df)

clean_tweets <- read.csv("output/clean_tweets.csv")
head(clean_tweets,2)
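To make the cleaning steps described above more concrete, here is a rough sketch in base R of lower-casing, punctuation removal, and hashtag and word counting. It is not the implementation used by clean_tweets(); the function name `clean_text_sketch()`, the output column names, and the choice to strip hashtags from the cleaned text are all assumptions for illustration.

```r
# Illustrative only: a base-R approximation of the cleaning steps.
# Function and column names are hypothetical, not the package's own.
clean_text_sketch <- function(text) {
  tag_pattern <- "#[[:alnum:]_]+"
  # Collect the hashtags found in each tweet
  hashtags <- regmatches(text, gregexpr(tag_pattern, text))
  # Drop hashtags, lower-case, strip punctuation, collapse whitespace
  cleaned <- gsub(tag_pattern, "", text)
  cleaned <- gsub("[[:punct:]]", "", tolower(cleaned))
  cleaned <- trimws(gsub("[[:space:]]+", " ", cleaned))
  data.frame(
    text = text,
    clean_text = cleaned,
    hashtag_count = lengths(hashtags),
    word_count = lengths(strsplit(cleaned, " ", fixed = TRUE)),
    stringsAsFactors = FALSE
  )
}

# Made-up example tweets
example_clean <- clean_text_sketch(c("Hello #Vancouver! Nice day.", "No tags here"))
```

The result keeps the original text alongside the derived columns, mirroring the description of results being returned as new columns in the dataframe.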

3. Analyzing data

Our third function, rtweetlytics::analytics(), analyzes the data and returns a dataframe showing the total number of likes, total number of comments, total number of retweets, and the percentages of positive, neutral, and negative sentiments.

results <- rtweetlytics::analytics(tweets_df)
head(results)

knitr::include_graphics("man/figures/analytics.png")
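The metrics listed above are simple aggregates, and the sketch below shows how such a summary could be computed in base R. It is not the code behind analytics(): the function name, the input columns (`like_count`, `reply_count`, `retweet_count`, `sentiment`) and the toy values are assumptions, and it presumes a sentiment label has already been assigned to each tweet.

```r
# Illustrative only: totals and sentiment shares from a data frame.
# Column names and the toy data are hypothetical.
summarise_tweets_sketch <- function(df) {
  # Percentage of rows carrying a given sentiment label
  pct <- function(label) 100 * mean(df$sentiment == label)
  data.frame(
    total_likes = sum(df$like_count),
    total_comments = sum(df$reply_count),
    total_retweets = sum(df$retweet_count),
    pct_positive = pct("positive"),
    pct_neutral = pct("neutral"),
    pct_negative = pct("negative")
  )
}

# Made-up data, four tweets
toy <- data.frame(
  like_count = c(3, 0, 5, 2),
  reply_count = c(1, 0, 2, 1),
  retweet_count = c(0, 1, 4, 0),
  sentiment = c("positive", "neutral", "positive", "negative"),
  stringsAsFactors = FALSE
)
summary_row <- summarise_tweets_sketch(toy)
```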

4. Creating plot

The last function, rtweetlytics::plotting(), further cleans the data to extract hashtags and plots the top 15 hashtags.

hash_plot <-  rtweetlytics::plotting(tweets_df, text)
hash_plot

knitr::include_graphics("man/figures/plotting.png")
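The counting step behind a top-hashtags chart can be sketched in base R as follows. This is an assumption-laden illustration rather than the internals of plotting(): the function name `top_hashtags_sketch()` is hypothetical, and treating hashtags case-insensitively is a choice made here, not something the package documents.

```r
# Illustrative only: count hashtags and keep the n most frequent.
# Not the package's implementation; names are hypothetical.
top_hashtags_sketch <- function(text, n = 15) {
  # Pull every hashtag out of every tweet into one character vector
  tags <- unlist(regmatches(text, gregexpr("#[[:alnum:]_]+", text)))
  # Count case-insensitively and order from most to least frequent
  counts <- sort(table(tolower(tags)), decreasing = TRUE)
  counts[seq_len(min(n, length(counts)))]
}

# Made-up example tweets
example_top <- top_hashtags_sketch(c("#a #b", "#A", "#c #a"))
```

The returned counts can be passed to `barplot()` to draw a quick bar chart of the most frequent hashtags.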

Contributors

The names of the core development team are listed below.

Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.

License

rtweetlytics is licensed under the terms of the MIT license.

UBC-MDS/rtweetlytics documentation built on Feb. 6, 2022, 12:33 a.m.
