SentiAnalyzer is a straight forward solution for analysis of consumer reviews which includes natural language processing (NLP) to preprocess the reviews and prepare the term matrix of reviews data in order to be consistent with Machine Learning algorithms. The dataset that SentiAnalyzer can work with is the review text and a binary class indicating the customer sentiment, e.g. whether consumer feeling is possitive or negative.
Balancing the dataset : using function BalanceData
Cleaning, Tokenizing and Building the term matrix of reviews : using function CleanText
Visualize the clean term matrix to give the user the insight inside the data : using function VisualzieData and shiny app
Train different classification algorithms(e.g., SVM,NB,RF,KNN,GBM) and choose best parameters for each one to get the highest possible classification accuracy for an specefic dataset : using function BuildTraining
Choose the best trained classification algorithm for the specific dataset according to different measures (e.g., FScore, Recall, Precision, Accuracy) : using function Comparison
Predict on the new review data : using function BuildPrediction
Visualize the output of the confusion matrix, that is, the accuracy, precision, recall and f1-score of the training model in predicting the sentiment of the consumer review : Using function Comparison and Shiny app
install.packages("SentiAnalyzer")
The package website: SentiAnalyzer
Tutorial : Workflow of Functions
Shiny versions:
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.