1)Helps in intial preprocessing like converting text to lower case, removing (punctuation, numbers,stop words), stemming, sparsity control and TF-IDF pre-processing.2) Helps in recognizing domain/corpus specific stop words 3) makes use of 'ldatunig' output to pick optimal number of topics for LDA modelling 4) Helps in extracting dominant words or key words that represent the context/topics of the content in each document.
Package details |
|
---|---|
Author | Krishna Harsha @COGNIZANT ANALYTICS |
Maintainer | Krishna Harsha@COGNIZANT ANALYTICS <khkrishnaharsha123@gmail.com> |
License | GPL |
Version | 0.1.0 |
Package repository | View on GitHub |
Installation |
Install the latest version of this package by entering the following in R:
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.