README.md

topicl

The goal of topicl is to provide tools for identifying stable (reproducible) topics and for their analysis.

Installation

You can install the development version of topicl from GitHub with:

# install.packages("pak")
pak::pak("paskn/topicl")

Example

See vignette for details.

This is a basic example which shows you how to solve a common problem:

library(topicl)
library(stm)
library(dplyr)

modA <- stm(poliblog5k.docs, 
            poliblog5k.voc, K=25,
            prevalence=~rating, 
            data=poliblog5k.meta,
            max.em.its=2, 
            init.type="Random",
            seed = 9934,
            verbose = F) 

modB <- stm(poliblog5k.docs, 
            poliblog5k.voc, K=25,
            prevalence=~rating, 
            data=poliblog5k.meta,
            max.em.its=2, 
            init.type="Random",
            seed = 9576,
            verbose = F) 

compare_solutions(list(modA, modB), depth=500) |> 
  arrange(desc(jaccard)) |> 
  head()
#> # A tibble: 6 × 5
#>   model_id_A topic_id_A model_id_B topic_id_B jaccard
#>   <chr>      <chr>      <chr>      <chr>        <dbl>
#> 1 mod_1      topic_4    mod_2      topic_24     0.161
#> 2 mod_1      topic_7    mod_2      topic_6      0.152
#> 3 mod_1      topic_4    mod_2      topic_3      0.144
#> 4 mod_1      topic_24   mod_2      topic_24     0.143
#> 5 mod_1      topic_3    mod_2      topic_5      0.142
#> 6 mod_1      topic_4    mod_2      topic_9      0.142


paskn/topicl documentation built on Feb. 1, 2025, 9:33 p.m.