Newsmap: Semi-Supervised Model for Geographical Document Classification

require(quanteda)
require(newsmap)

txt <- c("Ireland is famous for beer.",
         "Beer is popular in Ireland.",
         "Cork is an Irish coastal city.",
         "India is known for curry.",
         "Indian curry and beer go well.",
         "New Delhi is the capital of India")

toks <- tokens(txt)
label_toks <- tokens_lookup(toks, data_dictionary_newsmap_en, levels = 3)
label_dfm <- dfm(label_toks)

feat_dfm <- dfm(toks) %>%
    dfm_remove(stopwords()) %>%
    dfm_select('^[a-z1-2]+', selection = "keep", valuetype = 'regex')
map_lr <- textmodel_newsmap(feat_dfm, label_dfm, measure = "likelihood")
map_et <- textmodel_newsmap(feat_dfm, label_dfm, measure = "entropy")

map_lr$model
map_et$model

predict(map_lr, confidence.fit = TRUE)
predict(map_et, confidence.fit = TRUE)

koheiw/Newsmap documentation built on April 14, 2024, 3:26 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

koheiw/Newsmap
Semi-Supervised Model for Geographical Document Classification

dev/entropy.R
In koheiw/Newsmap: Semi-Supervised Model for Geographical Document Classification

R Package Documentation

Browse R Packages

We want your feedback!

koheiw/Newsmap Semi-Supervised Model for Geographical Document Classification

dev/entropy.R In koheiw/Newsmap: Semi-Supervised Model for Geographical Document Classification

R Package Documentation

Browse R Packages

We want your feedback!

koheiw/Newsmap
Semi-Supervised Model for Geographical Document Classification

dev/entropy.R
In koheiw/Newsmap: Semi-Supervised Model for Geographical Document Classification