tm.plugin.alceste-package: A plug-in for the tm text mining framework to import corpora...

Description Details Author(s) References

Description

This package provides a tm Source to create corpora from files formatted in the format used by the Alceste application.

Details

Typical usage is to create a corpus from an Alceste file prepared manually (here called myAlcesteCorpus.txt). Frequently, it is necessary to specify the encoding of the texts via link{AlcesteSource}'s encoding argument.

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
    # Import corpus
    source <- europresseSource("myAlcesteCorpus.txt")
    corpus <- Corpus(source)

    # See how many articles were imported
    corpus

    # See the contents of the first article and its meta-data
    inspect(corpus[1])
    meta(corpus[[1]])
  

See link{AlcesteSource} for more details and real examples.

Author(s)

Milan Bouchet-Valat <nalimilan@club.fr>

References

http://www.image-zafar.com/en/alceste-software


tm.plugin.alceste documentation built on May 1, 2019, 10:30 p.m.