getWACKY: Read in concordances from the WaCkY Corpus

View source: R/getWACKY.R

getWACKYR Documentation

Read in concordances from the WaCkY Corpus

Description

This function allows for reading in concordances exported from the free web interface for searching the WaCkY corpora, http://nl.ijs.si/noske/wacs.cgi/first_form.

Usage

getWACKY(filename, tags = c("display", "omit", "column"), XML = TRUE)

Arguments

filename

The name of the file you read in. You can use file.choose() to choose a file interactively. The file must be in XML (not .txt) format. It has to be formatted in UTF-8 (which should be the case by default).

tags

string indicating how tokens and tags should be displayed, in case you exported a lemmatized or otherwise tagged concordance. "omit" will only show tokens and omit the lemmas. "display" will display the lemmas alongside the tokens in the same column, separated by /. "column" will add a "lemma" column to the concordance. "only" will only show lemmas in the keyword column. Default is "omit".

XML

Is the concordance file in XML format? Default is TRUE. Currently only XML files are supported, support for .txt files will probably be implemented in the near future.

Value

A file containing a tab-separated concordance sheet.


hartmast/concordances documentation built on April 19, 2023, 9:39 p.m.