Description Usage Arguments Value
Find tokens using Lucene-like search queries
1 2 3 4 5 |
tokens |
a tokenlist object. See ?asTokenlist() for details. |
queries |
a data frame containing the queries. See ?searchQuery() for an explanation of the query language and to test individual queries. |
batchsize |
This function is faster if multiple queries are searched together, but too many queries (with too many tokens) at once can eat up memory or crash R. Try lowering batchsize in case of issues. |
default.window |
Determines the default word distance of the condition terms to the indicator (thus, if no specific word distance is set with the ~ symbol) |
condition_once |
logical. If TRUE, then if an indicator satisfies its conditions once in an article, all indicators within that article are coded. |
indicator_filter |
A logical vector that indicates which tokens can match an indicator. Can for instance be used to only select tokens that are proper names (using POS tagging) when looking for people. |
presorted |
The data has to be sorted on order(doc_id, position). If this is already the case, presorted can be set to TRUE to save time (which is usefull when testing many individual queries for large tokenlists) |
doc.col |
The name of the document_id column. Defaults to "doc_id", unless a global default is specified using setTokenlistColnames() |
position.col |
The name of the column giving the position in a document. Defaults to "position", unless a global default is specified using setTokenlistColnames() |
word.col |
The name of the column containing the token text. Defaults to "word", unless a global default is specified using setTokenlistColnames() |
verbose |
show progress |
the annotated tokens data frame
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.