process_multiwords_fast: Optimized multiword processing workflow
In tall: Text Analysis for All

process_multiwords_fast

R Documentation

Optimized multiword processing workflow

Description

Complete optimized workflow for multiword detection and processing. Uses C++ functions and data.table for maximum performance.

Usage

process_multiwords_fast(x2, stats, term = c("lemma", "token"))

Arguments

`x2`	Data frame with token information
`stats`	Data frame with multiword statistics (keyword, ngram columns)
`term`	Type of term to process: "lemma" or "token"

Details

This function replaces the original switch block with an optimized version that uses:

C++ functions for text recoding
Vectorized operations instead of multiple mutate calls
Pre-computed lookups to avoid repeated joins

Value

Data frame with columns: doc_id, term_id, multiword, upos_multiword, ngram

Examples

## Not run: 
result <- process_multiwords_fast(dfTag, multiword_stats, term = "lemma")

## End(Not run)

tall documentation built on Feb. 12, 2026, 9:08 a.m.

tall index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

tall
Text Analysis for All

process_multiwords_fast: Optimized multiword processing workflow
In tall: Text Analysis for All

Optimized multiword processing workflow

Description

Usage

Arguments

Details

Value

Examples

Related to process_multiwords_fast in tall...

R Package Documentation

Browse R Packages

We want your feedback!

tall Text Analysis for All

process_multiwords_fast: Optimized multiword processing workflow In tall: Text Analysis for All

Optimized multiword processing workflow

Description

Usage

Arguments

Details

Value

Examples

Related to process_multiwords_fast in tall...

R Package Documentation

Browse R Packages

We want your feedback!

tall
Text Analysis for All

process_multiwords_fast: Optimized multiword processing workflow
In tall: Text Analysis for All