nlp_light_pipeline: Spark NLP Light pipeline

View source: R/light-pipeline.R

nlp_light_pipeline    R Documentation

Spark NLP Light pipeline

Description

LightPipelines are Spark ML pipelines converted to run as a single-machine, multithreaded task, which makes them more than 10x faster on smaller amounts of data ("small" is relative, but roughly 50k sentences is a reasonable upper bound). To use one, pass in a trained (fitted) pipeline.

Usage

nlp_light_pipeline(x, parse_embeddings = FALSE)

Arguments

x

a trained (fitted) pipeline

parse_embeddings

whether to parse the embeddings

Value

a LightPipeline object
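
Examples

The following is a minimal sketch of how a fitted pipeline might be converted, not an example taken from the package itself. The helper name build_light_pipeline, the table name, the column names, and the sample text are all hypothetical. It assumes the sparklyr and sparknlp packages are installed, that a Spark connection sc is available, and that nlp_document_assembler() and nlp_tokenizer() accept the arguments shown.

```r
# Hypothetical helper: fit a small annotation pipeline and wrap it in a
# LightPipeline. Package functions are namespace-qualified so the helper can
# be defined without attaching sparklyr or sparknlp first.
build_light_pipeline <- function(sc) {
  # ml_fit() needs a Spark DataFrame even though these annotators do not
  # learn anything from it, so copy one illustrative row to Spark.
  train_tbl <- sparklyr::sdf_copy_to(
    sc,
    data.frame(text = "Spark NLP runs on Apache Spark.",
               stringsAsFactors = FALSE),
    name = "light_pipeline_demo",   # hypothetical table name
    overwrite = TRUE
  )

  # Assemble an unfitted pipeline: raw text -> document -> tokens.
  pipeline <- sparklyr::ml_pipeline(sc)
  pipeline <- sparknlp::nlp_document_assembler(
    pipeline, input_col = "text", output_col = "document"
  )
  pipeline <- sparknlp::nlp_tokenizer(
    pipeline, input_cols = c("document"), output_col = "token"
  )

  # Fit the pipeline, then convert the fitted model to a LightPipeline.
  fitted <- sparklyr::ml_fit(pipeline, train_tbl)
  sparknlp::nlp_light_pipeline(fitted, parse_embeddings = FALSE)
}

# Usage (requires a working Spark installation; not run here):
# library(sparklyr)
# library(sparknlp)   # attach before connecting so the Spark NLP jars load
# sc <- spark_connect(master = "local")
# light <- build_light_pipeline(sc)
```

The resulting object can then be used to annotate small inputs without going through a distributed Spark job. Which annotation function to call on it (for example nlp_annotate()) is an assumption to check against the package reference.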


r-spark/sparknlp documentation built on Oct. 15, 2022, 10:50 a.m.