textEmbedReduce: Pre-trained dimension reduction (experimental)

View source: R/1_3_textEmbedReduce.R

textEmbedReduce    R Documentation

Pre-trained dimension reduction (experimental)

Description

Pre-trained dimension reduction (experimental)

Usage

textEmbedReduce(
  embeddings,
  n_dim = NULL,
  scalar =
    "https://raw.githubusercontent.com/adithya8/ContextualEmbeddingDR/master/models/fb20/scalar.csv",
  pca =
    "https://raw.githubusercontent.com/adithya8/ContextualEmbeddingDR/master/models/fb20/rpca_roberta_768_D_20.csv"
)

Arguments

embeddings

(list) Embeddings, including tokens, texts, and/or word_types.

n_dim

(numeric) Number of dimensions to reduce to.

scalar

(string or matrix) Path or URL to the scalar used for standardizing the embeddings. If a URL is given, the function first checks whether the file has already been downloaded. The string should point to a CSV file containing the standardization values applied before the PCA projection. For more information, see the reference below.

pca

(string or matrix) Path or URL to the PCA weights. If a URL is given, the function first checks whether the file has already been downloaded. The string should point to a CSV file containing the PCA weight matrix used in the projection (matrix multiplication). For more information, see the reference below. A rough sketch of how the scalar and pca matrices are applied follows this list.
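
As a rough illustration of what the scalar and pca arguments control (a sketch under assumptions, not the package's internal code): the embeddings are standardized with the scalar values and then projected onto the first n_dim columns of the PCA weight matrix. The assumed layout of the scalar matrix (one row of means, one row of standard deviations) and all object names below are illustrative only.

set.seed(1)
emb    <- matrix(rnorm(5 * 768), nrow = 5)             # toy 5 x 768 embedding matrix
scalar <- rbind(mean = rep(0, 768), sd = rep(1, 768))  # assumed layout: per-dimension means and SDs
pca    <- matrix(rnorm(768 * 20), nrow = 768)          # assumed 768 x 20 PCA weight matrix
n_dim  <- 10

standardized <- sweep(sweep(emb, 2, scalar["mean", ], "-"), 2, scalar["sd", ], "/")
reduced      <- standardized %*% pca[, seq_len(n_dim), drop = FALSE]
dim(reduced)  # 5 x 10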

Details

To use this method, please see and cite:
Ganesan, A. V., Matero, M., Ravula, A. R., Vu, H., & Schwartz, H. A. (2021). Empirical evaluation of pre-trained transformers for human-level NLP: The role of sample size and dimensionality. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT 2021), p. 4515.

See also the GitHub repository Empirical-Evaluation.

Value

Returns the embeddings with a reduced number of dimensions.

See Also

textEmbed

Examples

## Not run: 
# Reduce the dimensions of the precomputed example embeddings (word_embeddings_4)
embeddings <- textEmbedReduce(word_embeddings_4$texts)

## End(Not run)
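
A slightly fuller usage sketch, assuming the precomputed example embeddings word_embeddings_4 from the package; the chosen n_dim value and the local file paths are illustrative only.

## Not run: 
# Reduce to 20 dimensions using the default pre-trained scalar and PCA weights.
embeddings_20 <- textEmbedReduce(word_embeddings_4$texts, n_dim = 20)

# Hypothetical: supply locally stored copies of the scalar and PCA files
# instead of downloading them from the default URLs.
embeddings_local <- textEmbedReduce(
  word_embeddings_4$texts,
  n_dim = 20,
  scalar = "my_models/scalar.csv",
  pca = "my_models/rpca_roberta_768_D_20.csv"
)

## End(Not run)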
