visualize_term_embeddings: Visualize Embeddings of Terms

Description Usage Arguments Value Examples

View source: R/plots.R

Description

Visualize given embeddings of terms of specified categories. The visualization is generated on the 2D plane by t-SNE algorithm or by PCA (two main components) and plotted with ggplot. If t-SNE cannot be generated (because for example perplexity is too large for the number of samples), there are plotted two main components from PCA. The plot can be optionally saved to the given PDF file.

Usage

1
2
visualize_term_embeddings(term_table, term_vectors, category,
  method = "tsne", save = FALSE, path_to_save)

Arguments

term_table

A data frame with columns:

term

A medical term (elements in this column do not have to be unique)

category

A category of the term

term_vectors

A matrix of embeddings of the terms

category

A character vector of categories of the terms to be visualized

method

One of "tsne" (default) or "pca" - a method of generating the plot

save

A logical indicating if the plot should be saved to the file

path_to_save

An optional string of the path to the target PDF file

Value

A generated plot of embeddings.

Examples

1
2
3
4
5
6
7
inter_term_vectors <- embed_terms(interviews, embedding_size = 10L,
  term_count_min = 1L)

visualize_term_embeddings(terms_categories, inter_term_vectors,
                          unique(as.character(terms_categories$category)))
visualize_term_embeddings(terms_categories, inter_term_vectors,
                          c("anatomic"), method = "pca")

adamgdobrakowski/memr documentation built on Sept. 4, 2021, 3:45 a.m.