get_cluster_fill_counts: Get Cluster Fill Counts

View source: R/cluster_profiles.R

get_cluster_fill_countsR Documentation

Get Cluster Fill Counts

Description

get_cluster_fill_counts() creates a data frame that shows the number of graphs in each cluster for each input document.

Usage

get_cluster_fill_counts(df)

Arguments

df

A data frame of cluster assignments from get_clusters_batch. The data frame has columns docname and cluster. Each row corresponds to a graph and lists the document from which the graph was obtained and the cluster to which that graph is assigned. Optionally, the data frame might also have writer and doc columns. If present, writer lists the writer ID of each document and doc is an identifier to distinguish between different documents from the same writer.

Value

A dataframe of cluster fill counts for each document in the input data frame.

Examples

docname <- c(rep("doc1", 20), rep("doc2", 20), rep("doc3", 20))
writer <- c(rep(1, 20), rep(2, 20), rep(3, 20))
doc <- c(rep(1, 20), rep(2, 20), rep(3, 20))
cluster <- sample(3, 60, replace = TRUE)
df <- data.frame(docname, writer, doc, cluster)
get_cluster_fill_counts(df)


CSAFE-ISU/handwriter documentation built on Feb. 8, 2025, 6:25 a.m.