text_dtm_prep: Prepare data for topic modelling
In DataS-DHSC/consultations: Re-usable functions for describing the responses to a public consultation or call for evidence

Calculate TF-IDF scores and create a Document-Term Matrix from your dataset.

1	text_dtm_prep(unnest_data, grouping_var, word_col = "word")

`unnest_data`	dataframe with unnested tokens
`grouping_var`	column used for determining what consistutes a document; with quotation marks (string) If each response is a row, add a column with row identifiers to be the grouping_var.
`word_col`	column name containing the unnested tokens

List containing the prepared data and a Document-Term Matrix

1 2	tidytext::unnest_tokens(dummy_response, 'word', colnames(dummy_response)[7]) %>% text_dtm_prep(., 'response_id')

DataS-DHSC/consultations documentation built on Jan. 28, 2022, 1:56 a.m.

DataS-DHSC/consultations index

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Description