docs_top_topics: Top-ranked topics for documents
In agoldst/dfrtopics: Tools for exploring topic models of text

docs_top_topics

R Documentation

Top-ranked topics for documents

Description

This function extracts the `n` most salient topics for each document from the document-topic matrix.

Usage

docs_top_topics(m, n, ...)

Arguments

`m`	`mallet_model` object
`n`	number of top topics to extract
`weighting`	a function applied to the document-topic matrix before ranking. By default, topic proportions are used; these rank the topics within a document in the same order as the raw weights.

Details

Here, as elsewhere, "saliency" can be defined in various ways. The simplest choice, and the default, is to rank topics by the proportion of a document they capture. Alternatively, we might want to penalize topics that are widespread across the whole corpus. TODO: actually implement the alternative weighting.
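The ranking described above can be sketched in base R on a toy matrix. This is an illustration under stated assumptions, not the package's implementation: `top_topics_sketch`, `row_proportions`, and `penalize_widespread` are hypothetical names, the counts are made up, and the penalty shown (dividing each topic's proportion by its mean proportion across the corpus) is only one possible reading of "penalize widespread topics".

```r
# Toy document-topic matrix: 3 documents (rows) x 4 topics (columns).
# The counts are made up for illustration.
dtm <- matrix(
    c(10, 2,  5,  3,
       1, 8,  1, 10,
       4, 3, 12,  1),
    nrow = 3, byrow = TRUE
)

# Default-style weighting: convert each row to topic proportions.
# Dividing a row by its sum leaves the ranking within that row unchanged.
row_proportions <- function(x) x / rowSums(x)

# Hypothetical alternative weighting (an assumption, not from dfrtopics):
# divide each topic's proportion by its mean proportion over all documents,
# so that topics common everywhere are ranked lower.
penalize_widespread <- function(x) {
    p <- row_proportions(x)
    sweep(p, 2, colMeans(p), "/")
}

# Hypothetical helper: the top n topics for every document, returned as a
# data frame with columns doc, topic, and weight.
top_topics_sketch <- function(dtm, n, weighting = row_proportions) {
    w <- weighting(dtm)
    rows <- lapply(seq_len(nrow(w)), function(d) {
        top <- order(w[d, ], decreasing = TRUE)[seq_len(n)]
        data.frame(doc = d, topic = top, weight = w[d, top])
    })
    do.call(rbind, rows)
}

result <- top_topics_sketch(dtm, n = 2)
result_penalized <- top_topics_sketch(dtm, n = 2,
                                      weighting = penalize_widespread)
```

Passing the weighting as a function keeps the ranking step the same whichever definition of saliency is chosen; only the matrix being ranked changes.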

Value

A data frame with three columns: `doc`, the numerical index of the document in `doc_ids(m)`; `topic`; and `weight`, the weight used in ranking (topic proportion, by default).
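As a rough sketch of how a result with the `doc`, `topic`, and `weight` columns might be used, the example below builds such a data frame by hand rather than from a real model. The vector `ids` is a made-up stand-in for the output of `doc_ids(m)`, and all values are invented for illustration.

```r
# Stand-in for doc_ids(m): a character vector of document ids (made up).
ids <- c("doc-a", "doc-b", "doc-c")

# A hand-built data frame with the three documented columns, as if n = 2.
top <- data.frame(
    doc    = c(1, 1, 2, 2, 3, 3),
    topic  = c(3, 1, 4, 2, 3, 1),
    weight = c(0.25, 0.50, 0.50, 0.40, 0.60, 0.20)
)

# Translate the numerical index in doc into the corresponding document id.
top$id <- ids[top$doc]

# Sort by document, then by decreasing weight, without assuming the rows
# already arrive in rank order.
top <- top[order(top$doc, -top$weight), ]

# Keep only the single highest-weighted topic for each document.
best <- top[!duplicated(top$doc), ]
```

Sorting before dropping duplicates means the sketch does not depend on the row order of the input.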
