Man pages for koheiw/quanteda.core
Quantitative Analysis of Textual Data

as.corpus	coerce a compressed corpus to a standard corpus
as.data.frame.dfm	Convert a dfm to a data.frame
as.dfm	Coercion and checking functions for dfm objects
as.dictionary	Coercion and checking functions for dictionary objects
as.fcm	Coercion and checking functions for fcm objects
as.matrix.dfm	Coerce a dfm to a matrix or data.frame
as.matrix.textstat_simil_sparse	as.matrix method for textstat_simil_sparse
as.tokens	Coercion, checking, and combining functions for tokens...
as.yaml	Convert quanteda dictionary objects to the YAML format
attributes-set	Function extending base::attributes()
cbind.dfm	Combine dfm objects by Rows or Columns
char_tolower	Convert the case of character objects
convert	Convert quanteda objects to non-quanteda formats
convert-wrappers	Convenience wrappers for dfm convert
corpus	Construct a corpus object
corpus-class	Base method extensions for corpus objects
corpus_reshape	Recast the document units of a corpus
corpus_sample	Randomly sample documents from a corpus
corpus_segment	Segment texts on a pattern match
corpus_subset	Extract a subset of a corpus
corpus_trim	Remove sentences based on their token lengths or a pattern...
corpus_trimsentences	Remove sentences based on their token lengths or a pattern...
create	Function to assign multiple slots to a S4 object
data_char_sampletext	A paragraph of text for testing various text-based functions
data_char_ukimmig2010	Immigration-related sections of 2010 UK party manifestos
data_corpus_inaugural	US presidential inaugural address texts
data_dfm_lbgexample	dfm from data in Table 1 of Laver, Benoit, and Garry (2003)
data_dictionary_LSD2015	Lexicoder Sentiment Dictionary (2015)
data-internal	Internal data sets
data-relocated	Formerly included data objects
dfm	Create a document-feature matrix
dfm2lsa	Convert a dfm to an lsa "textmatrix"
dfm-class	Virtual class "dfm" for a document-feature matrix
dfm_compress	Recombine a dfm or fcm by combining identical dimension...
dfm_group	Combine documents in a dfm by a grouping variable
dfm-internal	Internal functions for dfm objects
dfm_lookup	Apply a dictionary to a dfm
dfm_match	Match the feature set of a dfm to given feature names
dfm_replace	Replace features in dfm
dfm_sample	Randomly sample documents or features from a dfm
dfm_select	Select features from a dfm or fcm
dfm_sort	Sort a dfm by frequency of one or more margins
dfm_subset	Extract a subset of a dfm
dfm_tfidf	Weight a dfm by _tf-idf_
dfm_tolower	Convert the case of the features of a dfm and combine
dfm_trim	Trim a dfm using frequency threshold-based feature selection
dfm_weight	Weight the feature frequencies in a dfm
diag2na	convert same-value pairs to NA in a textstat_proxy object
dictionary	Create a dictionary
dictionary-class	dictionary class objects and functions
docfreq	Compute the (weighted) document frequency of a feature
docnames	Get or set document names
docvars	Get or set document-level variables
escape_regex	Internal function for 'select_types()' to escape regular...
expand	Simpler and faster version of expand.grid() in base package
fcm	Create a feature co-occurrence matrix
fcm-class	Virtual class "fcm" for a feature co-occurrence matrix
fcm_sort	Sort an fcm in alphabetical order of the features
featfreq	Compute the frequencies of features
featnames	Get the feature labels from a dfm
field_system	Shortcut functions to access or assign metadata
flatten_dictionary	Flatten a hierarchical dictionary into a list of character...
format_sparsity	format a sparsity value for printing
friendly_class_undefined_message	Print friendly object class not defined message
generate_groups	Generate a grouping vector from docvars
get_docvars	Internal function to extract docvars
get_object_version	Get the package version that created an object
groups	Grouping variable(s) for various functions
head.corpus	Return the first or last part of a corpus
head.dfm	Return the first or last part of a dfm
head.textstat_proxy	Return the first or last part of a textstat_proxy object
is_glob	Check if patterns contains glob wildcard
is_indexed	Check if a glob pattern is indexed by index_types
is_regex	Internal function for 'select_types()' to check if a string...
keyness	Compute keyness (internal functions)
kwic	Locate keywords-in-context
list2dictionary	Internal function to convert a list to a dictionary
lowercase_dictionary_values	Internal function to lowercase dictionary values
make_meta	Internal functions to create a list for the meta attribute
matrix2dfm	Converts a Matrix to a dfm
matrix2fcm	Converts a Matrix to a fcm
merge_dictionary_values	Internal function to merge values of duplicated keys
message_error	Return an error message
meta	Get or set object metadata
metadoc	Get or set document-level meta-data
meta_system	Internal function to get, set or initialize system metadata
names-quanteda	Special handling for names of quanteda objects
ndoc	Count the number of documents or features
nest_dictionary	Utility function to generate a nested list
nsentence	Count the number of sentences
ntoken	Count the number of tokens or types
object-builders	Object compilers
pattern	Pattern for feature, token and keyword matching
pattern2id	Convert regex and glob patterns to type IDs or fixed patterns
pattern2list	Convert various input as pattern to a vector used in...
phrase	Declare a compound character to be a sequence of separate...
pipe	Pipe operator
print.phrases	Print a phrase object
print-quanteda	Print methods for quanteda core objects
quanteda.core-package	An R package for the quantitative analysis of textual data
quanteda_options	Get or set package options for quanteda
read_dict_functions	Internal functions to import dictionary files
reexports	Objects exported from other packages
remove_empty_keys	Utility function to remove empty keys
replace_dictionary_values	Internal function to replace dictionary values
reshape_docvars	Internal function to subset or duplicate docvar rows
sample_bygroup	Sample a vector by a group
search_glob	Select types without performing slow regex search
search_index	Internal function for 'select_types' to search the index...
serialize_tokens	Function to serialize list-of-character tokens
set_dfm_dimnames	Internal functions to set dimnames
set_dfm_slots-set	Set values to a dfm's S4 slots
set_fcm_slots-set	Set values to a fcm's S4 slots
sparsity	Compute the sparsity of a document-feature matrix
split_values	Internal function for special handling of multi-word...
summary.corpus	Summarize a corpus
textmodels	Models for scaling and classification of textual data
texts	Get or assign corpus texts
textstat_collocations	Identify and score multi-word expressions
textstat_entropy	Compute entropies of documents or features
textstat_keyness	Calculate keyness statistics
textstat_proxy	[Experimental] Compute document/feature proximity
textstat_proxy-class	textstat_simil/dist classes
textstat_simil	Similarity and distance computation between documents or...
textstat_summary	Summarize documents
tokenize_internal	quanteda tokenizers
tokens	Construct a tokens object
tokens_chunk	Segment tokens object by chunks of a given size
tokens_compound	Convert token sequences into compound tokens
tokens_group	Recombine documents tokens by groups
tokens_lookup	Apply a dictionary to a tokens object
tokens_ngrams	Create ngrams and skipgrams from tokens
tokens_recompile	recompile a serialized tokens object
tokens_replace	Replace tokens in a tokens object
tokens_sample	Randomly sample documents from a tokens object
tokens_segment	Segment tokens object by patterns
tokens_select	Select or remove tokens from a tokens object
tokens_split	Split tokens by a separator pattern
tokens_subset	Extract a subset of a tokens
tokens_tolower	Convert the case of tokens
tokens_tortl	[Experimental] Change direction of words in tokens
tokens_wordstem	Stem the terms in an object
topfeatures	Identify the most frequent features in a dfm
types	Get word types from a tokens object
unlist_character	Unlist a list of character vectors safely
unlist_integer	Unlist a list of integer vectors safely
unused_dots	Raise warning of unused dots
valuetype	Pattern matching using valuetype