Man pages for quanteda
Quantitative Analysis of Textual Data

apply_ifModify only documents matching a logical condition
as.character.corpusCoercion and checking methods for corpus objects
as.data.frame.dfmConvert a dfm to a data.frame
as.dfmCoercion and checking functions for dfm objects
as.dictionaryCoercion and checking functions for dictionary objects
as.fcmCoercion and checking functions for fcm objects
as.matrix.dfmCoerce a dfm to a matrix or data.frame
as.tokensCoercion, checking, and combining functions for tokens...
as.yamlConvert quanteda dictionary objects to the YAML format
attributes-setFunction extending base::attributes()
bootstrap_dfmBootstrap a dfm
cbind.dfmCombine dfm objects by Rows or Columns
char_selectSelect or remove elements from a character vector
char_tolowerConvert the case of character objects
check_classCheck object class for functions
check_dotsCheck arguments passed to other functions via ...
check_integerValidate input vectors
concatReturn the concatenator character from an object
convertConvert quanteda objects to non-quanteda formats
convert-wrappersConvenience wrappers for dfm convert
corpusConstruct a corpus object
corpus-classBase method extensions for corpus objects
corpus_groupCombine documents in corpus by a grouping variable
corpus_reshapeRecast the document units of a corpus
corpus_sampleRandomly sample documents from a corpus
corpus_segmentSegment texts on a pattern match
corpus_subsetExtract a subset of a corpus
corpus_trimRemove sentences based on their token lengths or a pattern...
data_char_sampletextA paragraph of text for testing various text-based functions
data_char_ukimmig2010Immigration-related sections of 2010 UK party manifestos
data_corpus_inauguralUS presidential inaugural address texts
data_dfm_lbgexampledfm from data in Table 1 of Laver, Benoit, and Garry (2003)
data_dictionary_LSD2015Lexicoder Sentiment Dictionary (2015)
data-internalInternal data sets
data-relocatedFormerly included data objects
dfmCreate a document-feature matrix
dfm2lsaConvert a dfm to an lsa "textmatrix"
dfm-classVirtual class "dfm" for a document-feature matrix
dfm_compressRecombine a dfm or fcm by combining identical dimension...
dfm_groupCombine documents in a dfm by a grouping variable
dfm-internalInternal functions for dfm objects
dfm_lookupApply a dictionary to a dfm
dfm_matchMatch the feature set of a dfm to given feature names
dfm_replaceReplace features in dfm
dfm_sampleRandomly sample documents from a dfm
dfm_selectSelect features from a dfm or fcm
dfm_sortSort a dfm by frequency of one or more margins
dfm_subsetExtract a subset of a dfm
dfm_tfidfWeight a dfm by _tf-idf_
dfm_tolowerConvert the case of the features of a dfm and combine
dfm_trimTrim a dfm using frequency threshold-based feature selection
dfm_weightWeight the feature frequencies in a dfm
dictionaryCreate a dictionary
dictionary-classdictionary class objects and functions
docfreqCompute the (weighted) document frequency of a feature
docnamesGet or set document names
docvarsGet or set document-level variables
escape_regexInternal function for 'select_types()' to escape regular...
expandSimpler and faster version of expand.grid() in base package
fcmCreate a feature co-occurrence matrix
fcm-classVirtual class "fcm" for a feature co-occurrence matrix
fcm_sortSort an fcm in alphabetical order of the features
featfreqCompute the frequencies of features
featnamesGet the feature labels from a dfm
field_systemShortcut functions to access or assign metadata
flatten_dictionaryFlatten a hierarchical dictionary into a list of character...
flatten_listInternal function to flatten a nested list
format_sparsityformat a sparsity value for printing
get_docvarsInternal function to extract docvars
get_object_versionGet the package version that created an object
groupsGrouping variable(s) for various functions
head.dfmReturn the first or last part of a dfm
indexLocate a pattern in a tokens object
info_tbbGet information on TBB library
is.collocationsCheck if an object is collocations
is_globCheck if patterns contains glob wildcard
is_indexedCheck if a glob pattern is indexed by index_types
is_regexCheck if a string is a regular expression
kwicLocate keywords-in-context
list2dictionaryInternal function to convert a list to a dictionary
lowercase_dictionary_valuesInternal function to lowercase dictionary values
make_docvarsInternal function to make new system-level docvars
make_metaInternal functions to create a list of the meta fields
matrix2dfmConverts a Matrix to a dfm
matrix2fcmConverts a Matrix to a fcm
merge_dictionary_valuesInternal function to merge values of duplicated keys
message_dfmPrint messages in dfm methods
message_errorReturn an error message
messagesMessage parameter documentation
message_tokensPrint messages in tokens methods
metaGet or set object metadata
meta_systemInternal function to get, set or initialize system metadata
msgConditionally format messages
names-quantedaSpecial handling for names of quanteda objects
ndocCount the number of documents or features
nest_dictionaryUtility function to generate a nested list
nsentenceCount the number of sentences
ntokenCount the number of tokens or types
object2idMatch quanteda objects against token types
object-buildersObject builders
patternPattern for feature, token and keyword matching
pattern2idMatch patterns against token types
phraseDeclare a pattern to be a sequence of separate patterns
pipePipe operator
print-methodsPrint methods for quanteda core objects
print.phrasesPrint a phrase object
quanteda_optionsGet or set package options for quanteda
quanteda-packageAn R package for the quantitative analysis of textual data
read_dict_functionsInternal functions to import dictionary files
reexportsObjects exported from other packages
remove_empty_keysUtility function to remove empty keys
replace_dictionary_valuesInternal function to replace dictionary values
resampleSample a vector
reshape_docvarsInternal function to subset or duplicate docvar rows
search_globSelect types without performing slow regex search
search_indexInternal function for 'select_types' to search the index...
serialize_tokensFunction to serialize list-of-character tokens
set_dfm_dimnamesInternal functions to set dimnames
spacyr-methodsExtensions for and from spacy_parse objects
sparsityCompute the sparsity of a document-feature matrix
split_valuesInternal function for special handling of multi-word...
summary.corpusSummarize a corpus
summary_metadataFunctions to add or retrieve corpus summary metadata
textmodelsModels for scaling and classification of textual data
textplotsPlots for textual data
textsGet or assign corpus texts [deprecated]
textstatsStatistics for textual data
tokenize_customCustomizable tokenizer
tokenize_internalquanteda tokenizers
tokensConstruct a tokens object
tokens_chunkSegment tokens object by chunks of a given size
tokens-classBase method extensions for tokens objects
tokens_compoundConvert token sequences into compound tokens
tokens_groupCombine documents in a tokens object by a grouping variable
tokens_lookupApply a dictionary to a tokens object
tokens_ngramsCreate n-grams and skip-grams from tokens
tokens_recompilerecompile a serialized tokens object
tokens_replaceReplace tokens in a tokens object
tokens_restoreRestore special tokens
tokens_sampleRandomly sample documents from a tokens object
tokens_segmentSegment tokens object by patterns
tokens_selectSelect or remove tokens from a tokens object
tokens_splitSplit tokens by a separator pattern
tokens_subsetExtract a subset of a tokens
tokens_tolowerConvert the case of tokens
tokens_trimTrim tokens using frequency threshold-based feature selection
tokens_wordstemStem the terms in an object
tokens_xptrMethods for tokens_xptr objects
topfeaturesIdentify the most frequent features in a dfm
typesGet word types from a tokens object
unlist_characterUnlist a list of character vectors safely
unlist_integerUnlist a list of integer vectors safely
valuetypePattern matching using valuetype
quanteda documentation built on Sept. 11, 2024, 6:08 p.m.