Man pages for malsch/occupationCoding
Supervised Learning for Occupation Coding

accuracyAccuracy
asDocumentTermMatrixDocument-Term Matrix
calcAccurateAmongTopKCalculate aggregate properties for top k predicted categories
coding_index_excerptAn excerpt from the Gesamtberufsliste der BA
cosineSimilarityCosine Similarity
createDescriptivesDescribe occupational data
createSimilarityTableStringdistSimilarity Table with Coding index
createSimilarityTableSubstringSimilarity Table with Coding index
createSimilarityTableWordwiseStringdistWordwise Similarity Table with Coding index
expandPredictionResultsExpands predicted datasets to contain all allowed codes
frequent_phrasesSome job titles and job descriptions
kldb2010PlusFive2010 German Classification of Occupations (KldB 2010)
logLossLog loss
occupationsA selection of 250 coded occupational answers
plotAgreementRateVsProductionRatePlot agreement rate vs. production rate
plotReliabilityDiagramReliability Diagram
plotTruePredictionsVsFalsePredictionsPlot true predictions versus false predictions
predictCreecysMemoryBasedReasoningPredict codes with Creecys Memory-based reaoning model
predictGweonsNearestNeighborPredict codes with Gweons Nearest Neighbor Method
predictLogisticRegressionWithPenalizationPredict codes using a logistic regression model
predictSimilarityBasedReasoningPredict codes using a Similarity Based Probability Model
predictWithCodingIndexCode answers with a coding index
predictXgboostPredict codes using an extreme gradient boosted tree model
prepare_German_coding_index_Gesamtberufsliste_der_BAPrepares the Gesamtberufsliste der BA to be used with this...
produceResultsProduces summaries of predictive performance
removeFaultyAndUncodableAnswers_And_PrepareForAnalysisData Preparation
selectMaxProbMethodFrom multiple prediction methods, select the prediction...
sharpnessSharpness
stringPreprocessingPreprocess German occupational text
surveyCountsSubstringSimilarityAnonymized training data (substring similarity) to be used...
surveyCountsWordwiseSimilarityAnonymized training data (wordwise similarity) to be used...
trainCreecysMemoryBasedReasoningTrain Creecys Memory-based reaoning model
trainGweonsNearestNeighborTrains Gweons Nearest Neighbor model
trainLogisticRegressionWithPenalizationTrain a logistic regression model with penalization
trainSimilarityBasedReasoningTrain Similarity Based Probability Model
trainSimilarityBasedReasoning2Train Similarity Based Probability Model with anonymized...
trainXgboostTrain an extreme gradient boosted tree model
malsch/occupationCoding documentation built on March 14, 2024, 8:09 a.m.