PCC.contam: PCC Exploratory Methods: Contamination Detection

Description Usage Arguments Details Value Warning Note Author(s) References See Also Examples

View source: R/PCC.contam.R

Description

Detect possible contaminations by assessing the role of each witness in conflicting information between variant locations.

Usage

1
PCC.contam(x, omissionsAsReadings = FALSE, alternateReadings = FALSE, pauseAtPlot = FALSE)

Arguments

x

if alternateReadings = FALSE (default), a numeric matrix, with witnesses in columns, variant locations in rows, and readings coded by a number; if alternateReadings = TRUE, a character matrix, with witnesses in columns, variant locations in rows, and, in each cell, one or several readings, coded by numbers and separated by a comma (e.g. '1,2,3', if the witness has three different readings); or an object of class "pccConflicts" or "pccElimination".

omissionsAsReadings

logical; if TRUE, omissions are treated as variant readings (and taken into account in determining conflicts between variant locations or in computing severe disagreements between witnesses). Default: FALSE.

alternateReadings

logical; if TRUE, a witness can have multiple variants for a single variant location (contaminated manuscripts, editio variorum, …), encoded as comma-separated values. Default: FALSE.

pauseAtPlot

logical; if TRUE, the algorithms stops at each plot during the execution of PCC.contam (by setting graphical parameter ask = TRUE). Default: FALSE.

Details

To help assess the role of each witness in the conflicting information between variant locations, this function computes the number of conflicting variant locations when removing one of the witnesses, for each witness. If removing a witness makes the number of conflicting variant locations significantly drop, then contamination can be considered as plausible. Be aware that this function will be most efficient for contaminations limited to a single manuscript.

Value

An object of class "pccContam", a list containing

totalByMs

a numeric matrix, with, in rows, each variant locations, and, in columns, the number of conflicts and centrality in the full database, followed by the difference in total conflicts and centrality caused by the removal of each witness.

conflictsDifferences

a one row numeric matrix, containing, for each witness, the total decrease in conflicts caused by its removal from the computations

database

the original database used for the calculations

Warning

The execution of this command can be time-consuming for large databases.

Note

Additional contamination detection methods will be implemented in the future.

Author(s)

Jean-Baptiste Camps & Florian Cafiero

References

Camps, Jean-Baptiste. ‘Detecting Contaminations in Textual Traditions Computer Assisted and Traditional Methods’. Leeds, International Medieval Congress, 2013, unpublished paper, https://www.academia.edu/3825633/Detecting_Contaminations_in_Textual_Traditions_Computer_Assisted_and_Traditional_Methods.

See Also

PCC.Exploratory, PCC.equipollent.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
# load a data set
data("fournival")

# identify conflicts on a subset
x = PCC.conflicts(fournival)
# identify problematic variant locations
x = PCC.overconflicting(x, ask = FALSE, threshold = 0.06)
# eliminate them
x = PCC.elimination(x)
# examinate the rest of the problematic cases, to detect
# plausible contaminations
PCC.contam(x)

stemmatology documentation built on May 2, 2019, 5:10 a.m.